Posted inAI Amazon RSS A post from Amazon AWS : Amazon SageMaker launches the updated inference optimization toolkit for generative AI Posted by By 3. December 2024 Today, Amazon SageMaker is excited to announce updates to the inference optimization toolkit, providing new…
Posted inAI Amazon RSS A post from Amazon AWS : Syngenta develops a generative AI assistant to support sales representatives using Amazon Bedrock Agents Posted by By 3. December 2024 This post was written with Zach Marston and Serg Masis from Syngenta. Syngenta and AWS…
Posted inAI Amazon RSS A post from Amazon AWS : Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker Posted by By 3. December 2024 This post is co-written with Abhishek Sawarkar, Eliuth Triana, Jiahong Liu and Kshitiz Gupta from…
Posted inAI Amazon RSS A post from Amazon AWS : Unlock cost savings with the new scale down to zero feature in SageMaker Inference Posted by By 3. December 2024 Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon…
Posted inAI Amazon RSS A post from Amazon AWS : Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference Posted by By 3. December 2024 Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability…
Posted inAI Amazon RSS A post from Amazon AWS : Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – part 1 Posted by By 3. December 2024 The generative AI landscape has been rapidly evolving, with large language models (LLMs) at the…
Posted inAI Amazon RSS A post from Amazon AWS : Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – Part 2 Posted by By 3. December 2024 In Part 1 of this series, we introduced Amazon SageMaker Fast Model Loader, a new capability…
Posted inAI Amazon RSS A post from Amazon AWS : Fast and accurate zero-shot forecasting with Chronos-Bolt and AutoGluon Posted by By 2. December 2024 Chronos-Bolt is the newest addition to AutoGluon-TimeSeries, delivering accurate zero-shot forecasting up to 250 times…
Posted inAI Amazon RSS A post from Amazon AWS : How Amazon Finance Automation built a generative AI Q&A chat assistant using Amazon Bedrock Posted by By 2. December 2024 Today, the Accounts Payable (AP) and Accounts Receivable (AR) analysts in Amazon Finance operations receive…
Posted inAI Amazon RSS A post from Amazon AWS : Cohere Rerank 3.5 is now available in Amazon Bedrock through Rerank API Posted by By 1. December 2024 We are excited to announce the availability of Cohere’s advanced reranking model Rerank 3.5 through…