All Updates

All Updates

icon
Filter
Product updates
Amazon SageMaker adds new capabilities for faster auto-scaling of AI models
Generative AI Infrastructure
Jul 25, 2024
This week:
Funding
Immatics raises USD 150 million in public funding
Precision Medicine
Yesterday
Funding
Immatics raises USD 150 million in public funding
Cell & Gene Therapy
Yesterday
Product updates
Asus launches servers with AMD EPYC 9005-series CPUs and MI325X accelerators to enhance product offering
Generative AI Infrastructure
Yesterday
Partnerships
A2Z signs agreement with Trixo for smart cart services in Mexico and Central America
Automated Stores
Yesterday
Product updates
Nikon releases additional powder feeder for metal additive manufacturing
Additive Manufacturing
Oct 10, 2024
Product updates
BigRep launches industrial filament dryer ‘DRYCON'
Additive Manufacturing
Oct 10, 2024
Industry news
Titomic joins DNV's project to standardize additive manufacturing
Additive Manufacturing
Oct 10, 2024
Product updates
Google rolls out ‘Imagen 3’ image-generation model for Gemini users
Foundation Models
Oct 10, 2024
Funding
BaCta raises EUR 3.3 million in pre-seed funding to produce bio-synthetic rubber
Bio-based Materials
Oct 10, 2024
Partnerships
XtalPi's Ailux Biologics licenses AI platform XtalFold to Janssen for biologics discovery and engineering
AI Drug Discovery
Oct 10, 2024
Generative AI Infrastructure

Generative AI Infrastructure

Jul 25, 2024

Amazon SageMaker adds new capabilities for faster auto-scaling of AI models

Product updates

  • Amazon SageMaker has introduced a feature that speeds up the scalability of GenAI models, reducing their adaptation time. It automates the scaling process for these models based on demand.

  • This new feature comes with the ability to deploy single or multiple models using SageMaker inference components. Its advanced routing strategies ensure effective load balance to the underlying instances of an endpoint. It incorporates auto-scaling, which optimizes the number of instances in use and the quantity of model copies deployed, reacting to real-time changes in demand.

  • Amazon claims that the new feature brings numerous benefits including reducing infrastructure costs, enhancing throughput, and minimizing latency. The incorporation of high-resolution metrics  increases the speed of auto scaling, reduces detection time and improves the overall scale-out time of gen AI models, thereby optimizing performance and cost-efficiency as demand fluctuates.

  • Analyst quicktake: Amazon SageMaker is consistently adding new features to lead the integrated solution space for LLMs. In June 2024 , it launched a fully managed MLflow feature to improve ML workflows, offering comprehensive experiment tracking, evaluations, and a model registry across various SageMaker components.

Contact us

Gain access to all industry hubs, market maps, research tools, and more
Get a demo
arrow
menuarrow

By using this site, you agree to allow SPEEDA Edge and our partners to use cookies for analytics and personalization. Visit our privacy policy for more information about our data collection practices.