Machine Learning Infrastructure

Jun 14, 2023

OctoML launches self-optimizing compute service for AI

Product updates

  • OctoML, an ML model optimization and deployment platform, has launched OctoAI, the latest iteration of its service. This self-optimizing infrastructure service is designed to help companies build and deploy AI applications, with a particular emphasis on generative AI.

  • OctoAI is a managed compute service that lets businesses take existing open-source models, refine them with their own data, and host the resulting custom models. Users state their priorities, such as latency or cost, and OctoAI automatically selects the appropriate hardware.

  • The service also optimizes these models automatically, yielding further cost savings and performance improvements, and determines the most suitable platform for running them, whether NVIDIA's GPUs or AWS's Inferentia machines.

  • The new platform also provides access to a library of popular open-source generative AI models, including Stable Diffusion 2.1, Dolly v2, LLaMA 65B, Whisper, Flan-UL2, and Vicuna, which developers can use to build their AI applications.
