All Updates

All Updates

icon
Filter
Product updates
Galileo launches 'Galileo Luna' to evaluate performance of LLMs
Generative AI Infrastructure
Jun 6, 2024
This week:
Funding
GrayMatter Robotics raises USD 45 million in Series B funding to accelerate AI-powered robotics solutions
Smart Factory
Yesterday
Funding
Vecna Robotics raises USD 100 million in Series C funding; appoints new COO
Logistics Tech
Yesterday
Funding
Vecna Robotics raises USD 100 million in Series C funding; appoints new COO
Smart Factory
Yesterday
Funding
FairNow raises USD 3.5 million to advance AI governance solutions
Generative AI Infrastructure
Yesterday
Partnerships
Gravitics develops testing gauntlet for larger spacecraft in collaboration with NASA
Space Travel and Exploration Tech
Yesterday
M&A
knownwell acquires Alfie Health to integrate AI in primary and obesity care services
Telehealth
Yesterday
Funding
Pomelo Care raises USD 46 million in Series B funding to expand virtual maternal care
Telehealth
Yesterday
Funding
Isar Aerospace raises EUR 65 million, backed by NATO Innovation Fund
Space Travel and Exploration Tech
Yesterday
Product updates
Beyond Meat releases new Beyond Sausage, expanding its Beyond IV product line
Plant-based Meat
Yesterday
Product updates
Funding
SurrealDB raises USD 20 million in Series A; launches beta version of Surreal Cloud
Data Infrastructure & Analytics
Yesterday
Generative AI Infrastructure

Generative AI Infrastructure

Jun 6, 2024

Galileo launches 'Galileo Luna' to evaluate performance of LLMs

Product updates

  • Galileo has launched “Galileo Luna,” a suite of evaluation foundation models (EFMs) specifically designed to evaluate the performance of LLMs like OpenAI's GPT-4 and Google's Gemini Pro.

  • These Luna EFM models, which are LLMs, have been fine-tuned to detect hallucinations, data leakages, context quality errors, and malicious prompts. Benchmark tests showed Luna EFMs outperforming existing evaluation models by up to 20% in accuracy.

  • The company claims Luna EFMs are faster, more cost-effective, and more accurate than current methods, including human evaluations and other LLMs like GPT-4. The company CEO stated that Luna EFMs can evaluate responses at a scale necessary for enterprises, being 97% cheaper, 11x faster, and 18% more accurate than OpenAI’s GPT-3.5.

  • Analyst QuickTake: The development of Luna has been an important step for Galileo, a leading GenAI evaluation company since early 2021. In February 2024 , the company also introduced retrieval augmented generation (RAG) and Agent Analytics to enhance the creation and dependability of AI applications, aiming for more accurate and transparent AI responses. 

Contact us

Gain access to all industry hubs, market maps, research tools, and more
Get a demo
arrow
menuarrow

By using this site, you agree to allow SPEEDA Edge and our partners to use cookies for analytics and personalization. Visit our privacy policy for more information about our data collection practices.