All Updates

All Updates

icon
Filter
Product updates
Meta releases new AI research models for multimodal tasks
Generative AI Applications
Jun 18, 2024
This week:
Partnerships
Qualcomm and Google partner to develop AI-driven automotive solutions
Auto Tech
Yesterday
Product updates
Meta AI releases LayerSkip to accelerate inference in LLMs
Generative AI Infrastructure
Yesterday
Funding
Freeform secures funding from NVIDIA's NVentures
Additive Manufacturing
Oct 22, 2024
Product updates
Flexxbotics announces compatibility with LMI Technologies for quality inspection
Smart Factory
Oct 22, 2024
Funding
Oxla raises USD 11 million in seed funding to drive commercialization
Data Infrastructure & Analytics
Oct 22, 2024
Product updates
Cohesity enhances Gaia, its AI assistant, with visual data exploration and expanded data sources
Data Infrastructure & Analytics
Oct 22, 2024
Product updates
Finzly launches FedNow service through BankOS platform in AWS marketplace
FinTech Infrastructure
Oct 22, 2024
Product updates
Runway launches Act-One for AI facial expression motion capture
Generative AI Applications
Oct 22, 2024
Product updates
Ideogram launches Canvas for image manipulation and generation
Generative AI Applications
Oct 22, 2024
Partnerships
UiPath partners with Inflection AI to integrate AI solutions for enterprises
Generative AI Applications
Oct 22, 2024
Generative AI Applications

Generative AI Applications

Jun 18, 2024

Meta releases new AI research models for multimodal tasks

Product updates

  • Meta's Fundamental AI Research (FAIR) team has announced the public release of several new AI models and tools for researchers. These include image-to-text and text-to-music generation models, a multi-token prediction model, and a technique for detecting AI-generated speech. The models are being released under various licenses ranging from research-only to commercial.

  • Chameleon, which was publicly released, is a family of mixed-modal models that can process and generate text and images. The Chameleon 7 billion and 34 billion models, released under a research-only license, can reportedly handle tasks involving visual and textual understanding, such as image captioning. Another model, JASCO, is designed for text-to-music generation and allows users to control aspects like chords, drums, and melodies through text inputs.

  • A multi-token prediction model for code completion is also being released under a non-commercial, research-only license. Meta is also releasing AudioSeal, an audio watermarking technique for detecting AI-generated speech within longer audio snippets.

Contact us

Gain access to all industry hubs, market maps, research tools, and more
Get a demo
arrow
menuarrow

By using this site, you agree to allow SPEEDA Edge and our partners to use cookies for analytics and personalization. Visit our privacy policy for more information about our data collection practices.