Cartesia

Overview
News
Generative AI Applications?
Product stageSegments
Minimum Viable Product
?
Audio generation
?

Cartesia specializes in developing real-time multimodal AI models powered by state space models (SSMs). The company's SSM architecture offers significant advantages over traditional transformer models by scaling linearly with sequence length and enabling efficient, high-throughput inference. Unlike transformers that process every past token, SSMs update the model's state and discard previous tokens as they stream in, making them ideal for real-time applications.

In May 2024, Cartesia launched Sonic, an ultra-fast text-to-speech model that generates expressive, lifelike speech with less than 90 ms latency to first audio. Sonic operates locally without internet connection and includes features like emotion control, speed adjustment, and prompting capabilities. The platform outperforms existing market solutions in voice quality, stability, and accuracy as validated through blind human preference tests by third-party evaluators like Labelbox. The company built and optimized its own SSM inference stack to serve Sonic with low latency and high throughput at scale.

Cartesia's technology enables developers to create AI applications across various sectors including customer service, healthcare, robotics, gaming, transportation, education, and security. The company developed the widely cited Mamba architecture that demonstrates SSMs can match transformer performance using fewer computational resources.

Key customers and partnerships

Sonic API has been adopted by hundreds of customers ranging from startups to public companies for applications including customer service, debt collection, interview screening, voiceovers, and interactive character voices. The platform has gained particular traction among startups building real-time voice agents in the interactive voice response (IVR) market, which was valued at USD 6 billion as of December 2024.

Cartesia partnered with Crusoe Cloud to support the training of their models on H100 GPU clusters, and has expanded this partnership multiple times. The company's Sonic product is also available through the AWS Marketplace.


Sources

Disclaimer: This company profile has been generated using data obtained through automated web searches and advanced generative AI technology. While we strive to ensure the accuracy and reliability of our sources, auto-generated information could be outdated or inaccurate and should be verified independently.
HQ location:
1766 18th Street San Francisco CA USA
Founded year:
2023
Employees:
11-50
IPO status:
Private
Total funding:
USD 22.0 mn
Last Funding:
USD 22.0 mn (Series Unknown; Dec 2024)
Last valuation:
-
Key competitors
Filter by the segments to which the disruptor belongs
All Segmentsexpand
 
Loading...
Loading...
Loading...
Loading...
Product Overview
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
Product Metrics
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
Company profile
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
-
Loading...
Loading...
Loading...
Loading...
Funding data are powered by Crunchbase
arrow
menuarrow
Click here to learn more
Get a demo

By using this site, you agree to allow SPEEDA Edge and our partners to use cookies for analytics and personalization. Visit our privacy policy for more information about our data collection practices.