All updates

Product updates
FuriosaAI launches RNGD AI inference chip for high-performance LLM and multimodal model inference
Generative AI Infrastructure
Aug 26, 2024
This week:
Orijin raises seed funding for product development and expansion
Conservation Tech
Yesterday
Orijin raises seed funding for product development and expansion
Smart Farming
Yesterday
Product updates
Perplexity adds OpenAI o1 model and develops homepage widgets
Foundation Models
Yesterday
Partnerships
Nikola partners with WattEV to supply 22 BEVs
Truck Industry Tech
Yesterday
M&A
Zebra Technologies to acquire Photoneo from Photoneo Brightpick Group for undisclosed sum
Logistics Tech
Yesterday
Funding
Scope Technologies increases private placement offering to CAD 1.8 million
Machine Learning Infrastructure
Yesterday
Funding
Firefly Neuroscience raises USD 12.4 million in growth funding to commercialize technology
AI Drug Discovery
Dec 31, 2024
Listing
Nasdaq affirms delisting of OpGen after failed appeal
Precision Medicine
Dec 31, 2024
Funding
Rumble raises USD 775 million in strategic investment to support growth
Creator Economy
Dec 31, 2024
Product updates
InstaDeep releases open-source genomics AI model Nucleotide Transformers
Foundation Models
Dec 31, 2024
Generative AI Infrastructure

Aug 26, 2024

FuriosaAI launches RNGD AI inference chip for high-performance LLM and multimodal model inference

Product updates

  • FuriosaAI, an AI semiconductor company, has launched RNGD, an AI accelerator chip for data center inference. The chip is designed for high-performance LLM and multimodal model inference.

  • RNGD features a non-matmul, Tensor Contraction Processor (TCP)-based architecture, a robust compiler optimized for TCP, and 48 GB of HBM3 memory. The chip has a TDP of 150 W, compared with 1,000+ W for leading GPUs, and can deliver throughput of 2,000 to 3,000 tokens per second for models with around 10 billion parameters.

  • RNGD is currently sampling to early-access customers, with broader availability expected in early 2025. FuriosaAI claims that RNGD offers a perfect balance of efficiency, programmability, and performance. The company describes RNGD as a sustainable and accessible AI computing solution that meets the industry's real-world inference needs and can run models such as Llama 3.1 8B efficiently on a single card.
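
To put the quoted figures in perspective, the short sketch below works through two back-of-the-envelope numbers: throughput per watt and the memory needed for model weights. It is a rough illustration, not a benchmark. It assumes the chip draws its full 150 W TDP while serving (actual draw may be lower) and that weights are stored at 2 bytes per parameter (16-bit precision). Neither assumption comes from FuriosaAI, and the function names are hypothetical.

```python
# Figures quoted in the brief.
TDP_WATTS = 150
THROUGHPUT_TOKENS_PER_S = (2_000, 3_000)  # for models of around 10 billion parameters
HBM3_CAPACITY_GB = 48

# Assumptions for illustration only (not stated in the brief).
ASSUMED_PARAMS = 8e9           # an 8-billion-parameter model
ASSUMED_BYTES_PER_PARAM = 2    # 16-bit weights


def tokens_per_joule(tokens_per_s: float, watts: float) -> float:
    """Tokens generated per joule, assuming the chip runs at the given power."""
    return tokens_per_s / watts


def weights_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


low, high = (tokens_per_joule(t, TDP_WATTS) for t in THROUGHPUT_TOKENS_PER_S)
footprint = weights_gb(ASSUMED_PARAMS, ASSUMED_BYTES_PER_PARAM)

print(f"Tokens per joule at full TDP: {low:.1f} to {high:.1f}")
print(f"Weights footprint: {footprint:.0f} GB of {HBM3_CAPACITY_GB} GB HBM3")
print(f"Weights fit on one card: {footprint <= HBM3_CAPACITY_GB}")
```

Under these assumptions, the weights of an 8-billion-parameter model take up about a third of the card's memory, which is consistent with the single-card claim. The estimate ignores the KV cache and activations, which also consume memory during inference.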
