Etched.ai is a chip startup founded by Harvard dropouts Gavin Uberti and Chris Zhu, focusing on developing specialized artificial intelligence (AI) accelerator chips dedicated to large language model (LLM) acceleration. Their aim is to dramatically enhance the performance and reduce the cost of running AI models like GPT-4 by designing purpose-built chips optimized for the transformer architecture used in LLMs.
Etched.ai's first chip, codenamed Sohu, is designed to have substantial memory and support large batch sizes, enabling it to achieve 140 times higher throughput per dollar compared to an Nvidia H100 PCIe card when processing GPT-3 tokens. The company claims that this significant performance uplift is primarily driven by impressive throughput capabilities rather than a substantial cost differential.
By specializing in LLM acceleration, Etched.ai aims to provide a more efficient and cost-effective solution for organizations seeking to leverage the power of large language models.
By using this site, you agree to allow SPEEDA Edge and our partners to use cookies for analytics and personalization. Visit our privacy policy for more information about our data collection practices.