GMI Cloud provides GPU cloud infrastructure and containerization services specialized for GenAI and high-performance computing workloads across five data centers (October 2024).
Leveraging strong GPU supply chain expertise and partnerships with NVIDIA and Realtek, the company offers fast access to top GPU models through the NVIDIA Partner Network. Its Taiwan-based data center and APAC location further optimize GPU delivery, reducing shipping times and costs compared to US competitors. Additionally, the company distinguishes itself from competitors by providing distinctive features such as customized private cloud services and built-in support for NVIDIA NIM, which simplifies integration with NVIDIA hardware and software.
The platform provides on-demand access to NVIDIA H100 GPUs with 80GB VRAM via its Kubernetes-based Cluster Engine, which utilizes NVLink and InfiniBand for high-speed GPU clustering. It also supports multiple ML frameworks, enabling customizable deployment environments. GMI also provides AI consulting services, including those for model training, fine-tuning, and scaling.
Key customers and partnerships
As of October 2024, GMI Cloud served over a dozen clients in the telecom, healthcare, and research industries, including Digital Ocean, Headline, and UbiOps.
In October 2024, the company partnered with Thailand energy firm Banpu to power GMI Cloud and Taiwanese electronics manufacturer Wistron to co-develop products with the startup.
By using this site, you agree to allow SPEEDA Edge and our partners to use cookies for analytics and personalization. Visit our privacy policy for more information about our data collection practices.