AMD announced the "Instinct MI325X," a GPU designed for GenAI workloads in data centers. The MI325X, slated to launch in Q4 2024, claims to surpass NVIDIA's H200 chip in memory capacity, bandwidth, and peak theoretical performance (30% faster).
The Instinct MI325X offers up to 288 GB of HBM3e high-bandwidth memory and a memory bandwidth of 6 TB/ps. Further, it offers a peak theoretical throughput for 8-bit floating point (FP8) and 16-bit floating point (FP16) at 2.6 petaflops and 1.3 petaflops, respectively. Eight of these GPUs can fit into the Instinct MI325X platform, capable of running GenAI models with up to a trillion parameters.
Analyst QuickTake: The launch preceded AMD’s MI300 chip in December 2023 , claimed to be the highest-performing AI accelerator. While the Instinct MI325X aims to contend with NVIDIA's H200 chip (expected to be released in Q2 2024), its performance against the company’s top-of-the-line Blackwell GPUs (expected to launch in late 2024/early 2025) remains unclear.
By using this site, you agree to allow SPEEDA Edge and our partners to use cookies for analytics and personalization. Visit our privacy policy for more information about our data collection practices.