Sunday, June 2, 2024

AMD scales its GPU accelerators for AI

During an opening keynote at COMPUTEX in Taiwan, AMD's CEO, Dr. Lisa Su, unveiled the company's next-generation Instinct MI325X GPU accelerator, which offers up to 288GB of HBM3E memory and is due for release later this year. AMD will pursue an annual cadence for new product releases.

Following the Instinct MI325X, the AMD Instinct MI350 series, powered by the new AMD CDNA 4 architecture, is expected to be available in 2025, bringing up to a 35x increase in AI inference performance compared to the AMD Instinct MI300 Series with AMD CDNA 3 architecture. The AMD Instinct MI400 series, based on the AMD CDNA “Next” architecture, is expected to arrive in 2026.

“The AMD Instinct MI300X accelerators continue their strong adoption from numerous partners and customers including Microsoft Azure, Meta, Dell Technologies, HPE, Lenovo and others, a direct result of the AMD Instinct MI300X accelerator's exceptional performance and value proposition,” said Brad McCredie, corporate vice president, Data Center Accelerated Compute, AMD. “With our updated annual cadence of products, we are relentless in our pace of innovation, providing the leadership capabilities and performance the AI industry and our customers expect to drive the next evolution of data center AI training and inference.”

Finally, AMD highlighted that demand for AMD Instinct MI300X accelerators continues to grow, with numerous partners and customers using the accelerators to power their demanding AI workloads, including:

  • Microsoft Azure using the accelerators for Azure OpenAI services and the new Azure ND MI300X V5 virtual machines.
  • Dell Technologies using MI300X accelerators in the PowerEdge XE9680 for enterprise AI workloads.
  • Supermicro providing multiple solutions with AMD Instinct accelerators.
  • Lenovo powering Hybrid AI innovation with the ThinkSystem SR685a V3.
  • HPE using them to accelerate AI workloads in the HPE Cray XD675.