To that finish, AMD has launched the MI325X with better reminiscence capability and bandwidth than the Intuition MI300X, which launched final December. The MI325X is predicated on the identical CDNA 3 GPU structure, in contrast with 192GB of HBM3 high-bandwidth reminiscence and 5.3 TB/s in reminiscence bandwidth within the MI300X.
AMD stated AI inference efficiency within the MI325X gives 40% sooner throughput with an 8-group, 7-billion-parameter Mixtral mannequin over Nvidia’s top-of-the-line Hopper H200, 30% decrease latency with a 7-billion-parameter Mixtral mannequin, and 20% decrease latency with a 70-billion-parameter Llama 3.1 mannequin.
AMD is planning an eight-node platform for subsequent yr, much like Nvidia’s DGX Pods. With eight MI325X GPUs related over AMD’s Infinity Material, the platform will supply 2TB of HBM3e reminiscence, 48 TB/s of complete reminiscence bandwidth, 20.8 petaflops of FP8 efficiency, and 10.4 petaflops of FP16 efficiency, AMD stated.
The MI325X will start delivery in techniques from Dell Applied sciences, Lenovo, Supermicro, Hewlett Packard Enterprise, Gigabyte, and several other different server distributors beginning within the first quarter of subsequent yr, the corporate stated.
Learn extra processor information
- Enfabrica seems to be to speed up GPU communication: Enfabrica’s Accelerated Compute Material SuperNIC (ACF-S) silicon is designed to ship greater bandwidth, better resiliency, decrease latency and better programmatic management to information middle operators operating data-intensive AI and HPC.
- Nvidia claims effectivity features of as much as 100,000X: Nonetheless, the chipmaker’s dramatic declare for the efficiency features of its GPUs is over a 10-year span, and solely applies to at least one kind of calculation.
- Intel launches Xeon 6 processors and Gaudi 3 AI accelerators: Intel has formally launched its subsequent Xeon 6 server processors in addition to the Gaudi 3 AI accelerators, making some fairly large boasts within the course of.
- Inflection AI shifts to Intel Gaudi 3, difficult Nvidia’s AI chip lead: The announcement follows IBM’s latest partnership with Intel, signaling a rising curiosity in Intel’s AI {hardware}.
- Intel’s Altera spinout launches FPGA merchandise, software program: Altera CEO Sandra Rivera shares ‘large, audacious, formidable aim’ to dominate FPGA market.