Revolutionizing AI Training with AMD GPUs: A Milestone Achievement

Published November 25, 2025 By Juwan Chacko

Summary:
1. Zyphra, AMD, and IBM collaborated to test AMD’s GPUs for large-scale AI model training, resulting in the creation of ZAYA1.
2. ZAYA1 is a Mixture-of-Experts model built entirely on AMD GPUs and networking, offering a viable alternative to NVIDIA for scaling AI.
3. The model was trained on AMD’s Instinct MI300X chips, Pensando networking, and ROCm software on IBM Cloud infrastructure, showcasing competitive performance and cost-effectiveness.

Article:
Zyphra, working with AMD and IBM, spent a year evaluating whether AMD’s GPUs and platform can support large-scale AI model training. The result is ZAYA1, a Mixture-of-Experts foundation model that challenges the industry’s reliance on NVIDIA for scaling AI operations.

ZAYA1 was trained on AMD’s Instinct MI300X chips, Pensando networking, and ROCm software, all deployed on IBM Cloud infrastructure. Notably, Zyphra used a conventional setup that resembles an ordinary enterprise cluster, only without any NVIDIA components. The result gives businesses a viable second option for expanding their AI capacity without compromising on performance.

ZAYA1 is reported to perform on par with established open models in reasoning, mathematics, and coding, and to surpass them in some areas. Its architecture combines compressed attention, a refined routing system, and residual scaling, which lets it compete with larger peers such as Qwen3-4B and Gemma3-12B. Because a Mixture-of-Experts model activates only a subset of its experts for each token, the structure also keeps memory use manageable during inference and reduces serving costs.
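To make the Mixture-of-Experts idea concrete, here is a minimal sketch of generic top-k expert routing for a single token. It is not ZAYA1's router, whose details the article does not give; the function name `route_token`, the choice of `top_k=2`, and the example scores are all assumptions made for illustration.

```python
import math

def route_token(router_logits, top_k=2):
    """Return [(expert_index, weight), ...] for the top_k experts of one token.

    Generic textbook-style routing, not the routing scheme used by ZAYA1.
    """
    # Softmax over all experts (subtract the max for numerical stability).
    peak = max(router_logits)
    exps = [math.exp(x - peak) for x in router_logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Keep only the top_k most probable experts; the rest do no work for this token.
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    # Renormalize so the selected weights sum to 1.
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]

# Hypothetical router scores for one token across 4 experts.
assignment = route_token([0.1, 2.0, -1.0, 1.5], top_k=2)
print(assignment)  # experts 1 and 3 are selected; experts 0 and 2 stay idle
```

Only the selected experts run for that token, which is why an MoE model can hold many parameters while spending compute on a fraction of them per token.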

Moving to AMD GPUs meant adapting mature NVIDIA-based workflows to ROCm. Zyphra tuned model dimensions, GEMM patterns, and microbatch sizes to fit the compute ranges in which the MI300X performs best. The team also tuned storage so that it would not slow down training runs.
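As a rough illustration of what aligning model dimensions and microbatch sizes to hardware can involve, the sketch below pads a dimension up to a tile multiple and estimates the work in one matrix multiply. The tile size of 256, the hidden size of 2000, and the sequence length are assumptions chosen for the example; they are not MI300X specifications or Zyphra's actual settings.

```python
def round_up(value, multiple):
    """Round value up to the nearest multiple (e.g. a preferred GEMM tile size)."""
    return ((value + multiple - 1) // multiple) * multiple

def gemm_flops(m, n, k):
    """Floating-point operations for an (m x k) @ (k x n) matrix multiply."""
    return 2 * m * n * k

TILE = 256       # assumed tile size, for illustration only
SEQ_LEN = 1024   # hypothetical sequence length
hidden = 2000    # hypothetical hidden size that is not tile-aligned

padded_hidden = round_up(hidden, TILE)
print(padded_hidden)  # 2048

# Compare how much work each candidate microbatch size puts into a single GEMM.
for microbatch in (1, 2, 4):
    tokens = microbatch * SEQ_LEN
    flops = gemm_flops(tokens, padded_hidden, padded_hidden)
    print(microbatch, tokens, flops)
```

In practice the right values come from profiling on the target hardware; the point of the sketch is only that dimensions and microbatch size together determine the shape of each GEMM.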

Keeping a training cluster healthy over long runs was another challenge, which Zyphra addressed with its Aegis service. Aegis monitors logs and system metrics so that the team can identify and fix issues quickly, which improves job uptime and reduces the operational burden. Distributed checkpointing made saves faster, so training was interrupted less often.
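The general idea behind distributed checkpointing is that each worker writes its own shard in parallel instead of funnelling everything through one writer. The toy sketch below uses only the Python standard library, with threads standing in for training ranks; the function names and file layout are hypothetical and do not describe Zyphra's implementation.

```python
import json
import tempfile
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path

def shard_state(state, num_ranks):
    """Split a flat state dict into num_ranks shards, round-robin over sorted keys."""
    shards = [{} for _ in range(num_ranks)]
    for i, key in enumerate(sorted(state)):
        shards[i % num_ranks][key] = state[key]
    return shards

def save_shard(directory, rank, shard):
    """Write one shard to a temporary file, then rename it into place."""
    path = Path(directory) / f"shard_{rank}.json"
    tmp = path.with_suffix(".tmp")
    tmp.write_text(json.dumps(shard))
    tmp.replace(path)  # a crash mid-write leaves only a .tmp file behind
    return path

def save_checkpoint(directory, state, num_ranks):
    """Save all shards concurrently; threads stand in for separate ranks."""
    shards = shard_state(state, num_ranks)
    with ThreadPoolExecutor(max_workers=num_ranks) as pool:
        futures = [pool.submit(save_shard, directory, r, s) for r, s in enumerate(shards)]
        return [f.result() for f in futures]

def load_checkpoint(directory):
    """Merge every shard file back into a single state dict."""
    merged = {}
    for path in sorted(Path(directory).glob("shard_*.json")):
        merged.update(json.loads(path.read_text()))
    return merged

# Hypothetical, tiny training state.
state = {"layer0.weight": [0.1, 0.2], "layer1.weight": [0.3], "step": 1200}
with tempfile.TemporaryDirectory() as ckpt_dir:
    paths = save_checkpoint(ckpt_dir, state, num_ranks=2)
    restored = load_checkpoint(ckpt_dir)
print(restored == state)
```

Real training jobs would rely on their framework's checkpointing tools and on actual multi-node storage; this sketch only shows the shard, write in parallel, and merge pattern.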

The ZAYA1 training milestone underscores the maturity of AMD’s ecosystem for large-scale model development and offers a compelling alternative to NVIDIA. Moving entirely away from NVIDIA clusters may not be practical, but using AMD for specific stages can add memory capacity and training volume without significant disruption. Organizations can therefore benefit from a flexible approach to AI procurement that draws on multiple vendors to optimize performance and scalability.

© 2025 – siliconflash.com – All rights reserved
