
Oracle and AMD have expanded their collaboration to deliver high-performance cloud infrastructure tailored for the next generation of AI workloads. Oracle will become one of the first hyperscalers to offer an AI supercomputer cluster equipped with the newly launched AMD Instinct™ MI355X GPUs on Oracle Cloud Infrastructure (OCI). The move aims to give enterprises advanced compute capabilities with significantly better price-performance for training and inference of large-scale AI models.
Oracle's latest infrastructure upgrade on OCI brings zettascale AI clusters powered by AMD Instinct MI355X GPUs. With support for up to 131,072 GPUs, these clusters are engineered for enterprises needing ultra-scalable compute to train and deploy complex AI and agentic applications.
"To support customers that are running the most demanding AI workloads in the cloud, we are dedicated to providing the broadest AI infrastructure offerings," said Mahesh Thiagarajan, Executive Vice President, Oracle Cloud Infrastructure. "AMD Instinct GPUs, paired with OCI's performance, advanced networking, flexibility, security, and scale, will help our customers meet their inference and training needs for AI workloads and new agentic applications."
The AMD Instinct MI355X is built for cutting-edge AI tasks. Its features make it possible for customers to execute large language models directly in memory with greater speed and efficiency.
OCI will be the first to deploy AMD Pollara AI NICs on backend networks. The NICs provide advanced RDMA over Converged Ethernet (RoCE) features, such as programmable congestion control, and comply with Ultra Ethernet Consortium (UEC) standards for ultra-low latency networking.
On the orchestration side, OCI's high-frequency AMD Turin CPUs, with up to 3 TB of system memory, serve as head nodes that handle data processing and job management, helping to keep GPU utilization and performance high.
"AMD and Oracle have a shared history of providing customers with open solutions to accommodate high performance, efficiency, and greater system design flexibility," said Forrest Norrod, Executive Vice President and General Manager, Data Center Solutions Business Group, AMD. "The latest generation of AMD Instinct GPUs and Pollara NICs on OCI will help support new use cases in inference, fine-tuning, and training, offering more choice to customers as AI adoption grows."
The AMD ROCm software stack will be integral to this rollout, supporting a range of open-source tools, libraries, and compilers. Customers will be able to move existing workloads to OCI without vendor lock-in, allowing greater architectural freedom and easier AI and HPC development.
Oracle and AMD’s deepened alliance reflects growing enterprise demand for scalable, efficient infrastructure to power today’s rapidly evolving AI landscape. The zettascale cluster is positioned to be a cornerstone for businesses seeking cutting-edge performance, flexibility, and speed for AI and next-gen applications.
Oracle offers integrated suites of applications plus secure, autonomous infrastructure in the Oracle Cloud. For more information about Oracle, please visit us at oracle.com.