Home
News
Tech Grid
Data & Analytics
Data Processing Data Management Analytics Data Infrastructure Data Integration & ETL Data Governance & Quality Business Intelligence DataOps Data Lakes & Warehouses Data Quality Data Engineering Big Data
Enterprise Tech
Digital Transformation Enterprise Solutions Collaboration & Communication Low-Code/No-Code Automation IT Compliance & Governance Innovation Enterprise AI Data Management HR
Cybersecurity
Risk & Compliance Data Security Identity & Access Management Application Security Threat Detection & Incident Response Threat Intelligence AI Cloud Security Network Security Endpoint Security Edge AI
AI
Ethical AI Agentic AI Enterprise AI AI Assistants Innovation Generative AI Computer Vision Deep Learning Machine Learning Robotics & Automation LLMs Document Intelligence Business Intelligence Low-Code/No-Code Edge AI Automation NLP AI Cloud
Cloud
Cloud AI Cloud Migration Cloud Security Cloud Native Hybrid & Multicloud Cloud Architecture Edge Computing
IT & Networking
IT Automation Network Monitoring & Management IT Support & Service Management IT Infrastructure & Ops IT Compliance & Governance Hardware & Devices Virtualization End-User Computing Storage & Backup
Human Resource Technology Agentic AI Robotics & Automation Innovation Enterprise AI AI Assistants Enterprise Solutions Generative AI Regulatory & Compliance Network Security Collaboration & Communication Business Intelligence Leadership Artificial Intelligence Cloud
Finance
Insurance Investment Banking Financial Services Security Payments & Wallets Decentralized Finance Blockchain Cryptocurrency
HR
Talent Acquisition Workforce Management AI HCM HR Cloud Learning & Development Payroll & Benefits HR Analytics HR Automation Employee Experience Employee Wellness Remote Work
Marketing
AI Customer Engagement Advertising Email Marketing CRM Customer Experience Data Management Sales Content Management Marketing Automation Digital Marketing Supply Chain Management Communications Business Intelligence Digital Experience SEO/SEM Digital Transformation Marketing Cloud Content Marketing E-commerce
Consumer Tech
Smart Home Technology Home Appliances Consumer Health AI
Interviews
Anecdotes
Think Stack
Press Releases
Articles
Tech Events 2025
  • Home
  • /
  • News
  • /
  • AI
  • /
  • Agentic AI
  • /
  • Red Hat Launches AI 3 to Scale Distributed AI Inference in Production
  • Agentic AI

Red Hat Launches AI 3 to Scale Distributed AI Inference in Production


Red Hat Launches AI 3 to Scale Distributed AI Inference in Production
  • by: Source Logo
  • |
  • October 14, 2025

Red Hat, the world’s leading open source solutions provider, announced Red Hat AI 3, a major evolution of its enterprise AI platform. The platform integrates Red Hat AI Inference Server, RHEL AI, and Red Hat OpenShift AI, enabling organizations to simplify high-performance AI inference at scale and move workloads from proofs-of-concept to production more efficiently.

Quick Intel

  • Red Hat AI 3 simplifies distributed AI inference across hybrid and multi-vendor environments.

  • Supports any AI model on any hardware accelerator, from datacenters to cloud and edge.

  • Enables enterprise-scale production of large language models (LLMs) with llm-d and vLLM.

  • Unified platform supports collaborative AI workflows for IT and AI engineers.

  • Provides foundation for agentic AI systems and next-generation AI applications.

  • Partners include AMD, NVIDIA, and ARSAT for accelerated, scalable AI workloads.

From Training to Enterprise AI Inference

Red Hat AI 3 emphasizes the shift from AI model training to inference, delivering production-ready capabilities. Key innovations include:

  • llm-d: Distributed, Kubernetes-native LLM inference for intelligent scheduling and disaggregated serving.

  • vLLM integration: High-performance, scalable serving system for large AI models, including Mixture-of-Experts (MoE).

  • Well-lit Paths: Prescriptive deployment guidance to simplify model deployment on hybrid cloud environments.

  • Cross-platform support: Enables LLM inference across NVIDIA and AMD accelerators.

Unified Platform for Collaborative AI

Red Hat AI 3 unifies workflows for IT and AI engineers, providing tools to scale from experimentation to production:

  • Model as a Service (MaaS): IT teams can centrally manage and deliver models on demand.

  • AI Hub: Central catalog for exploring, deploying, and monitoring foundational AI models.

  • Gen AI Studio: Hands-on environment for prototyping generative AI applications with interactive model playgrounds.

  • Curated validated models, including gpt-oss, DeepSeek-R1, Whisper, and Voxtral Mini.

Building the Foundation for Next-Generation AI Agents

Red Hat AI 3 enables scalable agentic AI systems through:

  • Unified API layer based on Llama Stack for OpenAI-compatible LLM integration.

  • Model Context Protocol (MCP) adoption for seamless AI model interoperability.

  • Modular toolkit for model customization, including InstructLab, Docling, synthetic data generation, and evaluation hub for model validation.

Supporting Quotes

“With Red Hat AI 3, we are providing an enterprise-grade, open source platform that minimizes complexity, cost and control challenges. By bringing distributed inference with llm-d and a foundation for agentic AI, we enable IT teams to operationalize next-generation AI on their own terms.” — Joe Fernandes, VP & GM, AI Business Unit, Red Hat“As Red Hat brings distributed AI inference into production, AMD provides the high-performance foundation with EPYC™ processors and Instinct™ GPUs to deliver scalable enterprise AI.” — Dan McNamara, SVP & GM, Server and Enterprise AI, AMD“Red Hat OpenShift AI enabled ARSAT to move from concept to production in just 45 days, improving service while preserving data sovereignty.” — Mariano Greco, CEO, ARSAT“Scalable, high-performance inference is key to the next wave of generative and agentic AI. Red Hat AI 3 empowers teams to move swiftly from experimentation to production workloads at scale.” — Ujval Kapasi, VP, Engineering AI Frameworks, NVIDIA

About Red Hat, Inc.

Red Hat is the open hybrid cloud technology leader, delivering a trusted, consistent foundation for IT innovation and AI applications. Its portfolio includes cloud, developer, AI, Linux, automation, and application platform technologies, enabling enterprises to deploy applications anywhere—from datacenter to edge. Red Hat invests in open ecosystems, partnering with customers to build, connect, automate, secure, and manage IT environments, supported by consulting services and award-winning training.

  • Red HatAIEnterprise AIAgentic AIOpen Source AI
News Disclaimer
  • Share