Red Hat introduces unified platform for managing enterprise artificial intelligence deployments

Matt Hicks President and Chief Executive Officer
Matt Hicks President and Chief Executive Officer
0Comments

Red Hat has announced the launch of Red Hat AI Enterprise, an integrated platform designed to deploy and manage artificial intelligence (AI) models, agents, and applications across hybrid cloud environments. The new product is part of the company’s broader AI portfolio, which includes Red Hat AI Inference Server, Red Hat OpenShift AI, and Red Hat Enterprise Linux AI. Alongside this release, the company introduced Red Hat AI 3.3 with updates aimed at improving performance and expanding features across its suite of AI tools.

The introduction of Red Hat AI Enterprise aims to address challenges organizations face in moving from experimental projects to fully operationalized and governed AI systems. Many enterprises have struggled to move beyond pilot phases due to fragmented tools and inconsistent infrastructure. The new platform seeks to unify model and application lifecycles so that IT teams can manage AI as a standardized system within their organizations.

Joe Fernandes, vice president and general manager of the AI Business Unit at Red Hat, said: “For AI to deliver true business value, it must be operationalized as a core component of the enterprise software stack, not as a standalone silo. Red Hat AI Enterprise is designed to bridge the gap between infrastructure and innovation by providing a unified metal to agent platform. By integrating advanced tuning and agentic capabilities with the industry-leading foundation of Red Hat Enterprise Linux and Red Hat OpenShift, we are providing the complete stack – from the GPU-accelerated hardware to the models and agents that drive business logic. Additionally, with Red Hat AI 3.3 organizations can move beyond fragmented pilots to governed, repeatable and high-performance AI operations across the hybrid cloud.”

Red Hat states that its new solution provides core functions such as high-performance inference for generative models using vLLM inference engine technology; lifecycle management for governance; flexibility in deployment on various hardware platforms; support for NVIDIA’s Blackwell Ultra processors; AMD MI325X accelerators; and previewed support for Intel CPUs for cost-effective small language model inference.

A notable feature is integration with NVIDIA through co-engineering efforts on the Red Hat AI Factory, combining capabilities from both companies’ enterprise platforms.

The latest version also introduces Models-as-a-Service (MaaS) in technology preview mode—allowing internal users self-service access via an API gateway—and enhancements such as expanded model options available through OpenShift’s catalog. These include compressed versions of popular models like Mistral-Large-3 and Nemotron-Nano.

Additional improvements focus on security through a new Python Index repository delivering hardened versions of essential tools; real-time telemetry for observability into workload health; integrated NeMo Guardrails previewed for operational safety enforcement; improved Whisper speech processing speeds; geospatial data support; enhanced tool calling functionality; pooled GPU resource management with automatic checkpointing for training jobs.

Red Hat continues positioning itself as a provider focused on making enterprise-level adoption of advanced machine learning more accessible by offering end-to-end solutions built atop widely used open-source technologies like Linux and Kubernetes.

More information about these developments can be found in the official blog post. Details about joint work with NVIDIA are available via this link. For those interested in further updates or technical roadmaps related to these products, additional sessions are scheduled by the company.



Related

Marvin R. Ellison Chairman, President and Chief Executive Officer

Lowe’s Companies, Inc. to host first quarter 2026 earnings call on May 20

Lowe’s Companies, Inc. will hold its First Quarter 2026 Earnings Conference Call on May 20. The event will be webcast online with supplemental materials provided shortly before it begins.

Michael Ferris Senior Vice President and Chief Strategy Officer

Boomi and Red Hat announce collaboration on integrated agentic AI stack

Boomi and Red Hat have announced a new partnership aimed at simplifying how enterprises deploy large-scale artificial intelligence solutions. Their joint platform seeks to unify disparate tools into one system focused on secure governance and operational efficiency.

Brian Moynihan Chair of the Board and Chief Executive Officer

Bank of America Chair and CEO Brian Moynihan to participate in Bernstein conference May 27

Brian Moynihan, Chair and CEO of Bank of America, will participate in the Bernstein Strategic Decisions Conference on May 27. The bank has provided contact details for both investors and reporters seeking more information.

Trending

The Weekly Newsletter

Sign-up for the Weekly Newsletter from North Wake News.