Red Hat and Google Cloud expand partnership to advance enterprise-grade AI

Red Hat and Google Cloud expand partnership to advance enterprise-grade AI
Matt Hicks, President and Chief Executive Officer — Red Hat, Inc.
0Comments

Red Hat and Google Cloud have announced an expanded partnership aimed at advancing artificial intelligence (AI) for enterprise applications. The collaboration will unite Red Hat’s open source technologies with Google Cloud’s infrastructure and Google’s Gemma family of open models.

Brian Stevens, senior vice president and Chief Technology Officer at Red Hat, stated, “With this extended collaboration, Red Hat and Google Cloud are committed to driving groundbreaking AI innovations with our combined expertise and platforms.” He highlighted the integration of vLLM and Red Hat’s technologies with Google Cloud as a means to equip developers with resources for building high-performing AI solutions.

The partnership will focus on several initiatives: launching the llm-d open source project with Google as a founding contributor, enabling support for vLLM on Google Cloud TPUs and GPU-based virtual machines to enhance AI inference, delivering Day 0 support for vLLM on Gemma 3 model distributions, supporting Red Hat AI Inference Server on Google Cloud, and contributing to Google’s Agent2Agent protocol.

Mark Lohmeyer, vice president and general manager of AI and Computing Infrastructure at Google Cloud, remarked, “The deepening of our collaboration with Red Hat is driven by our shared commitment to foster open innovation and bring the full potential of AI to our customers.”

The initiative also includes Red Hat becoming an early tester for Google’s Gemma 3 model. This aims to deliver immediate support for vLLM—an open source inference server that accelerates generative AI applications. The integration of vLLM with Google Cloud TPUs allows developers to maximize resources while achieving necessary performance and efficiency.

Red Hat has launched the llm-d open source project in response to the complexities organizations face in shifting from AI research to real-world deployment. This project aims to optimize costs, enhance workload efficiency, and foster innovation across heterogeneous resources.

Additionally, Red Hat’s AI Inference Server is now available on Google Cloud. This helps enterprises optimize model inference across hybrid cloud environments by leveraging Google’s infrastructure. Furthermore, Red Hat’s contribution to the Agent2Agent protocol underscores their joint commitment to open AI by facilitating communication between users or agents across diverse platforms.

Both companies are poised to bring these advancements into practice through their participation in community-powered innovation efforts. The announcement was made during the ongoing Red Hat Summit.



Related

Marvin R. Ellison Chairman, President and Chief Executive Officer

Lowe’s Companies, Inc. to host first quarter 2026 earnings call on May 20

Lowe’s Companies, Inc. will hold its First Quarter 2026 Earnings Conference Call on May 20. The event will be webcast online with supplemental materials provided shortly before it begins.

Michael Ferris Senior Vice President and Chief Strategy Officer

Boomi and Red Hat announce collaboration on integrated agentic AI stack

Boomi and Red Hat have announced a new partnership aimed at simplifying how enterprises deploy large-scale artificial intelligence solutions. Their joint platform seeks to unify disparate tools into one system focused on secure governance and operational efficiency.

Brian Moynihan Chair of the Board and Chief Executive Officer

Bank of America Chair and CEO Brian Moynihan to participate in Bernstein conference May 27

Brian Moynihan, Chair and CEO of Bank of America, will participate in the Bernstein Strategic Decisions Conference on May 27. The bank has provided contact details for both investors and reporters seeking more information.

Trending

The Weekly Newsletter

Sign-up for the Weekly Newsletter from North Wake News.