Red Hat launches new platform for enhanced generative AI across hybrid clouds

Red Hat launches new platform for enhanced generative AI across hybrid clouds
Michael Ferris Senior Vice President and Chief Strategy Officer — Red Hat
0Comments

Red Hat has announced the launch of its AI Inference Server, a development aimed at enhancing generative AI capabilities across hybrid cloud environments. The server is built on the vLLM community project and integrates Neural Magic technologies to improve speed, efficiency, and cost-effectiveness. This initiative aligns with Red Hat’s vision of enabling any generative AI model to run on any AI accelerator in any cloud setting.

Joe Fernandes, Vice President and General Manager of Red Hat’s AI Business Unit, emphasized the importance of inference in AI operations: “Inference is where the real promise of gen AI is delivered…but it must be delivered in an effective and cost-efficient way.” The new server aims to meet these demands by providing a common inference layer that supports various models across different environments.

The vLLM project, which originated from the University of California, Berkeley in 2023, forms the backbone of this offering. It supports high-throughput generative AI inference and multi-GPU model acceleration. Red Hat’s adoption of vLLM as part of its solution underscores its role as a standard for future AI inference innovations.

Key industry figures have expressed support for Red Hat’s latest venture. Ramine Roane from AMD highlighted their collaboration with Red Hat to provide efficient generative AI solutions using AMD Instinct GPUs. Jeremy Foster from Cisco noted that the server offers speed, consistency, and flexibility necessary for modern AI workloads. Intel’s Bill Pearson remarked on their excitement about integrating Intel Gaudi accelerators with the server to enhance performance and efficiency.

NVIDIA’s John Fanelli also commented on the potential benefits: “With open, full-stack NVIDIA accelerated computing and Red Hat AI Inference Server…developers can run efficient reasoning at scale across hybrid clouds.”

Red Hat aims to simplify deploying generative AI through this innovation while supporting third-party platforms for greater flexibility.



Related

Marvin R. Ellison Chairman, President and Chief Executive Officer

Lowe’s launches exclusive live music perks for loyalty members through Live Nation partnership

Lowe’s has announced exclusive live music benefits for its loyalty program members through a partnership with Live Nation. Perks include discounted tickets for children, complimentary chair rentals at concerts for early arrivals, sweepstakes opportunities for free tickets all year long as well as access to new tailgate events.

Harry K. Sideris‌, President and Chief Executive Officer at Duke Energy Florida

Duke Energy joins Careers Electric coalition to train next-generation energy workforce

Duke Energy has joined the Careers Electric coalition aiming to train thousands of workers for skilled trades jobs focused on electrification needs. The initiative begins in North Carolina with partnerships across education and industry sectors targeting both high school students and community college expansions.

D. Reid Wilson Secretary

Second application period for Dam Safety Grant funding closes June 19

The Department of Environmental Quality’s Dam Safety Program is accepting applications until June 19 for its grant program supporting dam repairs following Hurricane Helene. $3.4 million remains available after an initial round of funding was completed.

Trending

The Weekly Newsletter

Sign-up for the Weekly Newsletter from North Wake News.