Nvidia H100 GPUs set time to beat in MLPerf generative AI benchmark debut

Nvidia’s H100 Tensor Core GPUs have gained recognition for their AI performance, particularly in large language models (LLMs) that power generative AI. The GPUs achieved impressive results in all eight MLPerf training benchmarks, including the new test for generative AI. A cluster of 3,584 H100 GPUs, developed by Inflection AI and operated by CoreWeave, completed a GPT-3-based training benchmark in under 11 minutes. Inflection AI used the H100 GPUs to create an advanced LLM for its personal AI assistant, Pi. By using Nvidia Quantum-2 InfiniBand networking, CoreWeave achieved similar performance to local data center setups.

According to Nvidia, both users and industry-standard benchmarks confirm the strong AI performance of the company’s H100 Tensor Core GPUs, particularly on the large language models (LLMs) that drive generative AI.

The H100 GPUs have achieved impressive results in all eight tests of the latest MLPerf training benchmarks, including the new MLPerf test designed for generative AI. Nvidia says the performance is demonstrated both at the individual accelerator level and at scale within massive server environments.

A cluster comprising 3,584 H100 GPUs was assembled by start-up Inflection AI and operated by CoreWeave, a specialized cloud service provider for GPU-accelerated workloads. In the tests, the cluster completed the GPT-3-based training benchmark in under 11 minutes.
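To put those figures in perspective, the short sketch below turns the GPU count and run time into a rough ceiling on total GPU-hours. It assumes the run took a full 11 minutes (the article only says "under 11 minutes") and that all 3,584 GPUs were busy for the whole run, so the result is an upper bound rather than a measured value. The function name gpu_hours is illustrative only.

```python
# Back-of-the-envelope upper bound on the compute used by the benchmark run.
# Both inputs come from the article; "under 11 minutes" is rounded up to
# exactly 11 minutes, so the result is a ceiling, not a measurement.

NUM_GPUS = 3584       # H100 GPUs in the cluster
MAX_MINUTES = 11.0    # assumed worst-case wall-clock time


def gpu_hours(num_gpus: int, minutes: float) -> float:
    """GPU-hours consumed if num_gpus all run for the given number of minutes."""
    return num_gpus * minutes / 60.0


upper_bound = gpu_hours(NUM_GPUS, MAX_MINUTES)
print(f"At most {upper_bound:.0f} GPU-hours")  # prints "At most 657 GPU-hours"
```

In other words, the whole run used no more than about 657 GPU-hours, compressed into a few minutes of wall-clock time by spreading the work across the cluster.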

CoreWeave’s co-founder and CTO Brian Venturo said their customers are leveraging CoreWeave’s fleet of H100 GPUs to construct generative AI and LLMs at scale.

Inflection AI said it has used the H100 GPUs to develop an advanced LLM that serves as the foundation for its first personal AI assistant, named Pi (personal intelligence). The company plans to function as an AI studio, creating personal AI systems that users can engage with using simple and intuitive methods.

CoreWeave claims its cloud delivered performance similar to what Nvidia achieved with an AI supercomputer running in a local data center. That, said Nvidia, is a testament to the low latency of the Nvidia Quantum-2 InfiniBand networking that CoreWeave uses.

Mustafa Suleyman, CEO of Inflection AI, said anyone can experience the capabilities of a personal AI assistant based on his company’s large language model, which was trained using CoreWeave’s network of H100 GPUs.

Inflection AI was founded in early 2022 by Mustafa Suleyman and Karén Simonyan, both from DeepMind, along with Reid Hoffman. The company said it is collaborating with CoreWeave to establish one of the world’s largest computing clusters utilizing Nvidia GPUs.

Furthermore, Nvidia noted that data centers accelerated with Nvidia GPUs use fewer server nodes, so they use less rack space and energy. In addition, accelerated networking boosts efficiency and performance, and ongoing software optimizations bring X-factor gains on the same hardware.

