Nvidia Unveils Next-Generation Rubin AI Platform for 2026

At the Computex show in Taiwan, Nvidia previewed the Blackwell Ultra chip and the next-generation Rubin AI platform.

Bloomberg News

June 3, 2024

Jensen Huang at Computex 2024 in Taiwan. Image: Bloomberg

(Bloomberg) -- Nvidia Corporation Chief Executive Officer Jensen Huang said the company plans to upgrade its AI accelerators every year, announcing a Blackwell Ultra chip for 2025 and a next-generation platform in development called Rubin for 2026.

The company – now best known for its artificial intelligence data center systems – also introduced new tools and software models on the eve of the Computex trade show in Taiwan. Nvidia sees the rise of generative AI as a new industrial revolution and expects to play a major role as the technology shifts to personal computers, the CEO said in a keynote address at National Taiwan University.

Nvidia has been the main beneficiary of a massive flood of AI spending, helping turn the company into the world’s most valuable chipmaker. But it now looks to broaden its customer base beyond the handful of cloud-computing giants that generate much of its sales. As part of the expansion, Huang expects a larger swath of companies and government agencies to embrace AI – everyone from shipbuilders to drug developers. He returned to themes he set out a year ago at the same venue, including the idea that those without AI capabilities will be left behind.

“We are seeing computation inflation,” Huang said on Sunday. As the amount of data that needs to be processed grows exponentially, traditional computing methods cannot keep up, and only Nvidia’s style of accelerated computing can cut back the costs, he said. He touted 98% cost savings and 97% less energy required with Nvidia’s technology, saying that constituted “CEO math, which is not accurate, but it is correct.”

Shares of Taiwan Semiconductor Manufacturing Company and other suppliers rose after the announcement. TSMC’s stock climbed as much as 3.9%, while Wistron Corporation gained 4%.

Huang said the upcoming Rubin AI platform will use HBM4, the next iteration of the essential high-bandwidth memory that’s grown into a bottleneck for AI accelerator production, with leader SK Hynix largely sold out through 2025. He otherwise did not offer detailed specifications for the upcoming products, which will follow Blackwell.

“I think teasing out Rubin and Rubin Ultra was extremely clever and is indicative of its commitment to a year-over-year refresh cycle,” said Dan Newman, CEO and chief analyst at Futurum Group. “What I feel he hammered home most clearly is the cadence of innovation, and the company’s relentless pursuit of maximizing the limit of technology including software, process, packaging and partnerships to protect and expand its moat and market position.”

Nvidia got its start selling gaming cards for desktop PCs, and that background is coming into play as computer makers push to add more AI functions to their machines.

Microsoft and its hardware partners are using Computex to show off new laptops with AI enhancements under the branding of Copilot+. The majority of those devices coming to market are based on a new type of processor that will enable them to go longer on one battery charge, provided by Nvidia rival Qualcomm.

While those devices are good for simple AI functionality, adding an Nvidia graphics card will massively increase their performance and bring new features to popular software like games, Nvidia said. PC makers such as Asustek Computer Inc. are offering such computers, the company said. 

To help software makers bring more new capabilities to the PC, Nvidia is offering tools and pretrained AI models. They will handle complex tasks, such as deciding whether to crunch data on the machine itself or send it out to a data center over the internet.

Separately, Nvidia is releasing a new design for server computers built on its chips. The MGX program is used by companies such as Hewlett Packard Enterprise Co. and Dell Technologies Inc. to allow them to get to market faster with products that are used by corporations and government agencies. Even rivals Advanced Micro Devices and Intel Corporation are taking advantage of the design with servers that put their processors alongside Nvidia chips. 

AMD CEO Lisa Su took the stage at Computex the day after Huang’s comments, sketching out her company’s progress in AI chips. AMD is speeding up the introduction of its AI processors as it seeks to close the gap with Nvidia in the fast-growing field.

Nvidia’s earlier-announced products, such as Spectrum X for networking and Nvidia Inference Microservices – or NIM, which Huang called “AI in a box” – are now generally available and being widely adopted, the company said. It’s also going to offer free access to the NIM products. The microservices are a set of intermediate software and models that help companies roll out AI services more quickly, without having to worry about the underlying technology. Companies that deploy them then have to pay Nvidia a usage fee.

Huang also promoted the use of digital twins in a virtual world that Nvidia calls the Omniverse. To demonstrate the scale possible, he showed a digital twin of planet Earth, called Earth-2, and how it can help conduct more sophisticated weather pattern modeling and other complex tasks. He noted that Taiwan-based contract manufacturers such as Hon Hai Precision Industry Company, also known as Foxconn, are using the tools to make plans and operate their factories more efficiently.
