Nvidia reportedly cancels quad-die Rubin Ultra GPU in favor of dual-GPU design: complex design purportedly scrapped over 'manufacturing execution concerns'

In a bid to offer unbeatable performance, Nvidia had planned to use four GPU chiplets in its Rubin Ultra AI accelerator, which is due in 2027. However, concerns about the manufacturability of such a solution led the company to cancel it in favor of a dual-GPU design that is easier to produce, according to SemiAnalysis.

Nvidia’s Rubin Ultra GPU with four compute chiplets was arguably one of the company’s most ambitious projects in recent years: it would not only have doubled performance compared to the original Rubin (which uses two compute chiplets), but also raised the complexity of Nvidia’s data center GPUs to levels never seen before. However, connecting four near-reticle-sized dies using existing advanced packaging technologies is a tremendous engineering challenge, and cooling four complex dies and 16 HBM4E modules is hard and costly. As a result, citing ‘manufacturing execution concerns,’ Nvidia reportedly canceled the four-die version of Rubin Ultra in favor of a design with two compute chiplets. Note that the information is unofficial, so take it with a grain of salt. We’ve reached out to Nvidia for comment.

Nvidia data center GPU roadmap 2025 showing Rubin and Rubin Ultra — (Image credit: Nvidia)

As a consequence, Nvidia’s ‘new’ Rubin Ultra would be around half as powerful as the originally planned one, which would certainly make it less competitive against rival offerings, namely AMD’s Instinct MI500-series. Of course, Nvidia will still likely optimize its Rubin Ultra design to squeeze some additional performance out of the AI accelerator to justify the upgrade.

Also, keep in mind that Nvidia’s Rubin Ultra uses HBM4E memory instead of the HBM4 used by the original Rubin. Furthermore, starting with Rubin GPUs, Nvidia plans to offer liquid-cooled Kyber rack-scale systems that increase the GPU count per scale-up domain to at least 144 packages, which will raise the total compute performance Nvidia can sell to its customers in a single system.

SemiAnalysis notes that the cancellation of an AI accelerator with 16 HBM4E packages could affect the HBM market in general, as the ‘new’ Rubin Ultra will use only eight HBM4E modules.

The purported cancellation of Rubin Ultra with four compute chiplets would also mean that one Rubin Ultra GPU with two compute chiplets will cost less than the originally planned one. Meanwhile, since Nvidia is mostly focused on selling rack-scale solutions rather than individual GPUs, it remains to be seen how this affects the actual spending of Nvidia’s partners. If they have to buy more systems to get the same number of compute chiplets, they will likely spend more than they would have on fewer, denser systems.
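To make the arithmetic behind these points concrete, here is a rough back-of-the-envelope sketch in Python. It uses only the figures reported above (four dies and 16 HBM4E stacks per package for the canceled design, two dies and eight stacks for the new one, and 144 packages per scale-up domain). The function names are hypothetical, and holding the package count at 144 for both designs is an assumption made purely for comparison, not something Nvidia or SemiAnalysis has stated.

```python
import math

# Per-package figures as reported in the article (unofficial, unconfirmed by Nvidia).
QUAD_DIE = {"compute_dies": 4, "hbm4e_stacks": 16}  # canceled design
DUAL_DIE = {"compute_dies": 2, "hbm4e_stacks": 8}   # reported replacement

# The article says "at least 144 packages" per scale-up domain; 144 is used
# here as a fixed value for both designs, which is an assumption.
PACKAGES_PER_DOMAIN = 144


def domain_totals(layout, packages=PACKAGES_PER_DOMAIN):
    """Total compute dies and HBM4E stacks in one scale-up domain."""
    return {
        "compute_dies": packages * layout["compute_dies"],
        "hbm4e_stacks": packages * layout["hbm4e_stacks"],
    }


def packages_needed(target_dies, layout):
    """Packages required to reach a given number of compute dies."""
    return math.ceil(target_dies / layout["compute_dies"])


quad = domain_totals(QUAD_DIE)
dual = domain_totals(DUAL_DIE)
print("Quad-die domain:", quad)
print("Dual-die domain:", dual)

# How many dual-die packages would match the die count of one quad-die domain?
print("Dual-die packages to match:", packages_needed(quad["compute_dies"], DUAL_DIE))
```

Under these assumptions, a domain of dual-die packages carries half the compute dies and half the HBM4E stacks of a quad-die one, so matching the original die count takes twice as many packages. Actual pricing and system configurations are unknown, so this says nothing about real spending.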
