Nvidia reportedly cancels quad-die Rubin Ultra GPU in favor of dual-GPU design, report claims — complex design purportedly scrapped over ‘manufacturing execution concerns’
In a bid to offer unbeatable performance, Nvidia had planned to use four GPU chiplets in its Rubin Ultra AI accelerator due in 2027. However, due to concerns about the manufacturability of such a solution, the company decided to cancel it in favor of a dual-GPU design that is easier to produce, according to SemiAnalysis.

- High-Bandwidth Memory (HBM) Roadmap
- Nvidia Enterprise GPU and CPU Roadmap
- AI accelerator Roadmap
- Desktop GPU Roadmap
- 3D NAND Roadmap
Nvidia’s Rubin Ultra GPU with four compute chiplets was arguably one of Nvidia’s most ambitious projects in recent years, as it not only doubled performance compared to the original Rubin (which uses two compute chiplets), but also increased the complexity of Nvidia’s data center GPUs to levels never seen before. However, connecting four near reticle-sized dies using existing advanced packaging technologies is a tremendous engineering challenge, and cooling four complex dies and 16 HBM4E modules is hard and costly. As a result, due to ‘manufacturing execution concerns,’ Nvidia reportedly canceled Rubin Ultra in its four compute dies form in favor of a design with two compute chiplets. Note that the information is unofficial, so take it with a grain of salt. We’ve reached out to Nvidia for comment.

As a consequence, Nvidia’s ‘new’ Rubin Ultra would be around half as powerful as the original one, which would certainly make it less competitive against contending offerings, namely AMD’s Instinct MI500-series. Of course, Nvidia will still likely optimize its Rubin Ultra design to squeeze some additional performance out of the AI accelerator to justify the upgrade.
Also, keep in mind that Nvidia’s Rubin Ultra uses HBM4E memory instead of HBM4 used by the original Rubin. Furthermore, starting with Rubin GPUs, Nvidia plans to offer liquid-cooled Kyber rack-scale systems that increase GPU count per scale-up domain to at least 144 packages, which will increase compute performance that Nvidia will sell to its customers.
SemiAnalysis notes that the impact of the cancellation of an AI accelerator with 16 HBM4E packages could have an impact on the HBM market in general, as the ‘new’ Rubin Ultra will only use eight HBM4E modules.
The purported cancellation of Rubin Ultra with four compute chiplets would also mean that one Rubin Ultra GPU with two compute chiplets will cost less than the original one. Meanwhile, since Nvidia is mostly focused on selling rack-scale solutions rather than on individual GPUs, it remains to be seen how this impacts the actual spending of Nvidia’s partners, since if they have to buy more systems to get more GPUs, they will likely spend more than they would if they had to buy fewer systems with the same number of compute chiplets.