Nvidia unveils new GPU designed for long-context inference
At the AI Infrastructure Summit on Tuesday, Nvidia introduced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens. Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader “disaggregated inference” infrastructure approach. For…













