{"id":689072,"date":"2025-09-09T19:45:17","date_gmt":"2025-09-09T16:45:17","guid":{"rendered":"https:\/\/buradabiliyorum.com\/en\/nvidia-unveils-new-gpu-designed-for-long-context-inference\/"},"modified":"2025-09-09T19:45:17","modified_gmt":"2025-09-09T16:45:17","slug":"nvidia-unveils-new-gpu-designed-for-long-context-inference","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/nvidia-unveils-new-gpu-designed-for-long-context-inference\/","title":{"rendered":"Nvidia unveils new GPU designed for long-context inference"},"content":{"rendered":"<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/nvidianews.nvidia.com\/news\/nvidia-unveils-rubin-cpx-a-new-class-of-gpu-designed-for-massive-context-inference\">the Rubin CPX<\/a>, designed for context windows larger than 1 million tokens. <\/p>\n<p class=\"wp-block-paragraph\">Part of the chip giant\u2019s forthcoming Rubin <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/watch-movies-tv-seriess\/\" data-internallinksmanager029f6b8e52c=\"8\" title=\"Watch Movies &amp; TV Series\" target=\"_blank\" rel=\"noopener\">series<\/a>, the CPX is optimized for processing large sequences of context and is meant to be used as part of <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/developer.nvidia.com\/blog\/nvidia-rubin-cpx-accelerates-inference-performance-and-efficiency-for-1m-token-context-workloads\/\">a broader \u201cdisaggregated inference\u201d infrastructure approach<\/a>. For users, the result will be better performance on long-context tasks like video generation or software development.<\/p>\n<p class=\"wp-block-paragraph\">Nvidia\u2019s relentless development cycle has resulted in enormous profits for the company, which brought in $41.1 billion in data center sales in its most recent quarter.<\/p>\n<p class=\"wp-block-paragraph\">The Rubin CPX is slated to be available at the end of 2026.<\/p>\n<\/div>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMN63nwsw68G3Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more like this article, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/buradabiliyorum.com\/en\/category\/technology\/\" target=\"_blank\" >Technology<\/a><\/span> category.<\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techcrunch.com\/2025\/09\/09\/nvidia-unveils-new-gpu-designed-for-long-context-inference\/\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens. Part of the chip giant\u2019s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader \u201cdisaggregated&#8230;<\/p>\n","protected":false},"author":1,"featured_media":689073,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/03\/GettyImages-2205210966.jpg?resize=1200,800","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[77337,75885,80446,77134],"class_list":["post-689072","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-ai","tag-enterprise","tag-gpu","tag-nvidia"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/689072","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=689072"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/689072\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/689073"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=689072"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=689072"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=689072"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}