{"id":672527,"date":"2025-05-30T11:28:31","date_gmt":"2025-05-30T08:28:31","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/deepseeks-distilled-new-r1-ai-model-can-run-on-a-single-gpu\/"},"modified":"2025-05-30T11:28:31","modified_gmt":"2025-05-30T08:28:31","slug":"deepseeks-distilled-new-r1-ai-model-can-run-on-a-single-gpu","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/deepseeks-distilled-new-r1-ai-model-can-run-on-a-single-gpu\/","title":{"rendered":"DeepSeek&#8217;s distilled new R1 AI model can run on a single GPU"},"content":{"rendered":"<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">DeepSeek\u2019s updated R1 reasoning AI model might be getting the bulk of the AI community\u2019s attention this week. But the Chinese AI lab also released a smaller, \u201cdistilled\u201d version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks.<\/p>\n<p class=\"wp-block-paragraph\">The smaller updated R1, which was built using the Qwen3-8B model Alibaba launched in May as a foundation, performs better than Google\u2019s Gemini 2.5 Flash on AIME 2025, a collection of challenging math questions. <\/p>\n<p class=\"wp-block-paragraph\">DeepSeek-R1-0528-Qwen3-8B also nearly matches Microsoft\u2019s recently released Phi 4 reasoning plus model on another math skills test, HMMT.<\/p>\n<p class=\"wp-block-paragraph\">So-called distilled models like DeepSeek-R1-0528-Qwen3-8B are <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/general\/\" data-internallinksmanager029f6b8e52c=\"3\" title=\"General\" target=\"_blank\" rel=\"noopener\">general<\/a>ly less capable than their full-sized counterparts. On the plus side, they\u2019re far less computationally demanding. <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/nodeshift.com\/blog\/how-to-install-qwen-3-locally\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">According<\/a> to the cloud platform NodeShift, Qwen3-8B requires a GPU with 40GB-80GB of RAM to run (e.g., an Nvidia H100). The full-sized new R1 needs <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/dev.to\/ai4b\/comprehensive-hardware-requirements-report-for-deepseek-r1-5269\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">around a dozen 80GB GPUs<\/a>.<\/p>\n<p class=\"wp-block-paragraph\">DeepSeek trained DeepSeek-R1-0528-Qwen3-8B by taking text generated by the updated R1 and using it to fine-tune Qwen3-8B. In a dedicated web page for the model on the AI dev platform Hugging Face, DeepSeek describes DeepSeek-R1-0528-Qwen3-8B as \u201cfor both academic research on reasoning models and industrial development focused on small-scale models.\u201d<\/p>\n<p class=\"wp-block-paragraph\">DeepSeek-R1-0528-Qwen3-8B is available under a permissive MIT license, meaning it can be used commercially without restriction. Several hosts, including <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/x.com\/lmstudio\/status\/1928092450410648000\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">LM Studio<\/a>, already offer the model through an API.<\/p>\n<\/div>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMN63nwsw68G3Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more like this article, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/category\/technology\/\" target=\"_blank\" >Technology<\/a><\/span> category.<\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techcrunch.com\/2025\/05\/29\/deepseeks-distilled-new-r1-ai-model-can-run-on-a-single-gpu\/\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>DeepSeek\u2019s updated R1 reasoning AI model might be getting the bulk of the AI community\u2019s attention this week. But the Chinese AI lab also released a smaller, \u201cdistilled\u201d version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks. The smaller updated R1, which was built using the Qwen3-8B model&#8230;<\/p>\n","protected":false},"author":1,"featured_media":672528,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/01\/GettyImages-2196223480.jpg?resize=1200,825","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[77337,4973,153752],"class_list":["post-672527","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-ai","tag-china","tag-deepseek"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/672527","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=672527"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/672527\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/672528"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=672527"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=672527"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=672527"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}