{"id":114574,"date":"2020-11-18T16:00:22","date_gmt":"2020-11-18T13:00:22","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/amd-announces-new-instinct-mi100-gpu-breaks-the-10-tflops-barrier-in-fp64-cloudsavvy-it\/"},"modified":"2020-11-18T16:00:22","modified_gmt":"2020-11-18T13:00:22","slug":"amd-announces-new-instinct-mi100-gpu-breaks-the-10-tflops-barrier-in-fp64-cloudsavvy-it","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/amd-announces-new-instinct-mi100-gpu-breaks-the-10-tflops-barrier-in-fp64-cloudsavvy-it\/","title":{"rendered":"#AMD Announces New \u201cInstinct MI100\u201d GPU, Breaks the 10 TFLOPS Barrier in FP64 \u2013 CloudSavvy IT"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_85 counter-hierarchy ez-toc-counter ez-toc-custom ez-toc-container-direction\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-6a3e07c93cc07\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #dd3333;color:#dd3333\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #dd3333;color:#dd3333\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-6a3e07c93cc07\" checked aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/buradabiliyorum.com\/en\/amd-announces-new-instinct-mi100-gpu-breaks-the-10-tflops-barrier-in-fp64-cloudsavvy-it\/#A_Card_For_The_HPC_Market\" >A Card For The HPC Market<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/buradabiliyorum.com\/en\/amd-announces-new-instinct-mi100-gpu-breaks-the-10-tflops-barrier-in-fp64-cloudsavvy-it\/#Can_ROCm_Live_Up_to_CUDA\" >Can ROCm Live Up to CUDA?<\/a><\/li><\/ul><\/nav><\/div>\n<p><strong>&#8220;#AMD Announces New \u201cInstinct MI100\u201d GPU, Breaks the 10 TFLOPS Barrier in FP64 \u2013 CloudSavvy IT&#8221;<\/strong><\/p>\n<div id=\"article-content-area\">\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-8034\" src=\"https:\/\/www.cloudsavvyit.com\/thumbcache\/0\/0\/48301387cddb344c254ee94cde7f04d7\/p\/uploads\/2020\/11\/db6591d4-1.png\" alt=\"\" width=\"700\" height=\"313\" onload=\"pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\" onerror=\"this.onerror=null;pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\"\/><\/p>\n<p>With the rising demand for HPC and AI-powered cloud <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/download-scripts-themes-apps\/\" data-internallinksmanager029f6b8e52c=\"9\" title=\"Download Scripts &amp; Themes &amp; Apps\" target=\"_blank\" rel=\"noopener\">app<\/a>lications comes a need for very powerful datacenter GPUs. Usually NVIDIA is the king of this field, but AMD\u2019s latest MI100 GPU presents some serious competition.<\/p>\n<h2 role=\"heading\" aria-level=\"2\"><span class=\"ez-toc-section\" id=\"A_Card_For_The_HPC_Market\"><\/span>A Card For The HPC Market<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The card is fast, seriously fast. NVIDIA\u2019s high end A100 GPU peaks at\u00a09.7 TFLOPS in FP64 workloads. The new \u201cAMD Instinct MI100\u201d leaps past that at 11.5 TFLOPS.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-8036\" src=\"https:\/\/www.cloudsavvyit.com\/thumbcache\/0\/0\/2f6292f97a12046b544017f07246a4f8\/p\/uploads\/2020\/11\/8d6e8097.png\" alt=\"\" width=\"700\" height=\"312\" onload=\"pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\" onerror=\"this.onerror=null;pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\"\/><\/p>\n<p>Of course, NVIDIA\u2019s cards support other speedup techniques for AI-specific workloads in different number formats, such as the TensorFloat-32 precision format and fine-grained structured sparsity. For AI and Machine Learning workloads, NVIDIA is still king, as their cards are built specifically for tensor-based operations.<\/p>\n<p>But, for <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/general\/\" data-internallinksmanager029f6b8e52c=\"3\" title=\"General\" target=\"_blank\" rel=\"noopener\">general<\/a> purpose High Performance Computing, the MI100 takes the crown for raw compute power. Plus, it\u2019s nearly half the price, and is much more efficient per watt.<\/p>\n<p>On top of the other improvements, the new architecture also brings mixed-precision improvements, with their \u201cMatrix Core\u201d <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/technology\/\" data-internallinksmanager029f6b8e52c=\"4\" title=\"Technology\" target=\"_blank\" rel=\"noopener\">technology<\/a> delivering 7x greater FP16 performance compared to their prior generation of cards.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-8038\" src=\"https:\/\/www.cloudsavvyit.com\/thumbcache\/0\/0\/6588bb99d6c97c60163b08630e9b8d86\/p\/uploads\/2020\/11\/292c692b.png\" alt=\"\" width=\"700\" height=\"216\" onload=\"pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\" onerror=\"this.onerror=null;pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\"\/><\/p>\n<p>AMD CPUs and Instinct GPUS are <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" href=\"https:\/\/www.olcf.ornl.gov\/frontier\/\">both powering two of the US Department of Energy\u2019s exascale supercomputers<\/a>. The \u201cFrontier\u201d supercomputer is planned to be built next year with current Epyc CPUs and MI100s, and will deliver more than 1.5 exaflops of peak computing power. The \u201cEl Capitan\u201d supercomputer is planned to be built in 2023 on next gen hardware, and will deliver more than 2 exaflops of double precision power.<\/p>\n<h2 role=\"heading\" aria-level=\"2\"><span class=\"ez-toc-section\" id=\"Can_ROCm_Live_Up_to_CUDA\"><\/span>Can ROCm Live Up to CUDA?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Of course, all of this power is useless if the software doesn\u2019t support it. It\u2019s no secret that NVIDIA has managed to make machine learning a bit of a walled garden.<\/p>\n<p>NVIDIA\u2019s compute framework is called <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" href=\"https:\/\/en.wikipedia.org\/wiki\/CUDA\">CUDA<\/a>, or\u00a0Compute Unified Device Architecture. It\u2019s proprietary, and only works with their cards. But since their cards have historically been the fastest, many applications are only built with CUDA support first and foremost.<\/p>\n<p>There are cross-platform programming models, most notably OpenCL, which AMD supports very well <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" href=\"https:\/\/rocmdocs.amd.com\/en\/latest\/\">with their ROCm platform<\/a>. Both NVIDIA cards and AMD cards support OpenCL, but because NVIDIA only supports it by transpiling to CUDA, it\u2019s actually slower to use OpenCL with an NVIDIA card. Because of this, not all applications will support it.<\/p>\n<p>Ultimately, you\u2019ll need to do your own research and see if the application you intend to run can be run on AMD cards, and maybe be prepared for some tinkering and bug fixing. NVIDIA GPUs on the otherhand are mostly plug and play, so even if AMD is faster, NVIDIA can continue to hinder them with closed-source software.<\/p>\n<p>However, this situation is getting better\u2014AMD is committed to open sourcing everything and creating an open environment. Tensorflow and PyTorch, two very popular ML frameworks, both support the ROCm ecosystem.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-8035\" src=\"https:\/\/www.cloudsavvyit.com\/thumbcache\/0\/0\/03aae3c52689bd7fd58e8278df8fefac\/p\/uploads\/2020\/11\/301c98b9.png\" alt=\"\" width=\"697\" height=\"479\" onload=\"pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\" onerror=\"this.onerror=null;pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\"\/><\/p>\n<p>Hopefully the raw specs of AMD\u2019s latest offerings can push the industry to a more competitive environment. After all, they\u2019re being put to use in supercomputers\n<\/p><\/div>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMLG0nwswvr63Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\">For forums sites go to <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/forum.buradabiliyorum.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">Forum.BuradaBiliyorum.Com<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more like this article, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/technology\/\" target=\"_blank\" rel=\"noopener noreferrer\">Technology category.<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/www.cloudsavvyit.com\/8032\/amd-announces-new-instinct-mi100-gpu-breaks-the-10-tflops-barrier-in-fp64\/\" target=\"_blank\" rel=\"noopener noreferrer\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&#8220;#AMD Announces New \u201cInstinct MI100\u201d GPU, Breaks the 10 TFLOPS Barrier in FP64 \u2013 CloudSavvy IT&#8221; With the rising demand for HPC and AI-powered cloud applications comes a need for very powerful datacenter GPUs. Usually NVIDIA is the king of this field, but AMD\u2019s latest MI100 GPU presents some serious competition. A Card For The&#8230;<\/p>\n","protected":false},"author":1,"featured_media":114575,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/www.cloudsavvyit.com\/p\/uploads\/2020\/11\/db6591d4-1.png","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[],"class_list":["post-114574","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/114574","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=114574"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/114574\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/114575"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=114574"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=114574"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=114574"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}