{"id":72138,"date":"2020-09-21T20:52:39","date_gmt":"2020-09-21T17:52:39","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/ai-devs-created-a-lean-mean-gpt-3-beating-machine-that-uses-99-9-fewer-parameters\/"},"modified":"2020-09-21T20:52:39","modified_gmt":"2020-09-21T17:52:39","slug":"ai-devs-created-a-lean-mean-gpt-3-beating-machine-that-uses-99-9-fewer-parameters","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/ai-devs-created-a-lean-mean-gpt-3-beating-machine-that-uses-99-9-fewer-parameters\/","title":{"rendered":"#AI devs created a lean, mean, GPT-3-beating machine that uses 99.9% fewer parameters"},"content":{"rendered":"<p>&#8220;<strong>#AI devs created a lean, mean, GPT-3-beating machine that uses 99.9% fewer parameters<\/strong>&#8221;<\/p>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/watch-movies-tv-seriess\/\" data-internallinksmanager029f6b8e52c=\"8\" title=\"Watch Movies &amp; TV Series\" target=\"_blank\" rel=\"noopener\">watch Movies<\/a> or TV series visit the <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/dizi.buradabiliyorum.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">Dizi.BuradaBiliyorum.Com<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<p><img decoding=\"async\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2017\/07\/robots-796x433.jpg\" \/><\/p>\n<div>\n                                AI researchers from the Ludwig Maximilian University (LMU) of Munich have developed a bite-sized text generator capable of besting OpenAI\u2018s state of the art GPT-3 using only a tiny fraction of its parameters.<\/p>\n<p>GPT-3 is a monster of an AI system capable of responding to almost any text prompt with unique, original responses that are often surprisingly cogent. It\u2019s an example of what incredibly talented developers can do with cutting-edge algorithms and software when given unfettered access to supercomputers.<\/p>\n<p>But it\u2019s not very efficient. At least not when compared to a new system developed by LMU researchers Timo Schick and Hinrich Schutze.<\/p>\n<p><em>[Read: OpenAI reveals the pricing plans for its API \u2014 and it ain\u2019t cheap]<\/em><\/p>\n<p>According to a recent <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" href=\"https:\/\/arxiv.org\/pdf\/2009.07118.pdf\">pre-print paper<\/a> on arXiv, the duo\u2019s system outperforms GPT-3 on the \u201csuperGLUE\u201d benchmark test with only 223 million parameters:<\/p>\n<blockquote><p>In this work, we show that performance similar to GPT-3 can be obtained with language models whose parameter count is several orders of magnitude smaller. This is achieved by converting textual inputs into cloze questions that contain some form of task de<a href=\"https:\/\/buradabiliyorum.com\/en\/category\/download-scripts-themes-apps\/\" data-internallinksmanager029f6b8e52c=\"9\" title=\"Download Scripts &amp; Themes &amp; Apps\" target=\"_blank\" rel=\"noopener\">script<\/a>ion, combined with gradient-based optimization; additionally exploiting unlabeled data gives further improvements.<\/p>\n<\/blockquote>\n<p>Parameters are variables used to tune and tweak AI models. They\u2019re intimated from data \u2013 in essence the more parameters an AI model is trained with, the more robust we expect it to be.<\/p>\n<p>When a system using 99.9% less model parameters is able to best the best at a benchmark task, it\u2019s a pretty big deal. This isn\u2019t to say that the LMU system is better than GPT-3, nor that it\u2019s capable of beating it in tests other than the SuperGLUE benchmark \u2013 which isn\u2019t indicative of GPT-3\u2019s overall capabilities.<\/p>\n<p>The LMU system\u2019s results come courtesy of a training method called pattern-exploiting training (PET). According to Open AI policy director Jack Clark, writing in the weekly <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" href=\"https:\/\/us13.campaign-archive.com\/?u=67bd06787e84d73db24fb0aa5&amp;id=ef5072d878\">ImportAI newsletter<\/a>:<\/p>\n<blockquote><p>Their approach fuses a training technique called PET (pattern-exploiting training) with a small pre-trained Albert model, letting them create a system that \u201coutperform GPT-3 on SuperGLUE with 32 training examples, while requiring only 0.1% of its parameters.\u201d<\/p>\n<\/blockquote>\n<p>Clark goes on to point out that, while it won\u2019t outperform GPT-3 in every task, it does open new avenues for researchers looking to push the boundaries of AI with more modest hardware.<\/p>\n<p>For more information check out the duo\u2019s paper <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" href=\"https:\/\/arxiv.org\/pdf\/2009.07118.pdf\">here<\/a>.<\/p>\n<p><em>H\/t:\u00a0<a rel=\"nofollow noopener noreferrer\" target=\"_blank\" href=\"https:\/\/twitter.com\/jackclarkSF\">Jack Clark<\/a> and <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" href=\"https:\/\/twitter.us13.list-manage.com\/subscribe?u=67bd06787e84d73db24fb0aa5&amp;id=6c9d98ff2c\">ImportAI<\/a><\/em><\/p>\n<p><em><span>So you\u2019re interested in AI? Then\u00a0<\/span><span>join our online event, TNW2020<\/span><span>, where you\u2019ll hear how artificial intelligence is transforming industries and businesses.<\/span><\/em><\/p>\n<p class=\"c-post-pubDate\">\n                                    Published September 21, 2020 \u2014 17:52 UTC<\/p><\/div>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><script data-src=\"https:\/\/connect.facebook.net\/en_US\/sdk.js#xfbml=1&amp;appId=378011798897423&amp;version=v2.6\" id=\"socialSrcFacebook\" type=\"text\/template\"><\/script><\/p>\n<blockquote>\n<p style=\"text-align: center;\"><strong>if you want to watch Movies or Tv Shows go to <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/dizi.buradabiliyorum.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">Dizi.BuradaBiliyorum.Com<\/a> <\/span> for forums sites go to <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/forum.buradabiliyorum.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">Forum.BuradaBiliyorum.Com<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more like this article, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/technology\/\" target=\"_blank\" rel=\"noopener noreferrer\">Technology category.<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/thenextweb.com\/neural\/2020\/09\/21\/ai-devs-created-a-lean-mean-gpt-3-beating-machine-that-uses-99-9-fewer-parameters\/\" target=\"_blank\" rel=\"noopener noreferrer\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&#8220;#AI devs created a lean, mean, GPT-3-beating machine that uses 99.9% fewer parameters&#8221; If you want to watch Movies or TV series visit the Dizi.BuradaBiliyorum.Com AI researchers from the Ludwig Maximilian University (LMU) of Munich have developed a bite-sized text generator capable of besting OpenAI\u2018s state of the art GPT-3 using only a tiny fraction&#8230;<\/p>\n","protected":false},"author":1,"featured_media":72139,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/img-cdn.tnwcdn.com\/image\/neural?filter_last=1&fit=1280,640&url=https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2017\/07\/robots.jpg&signature=5fcc857cf86c3835a880c08cf8a93783","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[],"class_list":["post-72138","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/72138","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=72138"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/72138\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/72139"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=72138"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=72138"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=72138"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}