{"id":405671,"date":"2022-02-15T02:35:07","date_gmt":"2022-02-14T23:35:07","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/reward-may-not-be-enough-for-agi-but-its-worth-a-try\/"},"modified":"2022-02-15T02:35:07","modified_gmt":"2022-02-14T23:35:07","slug":"reward-may-not-be-enough-for-agi-but-its-worth-a-try","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/reward-may-not-be-enough-for-agi-but-its-worth-a-try\/","title":{"rendered":"#Reward may NOT be enough for AGI \u2014 but it\u2019s worth a try"},"content":{"rendered":"<p>&#8220;<strong>#Reward may NOT be enough for AGI \u2014 but it\u2019s worth a try<\/strong>&#8221;<br \/>\n<img decoding=\"async\" src=\"https:\/\/img-cdn.tnwcdn.com\/image?fit=796%2C417&amp;url=https%3A%2F%2Fcdn0.tnwcdn.com%2Fwp-content%2Fblogs.dir%2F1%2Ffiles%2F2022%2F02%2FUntitled-design-5.jpg&amp;signature=eb9ec97cf4258c92370f2d123447c28f\" \/><\/p>\n<div>\n                            DeepMind has been connected to artificial <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/general\/\" data-internallinksmanager029f6b8e52c=\"3\" title=\"General\" target=\"_blank\" rel=\"noopener\">general<\/a> intelligence since birth.<\/p>\n<p>The lab was launched with <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/deepmind.com\/research\/publications\/2021\/Reward-is-Enough\">a mission to<\/a> develop AGI, was cofounded by a researcher <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/twitter.com\/ShaneLegg\/status\/1404405011241738247\">who coined the term<\/a>, and has made <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/deepmind.com\/blog\/article\/real-world-challenges-for-agi\">some compelling advances<\/a> in the field.<\/p>\n<p>It also recently produced a provocative paper\u00a0on the subject: \u201c<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/deepmind.com\/research\/publications\/2021\/Reward-is-Enough\">Reward is Enough<\/a>\u201d<\/p>\n<p>The study <span class=\"s1\">hypothesizes that AGI could be achieved through a single <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/download-scripts-themes-apps\/\" data-internallinksmanager029f6b8e52c=\"9\" title=\"Download Scripts &amp; Themes &amp; Apps\" target=\"_blank\" rel=\"noopener\">app<\/a>roach: reinforcement learning.<\/span><\/p>\n<p><span class=\"s1\">This technique provides feedback in the form of a \u201creward\u201d \u2014 a positive number that tells an algorithm that the action it just performed will benefit its goal. <\/span><\/p>\n<p><span class=\"s1\">The approach has shown promise in programs such as MuZero, which mastered multiple <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/game\/\" data-internallinksmanager029f6b8e52c=\"7\" title=\"Game\" target=\"_blank\" rel=\"noopener\">game<\/a>s mastered multiple games without being told their rules. DeepMind called the system a \u201csignificant step forward in the pursuit of general-purpose algorithms.\u201d\u00a0<\/span><\/p>\n<p>\u201cReward is Enough\u201d suggests that reinforcement learning alone could lead to AGI.<\/p>\n<p><a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/bdtechtalks.com\/2021\/07\/07\/ai-reward-is-not-enough-herbert-roitblat\/\">This theory has been challenged<\/a> by many computer scientists \u2014 including some at DeepMind. But <span class=\"s1\">Doina Precup, one of the paper\u2019s co-authors, told TNW that the study merely sought to probe the possibilities.<\/span><\/p>\n<p>\u201cU<span class=\"s1\">ltimately, we want to test this as a hypothesis and to think of it in the context of other methods as well,\u201d said Precup, who heads up DeepMind\u2019s Montreal office.<\/span><\/p>\n<p>Indeed, reinforcement learning is just one approach that the Google subsidiary is exploring. In a new episode <span>of the <\/span><a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/deepmind.com\/learning-resources\/deepmind-the-podcast\" data-saferedirecturl=\"https:\/\/www.google.com\/url?q=https:\/\/deepmind.com\/learning-resources\/deepmind-the-podcast&amp;source=gmail&amp;ust=1644958346354000&amp;usg=AOvVaw1U9FBtO1cKTvSi7LRZEvUr\"><span>DeepMind podcast<\/span><\/a>, the lab\u2019s researchers discuss the promise of various pathways to AGI.<\/p>\n<p>Among the reward-is-enough skeptics is Raia Hadsell, the company\u2019s director of robotics, who notes the difficulty of designing an all-powerful reward that leads to AGI. DeepMind cofounder Shane Legg, meanwhile, suspects that reinforcement learning may have to combine with learning algorithms.<\/p>\n<p>Precup also has doubts that reward alone is enough, but she believes it could be a crucial ingredient in AGI.<\/p>\n<p class=\"p1\"><span class=\"s1\">\u201cBecause it\u2019s learning from interaction in an incremental way, it feels very much like what biological intelligence systems do,\u201d she said. <\/span><\/p>\n<p class=\"p1\"><span class=\"s1\">\u201cIs it at the end of the day going to be the only <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/technology\/\" data-internallinksmanager029f6b8e52c=\"4\" title=\"Technology\" target=\"_blank\" rel=\"noopener\">technology<\/a> that contributes to AGI? Well, that\u2019s not clear at all \u2014 there\u2019s a lot of other really interesting things that are going on.\u201d<\/span><span class=\"s1\"\/><\/p>\n<p>Precup is nonetheless optimistic that we\u2019re already on a path to AGI. Ultimately, she\u2019s more concerned about the safety of the destination than the route that takes us there.<\/p>\n<p><em>\u201cThe road to AGI,\u201d the fifth episode in season two of \u201cDeepMind: The Podcast,\u201d is <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/deepmind.com\/learning-resources\/deepmind-the-podcast\">available here<\/a>\u00a0from February 15.<\/em>\n                        <\/div>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMLG0nwswvr63Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\">For forums sites go to <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/forum.buradabiliyorum.com\/\" target=\"_blank\" rel=\"noopener\">Forum.BuradaBiliyorum.Com<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more like this article, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/technology\/\" target=\"_blank\" rel=\"noopener\">Technology category.<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/thenextweb.com\/news\/deepmind-reinforcement-learning-only-one-possible-pathway-to-agi\" target=\"_blank\" rel=\"noopener\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&#8220;#Reward may NOT be enough for AGI \u2014 but it\u2019s worth a try&#8221; DeepMind has been connected to artificial general intelligence since birth. The lab was launched with a mission to develop AGI, was cofounded by a researcher who coined the term, and has made some compelling advances in the field. It also recently produced&#8230;<\/p>\n","protected":false},"author":1,"featured_media":405672,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/img-cdn.tnwcdn.com\/image\/neural?filter_last=1&fit=1280,640&url=https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/02\/Untitled-design-5.jpg&signature=46afb4770ad61e8052bcef3411496990","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[],"class_list":["post-405671","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/405671","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=405671"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/405671\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/405672"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=405671"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=405671"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=405671"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}