{"id":664554,"date":"2025-04-23T00:58:15","date_gmt":"2025-04-22T21:58:15","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/system-learns-after-watching-how-to-videos\/"},"modified":"2025-04-23T00:58:15","modified_gmt":"2025-04-22T21:58:15","slug":"system-learns-after-watching-how-to-videos","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/system-learns-after-watching-how-to-videos\/","title":{"rendered":"System learns after watching how-to videos"},"content":{"rendered":"<div>\n<div class=\"article-gallery lightGallery\">\n<div data-thumb=\"https:\/\/scx1.b-cdn.net\/csz\/news\/tmb\/2025\/robot-see-robot-do-sys.jpg\" data-src=\"https:\/\/scx2.b-cdn.net\/gfx\/news\/2025\/robot-see-robot-do-sys.jpg\" data-sub-html=\"Kushal Kedia (left), a doctoral student in the field of computer science, and Prithwish Dan are members of the development team behind RHyME, a system that allows robots to learn tasks by watching a single how-to video. Credit: Louis DiPietro\">\n<figure class=\"article-img\">\n            <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/scx1.b-cdn.net\/csz\/news\/800a\/2025\/robot-see-robot-do-sys.jpg\" alt=\"Robot see, robot do: System learns after watching how-to videos\" title=\"Kushal Kedia (left), a doctoral student in the field of computer science, and Prithwish Dan are members of the development team behind RHyME, a system that allows robots to learn tasks by watching a single how-to video. Credit: Louis DiPietro\" width=\"800\" height=\"450\"\/><figcaption class=\"text-darken text-low-up text-truncate-js text-truncate mt-3\">\n                Kushal Kedia (left), a doctoral student in the field of computer <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/sciencee\/\" data-internallinksmanager029f6b8e52c=\"5\" title=\"Science\" target=\"_blank\" rel=\"noopener\">science<\/a>, and Prithwish Dan are members of the development team behind RHyME, a system that allows robots to learn tasks by watching a single how-to video. Credit: Louis DiPietro<br \/>\n            <\/figcaption><\/figure>\n<\/p><\/div>\n<\/div>\n<p>Cornell University researchers have developed a new robotic framework powered by artificial intelligence\u2014called RHyME (Retrieval for Hybrid Imitation under Mismatched Execution)\u2014that allows robots to learn tasks by watching a single how-to video.<\/p>\n<p>Robots can be finicky learners. Historically, they&#8217;ve required precise, step-by-step directions to complete basic tasks, and tend to call it quits when things go off-<a href=\"https:\/\/buradabiliyorum.com\/en\/category\/download-scripts-themes-apps\/\" data-internallinksmanager029f6b8e52c=\"9\" title=\"Download Scripts &amp; Themes &amp; Apps\" target=\"_blank\" rel=\"noopener\">script<\/a>, like after dropping a tool or losing a screw. RHyME, however, could fast-track the development and deployment of robotic systems by significantly reducing the time, energy and money needed to train them, the researchers said.<\/p>\n<p>&#8220;One of the annoying things about working with robots is collecting so much data on the robot doing different tasks,&#8221; said Kushal Kedia, a doctoral student in the field of computer science. &#8220;That&#8217;s not how humans do tasks. We look at other people as inspiration.&#8221;<\/p>\n<p>Kedia will present a paper titled &#8220;One-Shot Imitation under Mismatched Execution,&#8221; in May at the Institute of Electrical and Electronics Engineers&#8217; International Conference on Robotics and Automation, in Atlanta. The work is also <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/arxiv.org\/abs\/2409.06615\" target=\"_blank\">available<\/a> on the <i>arXiv<\/i> preprint server.<\/p>\n<p>Home robot assistants are still a long way off because they lack the wits to navigate the physical world and its countless contingencies. To get robots up to speed, researchers like Kedia are training them with what amounts to how-to videos\u2014human demonstrations of various tasks in a lab setting. The hope of this approach, a branch of machine learning called &#8220;imitation learning,&#8221; is that robots will learn a sequence of tasks faster and be able to adapt to real-world environments.<\/p>\n<figure class=\"mb-4\" itemscope=\"\" itemtype=\"http:\/\/schema.org\/VideoObject\">\n    <meta itemprop=\"name\" content=\"Robot see, robot do: System learns after watching how-to videos\"\/><br \/>\n    <meta itemprop=\"url\" content=\"https:\/\/scx2.b-cdn.net\/gfx\/video\/2025\/robot-see-robot-do-sys-1.mp4\"\/><br \/>\n    <meta itemprop=\"description\" content=\"Credit: Cornell University\"\/><br \/>\n    <meta itemprop=\"uploadDate\" content=\"2025-04-22T15:45:54-04:00\"\/><br \/>\n        <meta itemprop=\"thumbnailUrl\" content=\"https:\/\/scx1.b-cdn.net\/gfx\/video_tmb\/2025\/robot-see-robot-do-sys-1.mp4.jpg\"\/><br \/>\n    <meta itemprop=\"contentUrl\" content=\"https:\/\/scx2.b-cdn.net\/gfx\/video\/2025\/robot-see-robot-do-sys-1.mp4\"\/><br \/>\n            <video class=\"embed-responsive embed-responsive-16by9\" id=\"jwVID83205\" controls=\"\" poster=\"https:\/\/scx1.b-cdn.net\/gfx\/video_tmb\/2025\/robot-see-robot-do-sys-1.mp4.jpg\"><source src=\"https:\/\/scx2.b-cdn.net\/gfx\/video\/2025\/robot-see-robot-do-sys-1.mp4\" type=\"video\/mp4\"><\/source><\/video><figcaption class=\"text-darken text-low-up mt-4\" itemprop=\"caption\">Credit: Cornell University<\/figcaption><\/figure>\n<p>&#8220;Our work is like translating French to English\u2014we&#8217;re translating any given task from human to robot,&#8221; said senior author Sanjiban Choudhury, assistant professor of computer science.<\/p>\n<p>This translation task still faces a broader challenge, however: Humans move too fluidly for a robot to track and mimic, and training robots with video requires gobs of it. Further, video demonstrations\u2014of, say, picking up a napkin or stacking dinner plates\u2014must be performed slowly and flawlessly, since any mismatch in actions between the video and the robot has historically spelled doom for robot learning, the researchers said.<\/p>\n<p>                                                                                                        <!-- TechX - News - In-article --><\/p>\n<p>&#8220;If a human moves in a way that&#8217;s any different from how a robot moves, the method im<a href=\"https:\/\/buradabiliyorum.com\/en\/category\/social-mediaa\/\" data-internallinksmanager029f6b8e52c=\"1\" title=\"Social Media\" target=\"_blank\" rel=\"noopener\">media<\/a>tely falls apart,&#8221; Choudhury said. &#8220;Our thinking was, &#8216;Can we find a principled way to deal with this mismatch between how humans and robots do tasks?'&#8221;<\/p>\n<p>RHyME is the team&#8217;s answer\u2014a scalable approach that makes robots less finicky and more adaptive. It supercharges a robotic system to use its own memory and connect the dots when performing tasks it has viewed only once by drawing on videos it has seen. For example, a RHyME-equipped robot shown a video of a human fetching a mug from the counter and placing it in a nearby sink will comb its bank of videos and draw inspiration from similar actions\u2014like grasping a cup and lowering a utensil.<\/p>\n<p>RHyME paves the way for robots to learn multiple-step sequences while significantly lowering the amount of robot data needed for training, the researchers said. RHyME requires just 30 minutes of robot data; in a lab setting, robots trained using the system achieved a more than 50% increase in task success compared to previous methods, the researchers said.<\/p>\n<p>&#8220;This work is a departure from how robots are programmed today. The status quo of programming robots is thousands of hours of tele-operation to teach the robot how to do tasks. That&#8217;s just impossible,&#8221; Choudhury said. &#8220;With RHyME, we&#8217;re moving away from that and learning to train robots in a more scalable way.&#8221;<\/p>\n<p>Along with Kedia and Choudhury, the paper&#8217;s authors are Prithwish Dan, Angela Chao, and Maximus Pace.<\/p>\n<div class=\"article-main__more p-4\">\n<p><strong>More information:<\/strong><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\tKushal Kedia et al, One-Shot Imitation under Mismatched Execution, <i>arXiv<\/i> (2024). <a rel=\"nofollow\" target=\"_blank\" data-doi=\"1\" href=\"https:\/\/dx.doi.org\/10.48550\/arxiv.2409.06615\" target=\"_blank\">DOI: 10.48550\/arxiv.2409.06615<\/a><\/p>\n<div class=\"mt-3\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t<strong>Journal information:<\/strong><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<cite>arXiv<\/cite><br \/>\n                                                        <a rel=\"nofollow\" target=\"_blank\" class=\"icon_open\" href=\"http:\/\/arxiv.org\/\" target=\"_blank\" rel=\"nofollow\"><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<svg>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<use href=\"https:\/\/techx.b-cdn.net\/tmpl\/v2\/img\/svg\/sprite.svg#icon_open\" x=\"0\" y=\"0\"\/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/svg><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/a>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n<\/p><\/div>\n<div class=\"d-inline-block text-medium my-4\">\n                                                Provided by<br \/>\n                                                                                                    Cornell University<br \/>\n                                                    \t\t\t\t\t\t\t\t\t\t\t\t\t<a rel=\"nofollow\" target=\"_blank\" class=\"icon_open\" href=\"http:\/\/www.cornell.edu\/\" target=\"_blank\" rel=\"nofollow\"><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t<svg>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<use href=\"https:\/\/techx.b-cdn.net\/tmpl\/v2\/img\/svg\/sprite.svg#icon_open\" x=\"0\" y=\"0\"\/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/svg><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/a><\/p><\/div>\n<p>                                        <!-- print only --><\/p>\n<div class=\"d-none d-print-block\">\n<p>\n                                                <strong>Citation<\/strong>:<br \/>\n                                                Robot see, robot do: System learns after watching how-to videos (2025, April 22)<br \/>\n                                                retrieved 22 April 2025<br \/>\n                                                from https:\/\/techxplore.com\/<a href=\"https:\/\/buradabiliyorum.com\/en\/category\/news\/\" data-internallinksmanager029f6b8e52c=\"2\" title=\"News\" target=\"_blank\" rel=\"noopener\">news<\/a>\/2025-04-robot-videos.html\n                                            <\/p>\n<p>\n                                            This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no<br \/>\n                                            part may be reproduced without the written permission. The content is provided for information purposes only.\n                                            <\/p>\n<\/p><\/div>\n<\/p><\/div>\n<p><script id=\"facebook-jssdk\" async=\"\" src=\"https:\/\/connect.facebook.net\/en_US\/sdk.js\"><\/script><\/p>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMN63nwsw68G3Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more Like this articles, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/category\/sciencee\/\" target=\"_blank\" >Science category.<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techxplore.com\/news\/2025-04-robot-videos.html\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Kushal Kedia (left), a doctoral student in the field of computer science, and Prithwish Dan are members of the development team behind RHyME, a system that allows robots to learn tasks by watching a single how-to video. Credit: Louis DiPietro Cornell University researchers have developed a new robotic framework powered by artificial intelligence\u2014called RHyME (Retrieval&#8230;<\/p>\n","protected":false},"author":1,"featured_media":664555,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/scx2.b-cdn.net\/gfx\/news\/2025\/robot-see-robot-do-sys.jpg","fifu_image_alt":"","footnotes":""},"categories":[16],"tags":[],"class_list":["post-664554","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-sciencee"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/664554","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=664554"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/664554\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/664555"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=664554"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=664554"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=664554"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}