{"id":671989,"date":"2025-05-28T13:15:19","date_gmt":"2025-05-28T10:15:19","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/multimodality-as-the-next-big-leap-for-ai\/"},"modified":"2025-05-28T13:15:19","modified_gmt":"2025-05-28T10:15:19","slug":"multimodality-as-the-next-big-leap-for-ai","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/multimodality-as-the-next-big-leap-for-ai\/","title":{"rendered":"Multimodality as the next big leap for AI"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_84 counter-hierarchy ez-toc-counter ez-toc-custom ez-toc-container-direction\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-6a29e92ff2ed3\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #dd3333;color:#dd3333\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #dd3333;color:#dd3333\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-6a29e92ff2ed3\" checked aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/buradabiliyorum.com\/en\/multimodality-as-the-next-big-leap-for-ai\/#We_spoke_two_years_ago_when_ChatGPT_became_public_Looking_back_would_you_say_this_was_the_beginning_of_a_new_era\" >We spoke two years ago, when ChatGPT became public. Looking back, would you say this was the beginning of a new era?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/buradabiliyorum.com\/en\/multimodality-as-the-next-big-leap-for-ai\/#Competitors_were_quite_quick_to_launch_their_own_solutions_Was_OpenAI_really_a_precursor\" >Competitors were quite quick to launch their own solutions. Was OpenAI really a precursor?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/buradabiliyorum.com\/en\/multimodality-as-the-next-big-leap-for-ai\/#What_about_DeepSeek_launched_in_late_2024_Is_it_that_different_from_other_models\" >What about DeepSeek, launched in late 2024? Is it that different from other models?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/buradabiliyorum.com\/en\/multimodality-as-the-next-big-leap-for-ai\/#We_see_a_massive_race_to_invest_in_AI_The_US_announced_500_billion_dollars_Europe_mentioned_200_billion_euros_Is_it_really_worth_spending_that_much_money\" >We see a massive race to invest in AI: The US announced 500 billion dollars, Europe mentioned 200 billion euros. Is it really worth spending that much money?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/buradabiliyorum.com\/en\/multimodality-as-the-next-big-leap-for-ai\/#What_about_the_place_of_Switzerland_in_all_this\" >What about the place of Switzerland in all this?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/buradabiliyorum.com\/en\/multimodality-as-the-next-big-leap-for-ai\/#Lets_get_back_to_how_large_models_work_Is_there_a_risk_that_the_pollution_of_training_data%E2%80%94particularly_by_data_generated_by_AI_itself%E2%80%94will_impair_its_quality\" >Let&#8217;s get back to how large models work. Is there a risk that the pollution of training data\u2014particularly by data generated by AI itself\u2014will impair its quality?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/buradabiliyorum.com\/en\/multimodality-as-the-next-big-leap-for-ai\/#In_which_field_do_you_foresee_generative_AI_playing_a_major_role\" >In which field do you foresee generative AI playing a major role?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-8\" href=\"https:\/\/buradabiliyorum.com\/en\/multimodality-as-the-next-big-leap-for-ai\/#So_far_weve_observed_a_technological_leap_every_two_to_three_years_Whats_next\" >So far, we&#8217;ve observed a technological leap every two to three years. What&#8217;s next?<\/a><\/li><\/ul><\/nav><\/div>\n<div>\n<div class=\"article-gallery lightGallery\">\n<div data-thumb=\"https:\/\/scx1.b-cdn.net\/csz\/news\/tmb\/2025\/ai-the-next-big-leap-w.jpg\" data-src=\"https:\/\/scx2.b-cdn.net\/gfx\/news\/2025\/ai-the-next-big-leap-w.jpg\" data-sub-html=\"Antoine Bosselut. Credit: EPFL\/Alain Herzog\">\n<figure class=\"article-img\">\n            <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/scx1.b-cdn.net\/csz\/news\/800a\/2025\/ai-the-next-big-leap-w.jpg\" alt=\"AI: &quot;The next big leap will deal with multimodality&quot;\" title=\"Antoine Bosselut. Credit: EPFL\/Alain Herzog\" width=\"800\" height=\"449\"\/><figcaption class=\"text-darken text-low-up text-truncate-js text-truncate mt-3\">\n                Antoine Bosselut. Credit: EPFL\/Alain Herzog<br \/>\n            <\/figcaption><\/figure>\n<\/p><\/div>\n<\/div>\n<p>As the head of the Natural Language Processing Laboratory at EPFL, Antoine Bosselut keeps a close eye on the development of generative artificial intelligence tools such as ChatGPT. He looks back at their evolution over the past two years and suggests some avenues for the future.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"We_spoke_two_years_ago_when_ChatGPT_became_public_Looking_back_would_you_say_this_was_the_beginning_of_a_new_era\"><\/span>We spoke two years ago, when ChatGPT became public. Looking back, would you say this was the beginning of a new era?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Yes, I think there was indeed a &#8220;ChatGPT moment&#8221; that changed the paradigm of AI in two ways. First, from a technical point of view: we went from task-based to instruction-based systems, or what is known as generative AI. Before that ChatGPT moment, individual AI systems were trained to perform very specific tasks.<\/p>\n<p>ChatGPT was a <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/game\/\" data-internallinksmanager029f6b8e52c=\"7\" title=\"Game\" target=\"_blank\" rel=\"noopener\">game<\/a>-changer, as you could convert a multitude of instructions into various outputs representing a given task, all based on an enormous amount of data used to train the system. That technical shift created a perceptual shift as well. With that instruction-based AI, anybody can use such systems, and the <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/general\/\" data-internallinksmanager029f6b8e52c=\"3\" title=\"General\" target=\"_blank\" rel=\"noopener\">general<\/a> public understood that AI could be integrated into various aspects of their daily lives.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Competitors_were_quite_quick_to_launch_their_own_solutions_Was_OpenAI_really_a_precursor\"><\/span>Competitors were quite quick to launch their own solutions. Was OpenAI really a precursor?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>A lot of companies were already working on similar <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/download-scripts-themes-apps\/\" data-internallinksmanager029f6b8e52c=\"9\" title=\"Download Scripts &amp; Themes &amp; Apps\" target=\"_blank\" rel=\"noopener\">app<\/a>roaches. Anthropic, which launched Claude, was founded a year before ChatGPT came out, by a group of ex-OpenAI engineers. Google had for many years been working on instruction-learning models as well.<\/p>\n<p>The OpenAI release was a step up from what anybody else had done, but the real change was that they managed to put the <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/technology\/\" data-internallinksmanager029f6b8e52c=\"4\" title=\"Technology\" target=\"_blank\" rel=\"noopener\">technology<\/a> into a product. This changed user perception on the maturity of this technology, which forced a shift of focus from all the big tech actors.<\/p>\n<p>                                                                                                        <!-- TechX - News - In-article --><\/p>\n<h2><span class=\"ez-toc-section\" id=\"What_about_DeepSeek_launched_in_late_2024_Is_it_that_different_from_other_models\"><\/span>What about DeepSeek, launched in late 2024? Is it that different from other models?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>It&#8217;s too soon to say whether it is a similar jump to what we saw two years ago. A lot of the excitement around DeepSeek is based around the cost, not necessarily novel capabilities. The truth is, we still don&#8217;t really know much about that model itself. The price tag they announced is based on the final training round. We don&#8217;t know the cost of the pre-trained model.<\/p>\n<p>Saying it&#8217;s &#8220;open-source&#8221; would be a stretch. One can use its code to integrate it into other applications and develop it further, but we don&#8217;t really know what its foundations are since there&#8217;s little information around the training data. You don&#8217;t know what you&#8217;re building on top of.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"We_see_a_massive_race_to_invest_in_AI_The_US_announced_500_billion_dollars_Europe_mentioned_200_billion_euros_Is_it_really_worth_spending_that_much_money\"><\/span>We see a massive race to invest in AI: The US announced 500 billion dollars, Europe mentioned 200 billion euros. Is it really worth spending that much money?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>You&#8217;re going to spend this money anyway; the question is, who gets it? AI is not going anywhere and will continue to grow as a technology that people use every day. If Europe fails to develop convincing generative AI solutions, users will turn to U.S. or Chinese services, with all the risks this entails around sovereignty.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"What_about_the_place_of_Switzerland_in_all_this\"><\/span>What about the place of Switzerland in all this?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Both EPFL and ETH Zurich are excellent at training the next generation of specialists, developing solid theoretical knowledge and making it available to society at large, thus providing a trusted alternative to foreign tools. In that respect, this is exactly what the Swiss AI Initiative and the Swiss National AI Institute were created to do\u2014train the younger generation of engineers and scientists, make them available to society.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Lets_get_back_to_how_large_models_work_Is_there_a_risk_that_the_pollution_of_training_data%E2%80%94particularly_by_data_generated_by_AI_itself%E2%80%94will_impair_its_quality\"><\/span>Let&#8217;s get back to how large models work. Is there a risk that the pollution of training data\u2014particularly by data generated by AI itself\u2014will impair its quality?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>There is a theoretical risk. But paradoxically, thanks to the filters and cleaning of results that are being developed in parallel, the synthetic data that serve as sources are rather of very high quality. Conversely, a lot of unfiltered content generated by humans can be false or biased. Therefore, it&#8217;s hard to say whether this fear is justified.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"In_which_field_do_you_foresee_generative_AI_playing_a_major_role\"><\/span>In which field do you foresee generative AI playing a major role?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>It might be easier to think about the fields in which AI won&#8217;t play any role \u2026 There are fields\u2014health, national security, confidential information\u2014in which data is sensitive, so we can&#8217;t just easily transfer it to the servers where generative AI systems are hosted. Trust towards these systems and their owners will remain a question mark for many years.<\/p>\n<p>                                                                                                        <!-- TechX - News - In-article --><\/p>\n<h2><span class=\"ez-toc-section\" id=\"So_far_weve_observed_a_technological_leap_every_two_to_three_years_Whats_next\"><\/span>So far, we&#8217;ve observed a technological leap every two to three years. What&#8217;s next?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Despite the ever-accelerating capabilities of these models, they remain fundamentally text-based. In concrete terms, everything today is based on a vocabulary of around 50,000 words. This may be enough to give human users the impression that the machine is capable of reasoning. But human reasoning is far more complex and uses other perception modes too\u2014sounds, images or even smells.<\/p>\n<p>I think the next big evolution will come when models are also able to directly integrate other types of content, such as images, sounds and videos. This &#8220;multimodal AI&#8221; will then come even closer to artificial &#8220;thinking&#8221;\u2014even if its definition remains more philosophical than technical.<\/p>\n<div class=\"d-inline-block text-medium my-4\">\n                                                Provided by<br \/>\n                                                                                                    Ecole Polytechnique Federale de Lausanne<br \/>\n                                                    \t\t\t\t\t\t\t\t\t\t\t\t\t<a rel=\"nofollow\" target=\"_blank\" class=\"icon_open\" href=\"http:\/\/www.epfl.ch\/\" target=\"_blank\" rel=\"nofollow\"><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t<svg>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<use href=\"https:\/\/techx.b-cdn.net\/tmpl\/v2\/img\/svg\/sprite.svg#icon_open\" x=\"0\" y=\"0\"\/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/svg><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/a><\/p><\/div>\n<p>                                        <!-- print only --><\/p>\n<div class=\"d-none d-print-block\">\n<p>\n                                                <strong>Citation<\/strong>:<br \/>\n                                                Q&amp;A: Multimodality as the next big leap for AI (2025, May 27)<br \/>\n                                                retrieved 28 May 2025<br \/>\n                                                from https:\/\/techxplore.com\/<a href=\"https:\/\/buradabiliyorum.com\/en\/category\/news\/\" data-internallinksmanager029f6b8e52c=\"2\" title=\"News\" target=\"_blank\" rel=\"noopener\">news<\/a>\/2025-05-qa-multimodality-big-ai.html\n                                            <\/p>\n<p>\n                                            This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no<br \/>\n                                            part may be reproduced without the written permission. The content is provided for information purposes only.\n                                            <\/p>\n<\/p><\/div>\n<\/p><\/div>\n<p><script id=\"facebook-jssdk\" async=\"\" src=\"https:\/\/connect.facebook.net\/en_US\/sdk.js\"><\/script><\/p>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMN63nwsw68G3Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more Like this articles, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/category\/sciencee\/\" target=\"_blank\" >Science category.<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techxplore.com\/news\/2025-05-qa-multimodality-big-ai.html\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Antoine Bosselut. Credit: EPFL\/Alain Herzog As the head of the Natural Language Processing Laboratory at EPFL, Antoine Bosselut keeps a close eye on the development of generative artificial intelligence tools such as ChatGPT. He looks back at their evolution over the past two years and suggests some avenues for the future. We spoke two years&#8230;<\/p>\n","protected":false},"author":1,"featured_media":671990,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/scx2.b-cdn.net\/gfx\/news\/2025\/ai-the-next-big-leap-w.jpg","fifu_image_alt":"","footnotes":""},"categories":[16],"tags":[],"class_list":["post-671989","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-sciencee"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/671989","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=671989"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/671989\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/671990"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=671989"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=671989"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=671989"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}