{"id":130038,"date":"2020-12-09T22:00:02","date_gmt":"2020-12-09T19:00:02","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/new-study-tests-machine-learning-on-detection-of-borrowed-words-in-world-languages\/"},"modified":"2020-12-09T22:00:02","modified_gmt":"2020-12-09T19:00:02","slug":"new-study-tests-machine-learning-on-detection-of-borrowed-words-in-world-languages","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/new-study-tests-machine-learning-on-detection-of-borrowed-words-in-world-languages\/","title":{"rendered":"#New study tests machine learning on detection of borrowed words in world languages"},"content":{"rendered":"<p>&#8220;<strong>#New study tests machine learning on detection of borrowed words in world languages<\/strong>&#8221;<\/p>\n<div>\n<div class=\"article-gallery lightGallery\">\n<div data-thumb=\"https:\/\/scx1.b-cdn.net\/csz\/news\/tmb\/2020\/newstudytest.jpg\" data-src=\"https:\/\/scx2.b-cdn.net\/gfx\/news\/hires\/2020\/newstudytest.jpg\" data-sub-html=\"Lexical borrowing is very widespread and may affect even those words that play an important role in our daily life. English 'mountain', for example, was borrowed from Old French, along with many other words. Credit: Johann-Mattis List, Hans Sell\">\n<figure class=\"article-img\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/scx1.b-cdn.net\/csz\/news\/800\/2020\/newstudytest.jpg\" alt=\"New study tests machine learning on detection of borrowed words in world languages\" title=\"Lexical borrowing is very widespread and may affect even those words that play an important role in our daily life. English 'mountain', for example, was borrowed from Old French, along with many other words. Credit: Johann-Mattis List, Hans Sell\" width=\"800\" height=\"480\"\/><figcaption class=\"text-darken text-low-up text-truncate-js text-truncate mt-3\">\n                Lexical borrowing is very widespread and may affect even those words that play an important role in our daily life. English &#8216;mountain&#8217;, for example, was borrowed from Old French, along with many other words. Credit: Johann-Mattis List, Hans Sell<br \/>\n            <\/figcaption><\/figure>\n<\/div>\n<\/div>\n<p>Researchers from the Pontificia Universidad Cat\u00f3lica del Per\u00fa and the Max Planck Institute for the <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/sciencee\/\" data-internallinksmanager029f6b8e52c=\"5\" title=\"Science\" target=\"_blank\" rel=\"noopener\">Science<\/a> of Human History have investigated the ability of machine learning algorithms to identify lexical borrowings using word lists from a single language. Results published in the journal <i>PLOS ONE<\/i> show that current machine-learning methods alone are insufficient for borrowing detection, confirming that additional data and expert knowledge are needed to tackle one of historical linguistics&#8217; most pressing challenges.<\/p>\n<p>                                                                                Lexical borrowing, or the direct transfer of words from one language to another, has interested scholars for millennia, as evidenced in Plato&#8217;s Kratylos dialog, in which Socrates discusses the challenge imposed by borrowed words on etymological studies. In historical linguistics, lexical borrowings help researchers trace the evolution of modern languages and indicate cultural contact between distinct linguistic groups\u2014whether recent or ancient. However, the techniques for identifying borrowed words have resisted formalization, demanding that researchers rely on a variety of proxy information and the comparison of multiple languages.<\/p>\n<p>&#8220;The automated detection of lexical borrowings is still one of the most difficult tasks we face in computational historical linguistics,&#8221; says Johann-Mattis List, who led the study.<\/p>\n<p>In the current study, researchers from PUCP and MPI-SHH employed different machine learning techniques to train language models that mimic the way in which linguists identify borrowings when considering only the evidence provided by a single language: if sounds or the ways in which sounds combine to form words are atypical when comparing them with other words in the same language, this often hints to recent borrowings. The models were then <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/download-scripts-themes-apps\/\" data-internallinksmanager029f6b8e52c=\"9\" title=\"Download Scripts &amp; Themes &amp; Apps\" target=\"_blank\" rel=\"noopener\">app<\/a>lied to a modified version of the World Loanword Database, a catalog of borrowing information for a sample of 40 languages from different language families all over the world, in order to see how accurately words within a given language would be classified as borrowed or not by the different techniques.<\/p>\n<p>In many cases the results were unsatisfying, suggesting that loanword detection is too difficult for machine learning methods most commonly used. However, in specific situations, such as in lists with a high proportion of loanwords or in languages whose loanwords come primarily from a single donor language, the teams&#8217; lexical language models showed some promise.<\/p>\n<div class=\"article-gallery lightGallery\">\n<div data-thumb=\"https:\/\/scx1.b-cdn.net\/csz\/news\/tmb\/2020\/1-newstudytest.jpg\" data-src=\"https:\/\/scx2.b-cdn.net\/gfx\/news\/hires\/2020\/1-newstudytest.jpg\" data-sub-html=\"Comparison of the lexical language models used in the study. Credit: Miller et al., 2020\">\n<figure class=\"article-img text-center\"><img decoding=\"async\" src=\"https:\/\/scx1.b-cdn.net\/csz\/news\/800\/2020\/1-newstudytest.jpg\" alt=\"New study tests machine learning on detection of borrowed words in world languages\"\/><figcaption class=\"text-left text-darken text-truncate text-low-up mt-3\">\n                Comparison of the lexical language models used in the study. Credit: Miller et al., 2020<br \/>\n            <\/figcaption><\/figure>\n<\/div>\n<\/div>\n<p>&#8220;After these first experiments with monolingual lexical borrowings, we can proceed to stake out other aspects of the problem, moving into multilingual and cross-linguistic approaches,&#8221; says John Miller of PUCP, the study&#8217;s co-lead author.<\/p>\n<p>&#8220;Our computer-assisted approach, along with the dataset we are releasing, will shed a new light on the importance of computer-assisted methods for language comparison and historical linguistics,&#8221; adds Tiago Tresoldi, the study&#8217;s other co-lead author from MPI-SHH.<\/p>\n<p>The study joins ongoing efforts to tackle one of the most challenging problems in historical linguistics, showing that loanword detection cannot rely on mono-lingual information alone. In the future, the authors hope to develop better-integrated approaches that take multi-lingual information into account.\n                                                                                                                        <\/p>\n<hr\/>\n<div class=\"article-main__explore my-4 d-print-none\">\n<p>                                            <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" class=\"text-medium text-info mt-2 d-inline-block\" href=\"https:\/\/phys.org\/news\/2020-02-bilingual-mash-ups-counterintuitive-sociolinguistics.html\">Bilingual mash ups: Counterintuitive findings from sociolinguistics<\/a>\n                                        <\/div>\n<hr class=\"mb-4\"\/>\n<div class=\"article-main__more p-4\">\n                                                                                                <strong>More information:<\/strong><br \/>\n                                                <i>PLOS ONE<\/i> (2020). <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" data-doi=\"1\" href=\"http:\/\/dx.doi.org\/10.1371\/journal.pone.0242709\">DOI: 10.1371\/journal.pone.0242709<\/a><\/p><\/div>\n<div class=\"d-inline-block text-medium my-4\">\n                                                Provided by<br \/>\n                                                                                                    Max Planck Society<br \/>\n                                                                                                        <a rel=\"nofollow noopener noreferrer\" target=\"_blank\" class=\"icon_open\" href=\"http:\/\/www.mpg.de\/english\/portal\/index.html\"><br \/>\n                                                        <svg><use href=\"https:\/\/techx.b-cdn.net\/tmpl\/v2\/img\/svg\/sprite.svg#icon_open\" x=\"0\" y=\"0\"\/><\/svg><\/a><\/p><\/div>\n<p>                                        <!-- print only --><\/p>\n<div class=\"d-none d-print-block\">\n<p>                                                 <strong>Citation<\/strong>:<br \/>\n                                                 New study tests machine learning on detection of borrowed words in world languages (2020, December  9)<br \/>\n                                                 retrieved  9 December 2020<br \/>\n                                                 from https:\/\/techxplore.com\/<a href=\"https:\/\/buradabiliyorum.com\/en\/category\/news\/\" data-internallinksmanager029f6b8e52c=\"2\" title=\"News\" target=\"_blank\" rel=\"noopener\">news<\/a>\/2020-12-machine-words-world-languages.html<\/p>\n<p>                                            This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no<br \/>\n                                            part may be reproduced without the written permission. The content is provided for information purposes only.<\/p><\/div>\n<\/p><\/div>\n<p><script id=\"facebook-jssdk\" async=\"\" src=\"https:\/\/connect.facebook.net\/en_US\/sdk.js\"><\/script><\/p>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMLG0nwswvr63Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\">For forums sites go to <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/forum.buradabiliyorum.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">Forum.BuradaBiliyorum.Com<\/a><\/span><\/strong>\n<\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more Like this articles, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/science\/\" target=\"_blank\" rel=\"noopener noreferrer\">Science category.<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techxplore.com\/news\/2020-12-machine-words-world-languages.html\" target=\"_blank\" rel=\"noopener noreferrer\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&#8220;#New study tests machine learning on detection of borrowed words in world languages&#8221; Lexical borrowing is very widespread and may affect even those words that play an important role in our daily life. English &#8216;mountain&#8217;, for example, was borrowed from Old French, along with many other words. Credit: Johann-Mattis List, Hans Sell Researchers from the&#8230;<\/p>\n","protected":false},"author":1,"featured_media":130039,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/scx2.b-cdn.net\/gfx\/news\/hires\/2020\/newstudytest.jpg","fifu_image_alt":"","footnotes":""},"categories":[16],"tags":[],"class_list":["post-130038","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-sciencee"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/130038","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=130038"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/130038\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/130039"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=130038"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=130038"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=130038"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}