{"id":563238,"date":"2023-03-14T00:12:49","date_gmt":"2023-03-13T21:12:49","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/a-new-and-better-way-to-create-word-lists\/"},"modified":"2023-03-14T00:12:49","modified_gmt":"2023-03-13T21:12:49","slug":"a-new-and-better-way-to-create-word-lists","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/a-new-and-better-way-to-create-word-lists\/","title":{"rendered":"#A new and better way to create word lists"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_85 counter-hierarchy ez-toc-counter ez-toc-custom ez-toc-container-direction\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-6a3843de5b3a0\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #dd3333;color:#dd3333\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #dd3333;color:#dd3333\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-6a3843de5b3a0\" checked aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-1'><a class=\"ez-toc-link ez-toc-heading-1\" 
href=\"https:\/\/buradabiliyorum.com\/en\/a-new-and-better-way-to-create-word-lists\/#%E2%80%9CA_new_and_better_way_to_create_word_lists%E2%80%9D\" >&#8220;A new and better way to create word lists&#8221;<\/a><ul class='ez-toc-list-level-2' ><li class='ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/buradabiliyorum.com\/en\/a-new-and-better-way-to-create-word-lists\/#A_problem_that_concerns_many\" >A problem that concerns many<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/buradabiliyorum.com\/en\/a-new-and-better-way-to-create-word-lists\/#Improved_performance\" >Improved performance<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/buradabiliyorum.com\/en\/a-new-and-better-way-to-create-word-lists\/#Independence_from_the_language_itself\" >Independence from the language itself<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/buradabiliyorum.com\/en\/a-new-and-better-way-to-create-word-lists\/#Important_for_new_topics_like_COVID\" >Important for new topics like COVID<\/a><\/li><\/ul><\/li><\/ul><\/nav><\/div>\n<h1><span class=\"ez-toc-section\" id=\"%E2%80%9CA_new_and_better_way_to_create_word_lists%E2%80%9D\"><\/span>&#8220;A new and better way to create word lists&#8221;<span class=\"ez-toc-section-end\"><\/span><\/h1>\n<div>\n<div class=\"article-gallery lightGallery\">\n<div data-thumb=\"https:\/\/scx1.b-cdn.net\/csz\/news\/tmb\/2023\/a-new-and-better-way-t.jpg\" data-src=\"https:\/\/scx2.b-cdn.net\/gfx\/news\/hires\/2023\/a-new-and-better-way-t.jpg\" data-sub-html=\"A short list of seed words (red, on the left) is expanded into a longer word list (green, on the right) by mapping the seed words onto a colexification network and retrieving the neighboring nodes. 
Credit: Complexity Science Hub\">\n<figure class=\"article-img\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/scx1.b-cdn.net\/csz\/news\/800a\/2023\/a-new-and-better-way-t.jpg\" alt=\"A new and better way to create word lists\" title=\"A short list of seed words (red, on the left) is expanded into a longer word list (green, on the right) by mapping the seed words onto a colexification network and retrieving the neighboring nodes. Credit: Complexity Science Hub\" width=\"800\" height=\"530\"\/><figcaption class=\"text-darken text-low-up text-truncate-js text-truncate mt-3\">\n                A short list of seed words (red, on the left) is expanded into a longer word list (green, on the right) by mapping the seed words onto a colexification network and retrieving the neighboring nodes. Credit: Complexity Science Hub<br \/>\n            <\/figcaption><\/figure>\n<\/div>\n<\/div>\n<p>Word lists underpin research in many fields. Researchers at the Complexity Science Hub have now developed an algorithm that can be applied to different languages and expands word lists significantly better than existing tools.<\/p>\n<p>Many projects start with the creation of a word list, not only in companies when mind maps are created, but also in all areas of research. Imagine you want to find out on which days people are in a particularly good mood by analyzing Twitter posts. 
Just looking for the word &#8220;happy&#8221; wouldn&#8217;t be enough.<\/p>\n<p>Instead, you would have to use an algorithm that detects all tweets that indicate that someone is happy. &#8220;So the first step is to create a list of all the words that indicate just that. The whole research stands or falls on doing so,&#8221; explains Anna Di Natale, a researcher at the Complexity Science Hub in Vienna. But how do you come up with the most accurate, complete word lists possible?\n<\/p>\n<h2><span class=\"ez-toc-section\" id=\"A_problem_that_concerns_many\"><\/span>A problem that concerns many<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>This widespread problem concerns not only opinion researchers who want to find out how politicians&#8217; statements are received by the public. Companies, too, want to find out through sentiment analysis how their products are perceived.<\/p>\n<p>To improve things, Di Natale has now developed a new method, called LEXpander, that outperforms previous algorithms in two languages, German and English. Moreover, for the first time, she has developed a way to compare different tools at all.\n<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Improved_performance\"><\/span>Improved performance<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>In comparison with four other algorithms for word list expansion (WordNet, Empath 2.0, FastText and GloVe), LEXpander performed significantly better, especially in German. For example, the researchers found that LEXpander gets 43% of words right when expanding an English word list for positive meaning. FastText, a popular existing model, is right only 28% of the time.\n<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Independence_from_the_language_itself\"><\/span>Independence from the language itself<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The reason is that the tool is language-independent. 
It is not based on one language, but on a so-called colexification network. This recognized linguistic concept rests on homonymy and polysemy: single words that have two or more distinct meanings. For example, the ancient Greek word \u03c6\u03ac\u03c1\u03bc\u03b1\u03ba\u03bf\u03bd (pharmacon) can mean medicine or poison. The two meanings differ but are thematically close. Other cases suggest no such kinship, such as &#8220;bank&#8221;, which can mean a financial institution or the land alongside a river.<\/p>\n<p>&#8220;If you collect them across many languages\u2014and here we analyzed about 19 different languages\u2014you can see connections between them,&#8221; Di Natale says. The network is formed when these colexifications occur in several languages across different language families, creating connections.<\/p>\n<p>This independence from the language itself allows LEXpander to achieve better results in different languages. &#8220;There are many methods developed for English. They work very well and quickly and everyone uses them. Trying to apply them to other languages works, but not as well as it might work if you had started developing a method for German or Italian,&#8221; Di Natale explains.\n<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Important_for_new_topics_like_COVID\"><\/span>Important for new topics like COVID<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>For many topics there are already good word lists. But for new topics, such as COVID, new ones must be created. Until now, they were usually created by hand in brainstorming sessions among colleagues, with several tools used to help, and there was no way to compare those tools.<\/p>\n<p>Anna Di Natale and her team have now made such comparisons possible and have also developed a new tool that performs better than the others. 
This can be an important cornerstone for many future research projects in various fields.<\/p>\n<div class=\"article-main__more p-4\">\n<p><strong>More information:<\/strong><br \/>\nAnna Di Natale et al, LEXpander: Applying colexification networks to automated lexicon expansion, <i>Behavior Research Methods<\/i> (2023). <a rel=\"nofollow noopener\" target=\"_blank\" data-doi=\"1\" href=\"https:\/\/dx.doi.org\/10.3758\/s13428-023-02063-y\">DOI: 10.3758\/s13428-023-02063-y<\/a><\/p>\n<\/div>\n<div class=\"d-inline-block text-medium my-4\">\nProvided by<br \/>\nComplexity Science Hub Vienna<br \/>\n<a rel=\"nofollow noopener\" target=\"_blank\" class=\"icon_open\" href=\"https:\/\/www.csh.ac.at\/\"><br \/>\n<svg><use href=\"https:\/\/techx.b-cdn.net\/tmpl\/v2\/img\/svg\/sprite.svg#icon_open\" x=\"0\" y=\"0\"\/><\/svg><\/a><\/p><\/div>\n<p><!-- print only --><\/p>\n<div class=\"d-none d-print-block\">\n<p>\n<strong>Citation<\/strong>:<br \/>\nA new and better way to create word lists (2023, March 13)<br \/>\nretrieved 13 March 2023<br \/>\nfrom https:\/\/techxplore.com\/news\/2023-03-word.html<\/p>\n<p>
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.<\/p><\/div>\n<\/p><\/div>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techxplore.com\/news\/2023-03-word.html\" target=\"_blank\" rel=\"noopener\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&#8220;A new and better way to create word lists&#8221; A short list of seed words (red, on the left) is expanded into a longer word list (green, on the right) by mapping the seed words onto a colexification network and retrieving the 
neighboring nodes. Credit: Complexity Science Hub Word lists are the basis of so&#8230;<\/p>\n","protected":false},"author":1,"featured_media":563239,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/scx2.b-cdn.net\/gfx\/news\/hires\/2023\/a-new-and-better-way-t.jpg","fifu_image_alt":"","footnotes":""},"categories":[16],"tags":[],"class_list":["post-563238","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-sciencee"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/563238","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=563238"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/563238\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/563239"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=563238"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=563238"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=563238"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}