{"id":621409,"date":"2024-05-22T13:30:01","date_gmt":"2024-05-22T10:30:01","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/a-method-to-mitigate-hallucinations-in-large-language-models\/"},"modified":"2024-05-22T13:30:01","modified_gmt":"2024-05-22T10:30:01","slug":"a-method-to-mitigate-hallucinations-in-large-language-models","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/a-method-to-mitigate-hallucinations-in-large-language-models\/","title":{"rendered":"#A method to mitigate hallucinations in large language models"},"content":{"rendered":"<div>\n<div class=\"article-gallery lightGallery\">\n<div data-thumb=\"https:\/\/scx1.b-cdn.net\/csz\/news\/tmb\/2024\/a-method-to-mitigate-h.jpg\" data-src=\"https:\/\/scx2.b-cdn.net\/gfx\/news\/hires\/2024\/a-method-to-mitigate-h.jpg\" data-sub-html=\"Abstention rates vs. average test losses on the Temporal Sequences dataset with \u03b1 = 0.05 (top) and \u03b1 = 0.05 (bottom) for score functions match count (m.c.), expected match count (e.m.c), and the log-probability (l.p.), and for various calibration methods (. denotes the baseline with no calibration). Box widths and heights represent 90% confidence intervals with Gaussian approximation over abstention rates and average test errors, respectively. The dashed horizontal line represents the target risk bound \u03b1. Credit: &lt;i&gt;arXiv&lt;\/i&gt; (2024). DOI: 10.48550\/arxiv.2405.01563\">\n<figure class=\"article-img\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/scx1.b-cdn.net\/csz\/news\/800a\/2024\/a-method-to-mitigate-h.jpg\" alt=\"A method to mitigate hallucinations in large language models\" title=\"Abstention rates vs. average test losses on the Temporal Sequences dataset with \u03b1 = 0.05 (top) and \u03b1 = 0.05 (bottom) for score functions match count (m.c.), expected match count (e.m.c), and the log-probability (l.p.), and for various calibration methods (. denotes the baseline with no calibration). Box widths and heights represent 90% confidence intervals with Gaussian approximation over abstention rates and average test errors, respectively. The dashed horizontal line represents the target risk bound \u03b1. Credit: arXiv (2024). DOI: 10.48550\/arxiv.2405.01563\" width=\"800\" height=\"519\"\/><figcaption class=\"text-darken text-low-up text-truncate-js text-truncate mt-3\">\n                Abstention rates vs. average test losses on the Temporal Sequences dataset with \u03b1 = 0.05 (top) and \u03b1 = 0.05 (bottom) for score functions match count (m.c.), expected match count (e.m.c), and the log-probability (l.p.), and for various calibration methods (. denotes the baseline with no calibration). Box widths and heights represent 90% confidence intervals with Gaussian <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/download-scripts-themes-apps\/\" data-internallinksmanager029f6b8e52c=\"9\" title=\"Download Scripts &amp; Themes &amp; Apps\" target=\"_blank\" rel=\"noopener\">app<\/a>roximation over abstention rates and average test errors, respectively. The dashed horizontal line represents the target risk bound \u03b1. Credit: <i>arXiv<\/i> (2024). DOI: 10.48550\/arxiv.2405.01563<br \/>\n            <\/figcaption><\/figure>\n<\/div>\n<\/div>\n<p>Large language models (LLMs), artificial neural networks-based architectures that can process, generate and manipulate texts in various human languages, have recently become increasingly widespread. These models are now being used in a wide range of settings, to rapidly find answers to queries, produce content for specific purposes and interpret complex texts.<\/p>\n<p>While recently introduced LLMs can generate highly convincing texts, which are in some cases difficult to discern from writings produced by humans, they have been found to be prone to so-called hallucinations. In this context, hallucinations refer to an LLM generating entirely incoherent, inaccurate or inappropriate responses.<\/p>\n<p>Researchers at DeepMind recently developed a new procedure that could help to identify instances in which LLM should refrain from responding to a query, for instance replying &#8220;I don&#8217;t know,&#8221; as they are likely to hallucinate non-sensical or incorrect answers. The team&#8217;s proposed approach, outlined in a paper <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/arxiv.org\/abs\/2405.01563\">pre-published<\/a> on <i>arXiv<\/i>, entails the use of LLMs to evaluate their own potential responses.<\/p>\n<p>&#8220;Building on earlier approaches that use self-consistency as a more reliable measure of model confidence, we propose using the LLM itself to self-evaluate the similarity between each of its sampled responses for a given query,&#8221; Yasin Abbasi Yadkori, Ilja Kuzborskij and their colleagues wrote in their paper. &#8220;We then further leverage conformal prediction techniques to develop an abstention procedure that benefits from rigorous theoretical guarantees on the hallucination rate (error rate).&#8221;<\/p>\n<p>                                                                                                        <!-- TechX - News - In-article --><\/p>\n<p>                                                                                                                                            Yadkori, Kuzborskij and their colleagues evaluated their proposed method to mitigate LLM hallucinations in a <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/watch-movies-tv-seriess\/\" data-internallinksmanager029f6b8e52c=\"8\" title=\"Watch Movies &amp; TV Series\" target=\"_blank\" rel=\"noopener\">series<\/a> of experiments, using Temporal Sequences and TriviaQA, two publicly available datasets containing queries and associated responses. They specifically applied their proposed method to Gemini Pro, an LLM developed at Google and released in 2023.<\/p>\n<p>&#8220;Experimentally, our resulting conformal abstention method reliably bounds the hallucination rate on various closed-book, open-domain generative question answering datasets, while also maintaining a significantly less conservative abstention rate on a dataset with long responses (Temporal Sequences) compared to baselines using log-probability scores to quantify uncertainty, while achieving comparable performance on a dataset with short answers (TriviaQA),&#8221; the researchers wrote.<\/p>\n<p>&#8220;To evaluate the experiments automatically, one needs to determine if two responses are equivalent given a question. Following standard practice, we use a thresholded similarity function to determine if two responses match, but also provide a method for calibrating the threshold based on conformal prediction, with theoretical guarantees on the accuracy of the match prediction, which might be of independent interest.&#8221;<\/p>\n<p>The results of this research team&#8217;s experiments suggest that their conformal calibration and similarity scoring procedure does mitigate LLM hallucinations, allowing a model to abstain from answering a question if their answer is likely to be non-sensical or untrustworthy. The newly proposed approach was found to outperform simple baseline scoring procedures.<\/p>\n<p>This recent study by Deep Mind could soon inform the development of similar procedures to improve the reliability of LLMs and prevent them from hallucinating. Collectively, these efforts will contribute to the advancement of these models, facilitating their widespread use among professionals worldwide.<\/p>\n<div class=\"article-main__more p-4\">\n                                                                                                <strong>More information:<\/strong><br \/>\n                                                Yasin Abbasi Yadkori et al, Mitigating LLM Hallucinations via Conformal Abstention, <i>arXiv<\/i> (2024). <a rel=\"nofollow noopener\" target=\"_blank\" data-doi=\"1\" href=\"https:\/\/dx.doi.org\/10.48550\/arxiv.2405.01563\">DOI: 10.48550\/arxiv.2405.01563<\/a><\/p>\n<div class=\"mt-3\">\n                                                    <strong>Journal information:<\/strong><br \/>\n                                                                                                            <cite>arXiv<\/cite><br \/>\n                                                        <a rel=\"nofollow noopener\" target=\"_blank\" class=\"icon_open\" href=\"http:\/\/arxiv.org\/\"><br \/>\n                                                            <svg><use href=\"https:\/\/techx.b-cdn.net\/tmpl\/v2\/img\/svg\/sprite.svg#icon_open\" x=\"0\" y=\"0\"\/><\/svg><\/a>\n                                                                                                    <\/div>\n<\/p><\/div>\n<p class=\"article-main__note mt-4\">\n                                                \u00a9 2024 <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/sciencee\/\" data-internallinksmanager029f6b8e52c=\"5\" title=\"Science\" target=\"_blank\" rel=\"noopener\">Science<\/a> X Network<\/p>\n<p>                                        <!-- print only --><\/p>\n<div class=\"d-none d-print-block\">\n<p>                                                <strong>Citation<\/strong>:<br \/>\n                                                A method to mitigate hallucinations in large language models (2024, May 22)<br \/>\n                                                retrieved 22 May 2024<br \/>\n                                                from https:\/\/techxplore.com\/<a href=\"https:\/\/buradabiliyorum.com\/en\/category\/news\/\" data-internallinksmanager029f6b8e52c=\"2\" title=\"News\" target=\"_blank\" rel=\"noopener\">news<\/a>\/2024-05-method-mitigate-hallucinations-large-language.html<\/p>\n<p>                                            This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no<br \/>\n                                            part may be reproduced without the written permission. The content is provided for information purposes only.<\/p><\/div>\n<\/p><\/div>\n<p><script id=\"facebook-jssdk\" async=\"\" src=\"https:\/\/connect.facebook.net\/en_US\/sdk.js\"><\/script><\/p>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMN63nwsw68G3Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more Like this articles, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/science\/\" target=\"_blank\" rel=\"noopener\">Science category.<\/a><\/span><\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techxplore.com\/news\/2024-05-method-mitigate-hallucinations-large-language.html\" target=\"_blank\" rel=\"noopener\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Abstention rates vs. average test losses on the Temporal Sequences dataset with \u03b1 = 0.05 (top) and \u03b1 = 0.05 (bottom) for score functions match count (m.c.), expected match count (e.m.c), and the log-probability (l.p.), and for various calibration methods (. denotes the baseline with no calibration). Box widths and heights represent 90% confidence intervals&#8230;<\/p>\n","protected":false},"author":1,"featured_media":621410,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/scx2.b-cdn.net\/gfx\/news\/hires\/2024\/a-method-to-mitigate-h.jpg","fifu_image_alt":"","footnotes":""},"categories":[16],"tags":[],"class_list":["post-621409","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-sciencee"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/621409","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=621409"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/621409\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/621410"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=621409"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=621409"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=621409"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}