{"id":666769,"date":"2025-05-03T10:10:25","date_gmt":"2025-05-03T07:10:25","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/openai-pledges-to-make-changes-to-prevent-future-chatgpt-sycophancy\/"},"modified":"2025-05-03T10:10:25","modified_gmt":"2025-05-03T07:10:25","slug":"openai-pledges-to-make-changes-to-prevent-future-chatgpt-sycophancy","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/openai-pledges-to-make-changes-to-prevent-future-chatgpt-sycophancy\/","title":{"rendered":"OpenAI pledges to make changes to prevent future ChatGPT sycophancy"},"content":{"rendered":"<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">OpenAI <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/openai.com\/index\/expanding-on-sycophancy\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">says it\u2019ll make changes<\/a> to the way it updates the AI models that power ChatGPT, following an incident that caused the platform to become overly sycophantic for many users.<\/p>\n<p class=\"wp-block-paragraph\">Last weekend, after OpenAI rolled out a tweaked\u00a0GPT-4o \u2014 the default model powering ChatGPT \u2014 users on <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/social-mediaa\/\" data-internallinksmanager029f6b8e52c=\"1\" title=\"Social Media\" target=\"_blank\" rel=\"noopener\">social media<\/a> noted that ChatGPT began responding in an overly validating and agreeable way. It quickly became a meme. Users posted screenshots of ChatGPT <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/download-scripts-themes-apps\/\" data-internallinksmanager029f6b8e52c=\"9\" title=\"Download Scripts &amp; Themes &amp; Apps\" target=\"_blank\" rel=\"noopener\">app<\/a>lauding all sorts of problematic,\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/x.com\/fabianstelzer\/status\/1916372374091423984\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">dangerous<\/a>\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/x.com\/thinkbuildnext\/status\/1916250081579217243\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">decisions<\/a>\u00a0and\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/x.com\/ai_for_success\/status\/1916556522571604264\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">ideas<\/a>.<\/p>\n<p class=\"wp-block-paragraph\">In a post on X last Sunday, CEO Sam Altman\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/x.com\/sama\/status\/1916625892123742290\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">acknowledged<\/a>\u00a0the problem and said that OpenAI would work on fixes \u201cASAP.\u201d On Tuesday, Altman\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/x.com\/sama\/status\/1917291637962858735\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">announced<\/a>\u00a0the GPT-4o update was being rolled back and that OpenAI was working on \u201cadditional fixes\u201d to the model\u2019s personality.<\/p>\n<p class=\"wp-block-paragraph\">The company published a postmortem on Tuesday, and in a blog post Friday, OpenAI expanded on specific adjustments it plans to make to its model deployment process. <\/p>\n<p class=\"wp-block-paragraph\">OpenAI says it plans to introduce an opt-in \u201calpha phase\u201d for some models that would allow certain ChatGPT users to test the models and give feedback prior to launch. The company also says it\u2019ll include explanations of \u201cknown limitations\u201d for future incremental updates to models in ChatGPT, and adjust its safety review process to formally consider \u201cmodel behavior issues\u201d like personality, deception, reliability, and hallucination (i.e., when a model makes things up) as \u201claunch-blocking\u201d concerns.<\/p>\n<p class=\"wp-block-paragraph\">\u201cGoing forward, we\u2019ll proactively communicate about the updates we\u2019re making to the models in ChatGPT, whether \u2018subtle\u2019 or not,\u201d wrote OpenAI in the blog post. \u201cEven if these issues aren\u2019t perfectly quantifiable today, we commit to blocking launches based on proxy measurements or qualitative signals, even when metrics like A\/B testing look good.\u201d<\/p>\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\">\n<div class=\"wp-block-embed__wrapper\">\n<blockquote class=\"twitter-tweet\" data-width=\"500\" data-dnt=\"true\">\n<p lang=\"en\" dir=\"ltr\">we missed the mark with last week&#8217;s GPT-4o update.<\/p>\n<p>what happened, what we learned, and some things we will do differently in the future: <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/t.co\/ER1GmRYrIC\">https:\/\/t.co\/ER1GmRYrIC<\/a><\/p>\n<p>\u2014 Sam Altman (@sama) <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/twitter.com\/sama\/status\/1918330652325458387?ref_src=twsrc%5Etfw\">May 2, 2025<\/a><\/p><\/blockquote>\n<\/div>\n<\/figure>\n<p class=\"wp-block-paragraph\">The pledged fixes come as more people turn to ChatGPT for advice. <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/aijourn.com\/34-of-americans-trust-chatgpt-over-human-experts-but-not-for-legal-or-medical-advice\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">According to one recent survey<\/a> by lawsuit financier Express Legal Funding, 60% of U.S. adults have used ChatGPT to seek counsel or information. The growing reliance on ChatGPT \u2014 and the platform\u2019s enormous user base \u2014 raises the stakes when issues like extreme sycophancy emerge, not to mention hallucinations and other technical shortcomings.<\/p>\n<div class=\"wp-block-techcrunch-inline-cta\">\n<div class=\"inline-cta__wrapper\">\n<p>Techcrunch event<\/p>\n<div class=\"inline-cta__content\">\n<p>\n\t\t\t\t\t\t\t\t\t<span class=\"inline-cta__location\">Berkeley, CA<\/span><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"inline-cta__separator\">|<\/span><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"inline-cta__date\">June 5<\/span>\n\t\t\t\t\t\t\t<\/p>\n<p>\t\t\t\t\t<span>BOOK NOW<\/span><\/p><\/div>\n<\/p><\/div>\n<\/div>\n<p class=\"wp-block-paragraph\">As one mitigating step, earlier this week, OpenAI said it would experiment with ways to let users give \u201creal-time feedback\u201d to \u201cdirectly influence their interactions\u201d with ChatGPT. The company also said it would refine techniques to steer models away from sycophancy, potentially allow people to choose from multiple model personalities in ChatGPT, build additional safety guardrails, and expand evaluations to help identify issues beyond sycophancy.<\/p>\n<p class=\"wp-block-paragraph\">\u201cOne of the biggest lessons is fully recognizing how people have started to use ChatGPT for deeply personal advice \u2014 something we didn\u2019t see as much even a year ago,\u201d continued OpenAI in its blog post. \u201cAt the time, this wasn\u2019t a primary focus, but as AI and society have co-evolved, it\u2019s become clear that we need to treat this use case with great care. It\u2019s now going to be a more meaningful part of our safety work.\u201d<\/p>\n<\/div>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMN63nwsw68G3Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more like this article, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/category\/technology\/\" target=\"_blank\" >Technology<\/a><\/span> category.<\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techcrunch.com\/2025\/05\/02\/openai-pledges-to-make-changes-to-prevent-future-chatgpt-sycophancy\/\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>OpenAI says it\u2019ll make changes to the way it updates the AI models that power ChatGPT, following an incident that caused the platform to become overly sycophantic for many users. Last weekend, after OpenAI rolled out a tweaked\u00a0GPT-4o \u2014 the default model powering ChatGPT \u2014 users on social media noted that ChatGPT began responding in&#8230;<\/p>\n","protected":false},"author":1,"featured_media":666770,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/02\/GettyImages-2195918462.jpg?w=1024","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[77337,138467,141199],"class_list":["post-666769","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-ai","tag-chatgpt","tag-openai"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/666769","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=666769"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/666769\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/666770"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=666769"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=666769"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=666769"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}