{"id":657368,"date":"2025-03-16T13:40:28","date_gmt":"2025-03-16T10:40:28","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/bluesky-users-debate-plans-around-user-data-and-ai-training\/"},"modified":"2025-03-16T13:40:28","modified_gmt":"2025-03-16T10:40:28","slug":"bluesky-users-debate-plans-around-user-data-and-ai-training","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/bluesky-users-debate-plans-around-user-data-and-ai-training\/","title":{"rendered":"#Bluesky users debate plans around user data and AI training"},"content":{"rendered":"<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">Social network Bluesky recently <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/github.com\/bluesky-social\/proposals\/tree\/main\/0008-user-intents\">published a proposal on GitHub<\/a> outlining new options it could give users to indicate whether they want their posts and data to be scraped for things like generative AI training and public archiving.<\/p>\n<p class=\"wp-block-paragraph\">CEO Jay Graber discussed the proposal earlier this week, while on-stage at South by Southwest, but it attracted fresh attention on Friday night, after she <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/bsky.app\/profile\/jay.bsky.team\/post\/3lkens3n4w223\">posted about it on Bluesky<\/a>. Some users reacted with alarm to the company\u2019s plans, which they saw as a reversal of Bluesky\u2019s previous insistence that it won\u2019t sell user data to advertisers and won\u2019t train AI on user posts.<\/p>\n<p class=\"wp-block-paragraph\">\u201cOh, hell no!\u201d the user Sketchette <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/bsky.app\/profile\/sketchette.bsky.social\/post\/3lkeoa2k2sc2n\">wrote<\/a>. 
\u201cThe beauty of this platform was the NOT sharing of information. Especially gen AI. Don\u2019t you cave now.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Graber <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/bsky.app\/profile\/jay.bsky.team\/post\/3lkeojfh3u223\">replied<\/a> that generative AI companies are \u201calready scraping public data from across the web,\u201d including from Bluesky, since \u201ceverything on Bluesky is public like a website is public.\u201d She said Bluesky is therefore trying to create a \u201cnew standard\u201d to govern that scraping, similar to the <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/en.wikipedia.org\/wiki\/Robots.txt\">robots.txt<\/a> file that websites use to communicate their permissions to web crawlers.<\/p>\n<p class=\"wp-block-paragraph\">Debates about AI training and copyright have dragged robots.txt into the spotlight, highlighting among other things that it\u2019s not legally enforceable. 
Bluesky frames its proposed standard as one that would have a similar \u201cmechanism and expectations,\u201d providing \u201ca machine-readable format, which good actors are expected to abide, and does carry ethical weight, but is not legally enforceable.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Under the proposal, users of the Bluesky app, or other apps that use the underlying <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/atproto.com\/\">AT Protocol<\/a>, could go into their settings and allow or disallow the usage of their Bluesky data across four categories: generative AI, protocol bridging (i.e., connecting different social ecosystems), bulk datasets, and web archiving (such as the Internet Archive\u2019s Wayback Machine).<\/p>\n<p class=\"wp-block-paragraph\">If a user indicates that they don\u2019t want their data used to train generative AI, the proposal says, \u201cCompanies and research teams building AI training sets are expected to respect this intent when they see it, either when scraping websites, or doing bulk transfers using the protocol itself.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Molly White, who writes the Citation Needed newsletter and Web3 is Going Just Great blog, <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/bsky.app\/profile\/molly.wiki\/post\/3lkg6zdex6s2t\">described this<\/a> as \u201ca good proposal,\u201d and said it was \u201cweird to see people flaming BlueSky for it,\u201d since it\u2019s not so much \u201cwelcoming in AI scraping\u201d but rather \u201ctrying to add a consent signal to allow 
users to communicate preferences for the scraping that is already happening.\u201d<\/p>\n<p class=\"wp-block-paragraph\">\u201cI think the weakness with this and [Creative Commons\u2019] similar proposal for \u2018preference signals\u2019 is that they rely on scrapers to respect these signals out of some desire to be good actors,\u201d White continued. \u201cWe\u2019ve already seen some of these companies blow right past robots.txt or pirate material to scrape.\u201d<\/p>\n<\/div>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techcrunch.com\/2025\/03\/15\/bluesky-users-debate-plans-around-user-data-and-ai-training\/\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Social network Bluesky recently published a proposal on GitHub outlining new options it could give users to indicate whether they want their posts and data to be scraped for things like generative AI training and public archiving. 
CEO Jay Graber discussed the proposal earlier this week, while on-stage at South by Southwest, but it attracted&#8230;<\/p>\n","protected":false},"author":1,"featured_media":657369,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/01\/bluesky-GettyImages-2185142051.jpg?resize=1200,800","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[151568,154912,31059],"class_list":["post-657368","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-bluesky","tag-jay-graber","tag-social"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/657368","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=657368"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/657368\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/657369"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=657368"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=657368"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=657368"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}