hachyderm.io is one of the many independent Mastodon servers you can use to participate in the fediverse.
Hachyderm is a safe space, LGBTQIA+ and BLM, primarily composed of tech industry professionals worldwide. Note that many non-user account types have restrictions - please see our About page.

#aitraining

ResearchBuzz: Firehose<p>The Conversation: Africa’s data workers are being exploited by foreign tech firms – 4 ways to protect them. “Since 2015, we have been studying the central role of African data workers in building and maintaining artificial intelligence (AI) systems, acting as ‘data janitors’. Our research found that companies rarely acknowledge the use of human workers in AI value chains, thus they […]</p><p><a href="https://rbfirehose.com/2025/04/01/the-conversation-africas-data-workers-are-being-exploited-by-foreign-tech-firms-4-ways-to-protect-them/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/04/01/the-conversation-africas-data-workers-are-being-exploited-by-foreign-tech-firms-4-ways-to-protect-them/</a></p>
ResearchBuzz: Firehose<p>Search Engine Land: Meet LLMs.txt, a proposed standard for AI website content crawling. “While many content creators are interested in the proposal’s potential merits, it also has detractors. But given the rapidly changing landscape for content produced in a world of artificial intelligence, llms.txt is certainly worth discussing.”</p><p><a href="https://rbfirehose.com/2025/03/29/search-engine-land-meet-llms-txt-a-proposed-standard-for-ai-website-content-crawling/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/29/search-engine-land-meet-llms-txt-a-proposed-standard-for-ai-website-content-crawling/</a></p>
Winbuzzer<p>Judge Clears the Way for New York Times Lawsuit Against OpenAI and Microsoft</p><p><a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/OpenAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenAI</span></a> <a href="https://mastodon.social/tags/Microsoft" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Microsoft</span></a> <a href="https://mastodon.social/tags/AItraining" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AItraining</span></a> <a href="https://mastodon.social/tags/CopyrightLaw" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CopyrightLaw</span></a> <a href="https://mastodon.social/tags/GenAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GenAI</span></a> <a href="https://mastodon.social/tags/ChatGPT" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ChatGPT</span></a> <a href="https://mastodon.social/tags/MediaRights" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MediaRights</span></a> <a href="https://mastodon.social/tags/AIlawsuit" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AIlawsuit</span></a> <a href="https://mastodon.social/tags/NYTvOpenAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>NYTvOpenAI</span></a> <a href="https://mastodon.social/tags/AIethics" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AIethics</span></a> <a href="https://mastodon.social/tags/FairUse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FairUse</span></a> <a href="https://mastodon.social/tags/AIlegal" class="mention hashtag" rel="nofollow noopener 
noreferrer" target="_blank">#<span>AIlegal</span></a> <a href="https://mastodon.social/tags/TechLaw" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TechLaw</span></a> </p><p><a href="https://winbuzzer.com/2025/03/28/judge-clears-the-way-for-new-york-times-lawsuit-against-openai-xcxwbn/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">winbuzzer.com/2025/03/28/judge</span><span class="invisible">-clears-the-way-for-new-york-times-lawsuit-against-openai-xcxwbn/</span></a></p>
ResearchBuzz: Firehose<p>MIT Technology Review: China built hundreds of AI data centers to catch the AI boom. Now many stand unused. “Just months ago, a boom in data center construction was at its height, fueled by both government and private investors. However, many newly built facilities are now sitting empty. According to people on the ground who spoke to MIT Technology Review—including contractors, an […]</p><p><a href="https://rbfirehose.com/2025/03/28/mit-technology-review-china-built-hundreds-of-ai-data-centers-to-catch-the-ai-boom-now-many-stand-unused/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/28/mit-technology-review-china-built-hundreds-of-ai-data-centers-to-catch-the-ai-boom-now-many-stand-unused/</a></p>
Winbuzzer<p>Meta Reuploaded 30% of Pirated Books It Downloaded for AI Training, Participating in Digital Piracy</p><p><a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/Meta" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Meta</span></a> <a href="https://mastodon.social/tags/AITraining" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AITraining</span></a> <a href="https://mastodon.social/tags/MetaAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MetaAI</span></a> <a href="https://mastodon.social/tags/Piracy" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Piracy</span></a> <a href="https://mastodon.social/tags/Copyright" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Copyright</span></a> <a href="https://mastodon.social/tags/GenAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GenAI</span></a> <a href="https://mastodon.social/tags/LLama" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>LLama</span></a> <a href="https://mastodon.social/tags/FairUse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FairUse</span></a> <a href="https://mastodon.social/tags/Libgen" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Libgen</span></a></p><p><a href="https://winbuzzer.com/2025/03/26/meta-reuploaded-30-of-pirated-books-it-downloaded-for-ai-training-participating-in-digital-piracy-xcxwbn/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">winbuzzer.com/2025/03/26/meta-</span><span 
class="invisible">reuploaded-30-of-pirated-books-it-downloaded-for-ai-training-participating-in-digital-piracy-xcxwbn/</span></a></p>
PrivacyDigest<p>Emboldened by <a href="https://mas.to/tags/Trump" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Trump</span></a> , A.I. Companies Lobby for Fewer Rules </p><p>President Trump at the White House in January with, from left, Oracle’s chairman, Larry Ellison; SoftBank’s chief executive, Masayoshi Son; and OpenAI’s chief executive, Sam Altman.<br><a href="https://mas.to/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://mas.to/tags/privacy" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>privacy</span></a> <a href="https://mas.to/tags/openai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>openai</span></a> <a href="https://mas.to/tags/softbank" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>softbank</span></a> <a href="https://mas.to/tags/oracle" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>oracle</span></a> <a href="https://mas.to/tags/aitraining" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aitraining</span></a> <a href="https://mas.to/tags/training" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>training</span></a></p><p><a href="https://www.nytimes.com/2025/03/24/technology/trump-ai-regulation.html" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">nytimes.com/2025/03/24/technol</span><span class="invisible">ogy/trump-ai-regulation.html</span></a></p>
ResearchBuzz: Firehose<p>Ars Technica: Open Source devs say AI crawlers dominate traffic, forcing blocks on entire countries. “Software developer Xe Iaso reached a breaking point earlier this year when aggressive AI crawler traffic from Amazon overwhelmed their Git repository service, repeatedly causing instability and downtime. Despite configuring standard defensive measures—adjusting robots.txt, blocking known […]</p><p><a href="https://rbfirehose.com/2025/03/26/ars-technica-open-source-devs-say-ai-crawlers-dominate-traffic-forcing-blocks-on-entire-countries/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/26/ars-technica-open-source-devs-say-ai-crawlers-dominate-traffic-forcing-blocks-on-entire-countries/</a></p>
ResearchBuzz: Firehose<p>TorrentFreak: Meta’s BitTorrent Uploads of ‘Pirate Library’ Data Equaled 30% of Downloads, Expert Says. “A lawsuit filed by several authors against Meta centers on Meta’s alleged use of pirated books for AI training data and the technical details of BitTorrent which was used to obtain them. Yesterday, Meta filed a motion for summary judgment, while countering the authors’ request to […]</p><p><a href="https://rbfirehose.com/2025/03/26/torrentfreak-metas-bittorrent-uploads-of-pirate-library-data-equaled-30-of-downloads-expert-says/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/26/torrentfreak-metas-bittorrent-uploads-of-pirate-library-data-equaled-30-of-downloads-expert-says/</a></p>
ResearchBuzz: Firehose<p>The Society of Authors: The LibGen data set – what authors can do. “The Atlantic published a searchable database of over 7.5 million books and 81 million research papers. This data set, called Library Genesis or ‘LibGen’ for short, is full of pirated material, and all of it has been used to develop AI systems by tech giant Meta.” FIVE of my books are in this data set. Do you think I […]</p><p><a href="https://rbfirehose.com/2025/03/23/the-society-of-authors-the-libgen-data-set-what-authors-can-do/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/23/the-society-of-authors-the-libgen-data-set-what-authors-can-do/</a></p>
Andrew Wedlake<p>I was thinking about "training Ai", and thought, what prevents an Ai feedback loop from occurring? </p><p>Where Ai generates an incorrect answer, and the incorrect answer is then used to train more Ai. Same goes for Ai art, etc. </p><p><a href="https://mapstodon.space/tags/Ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Ai</span></a> <a href="https://mapstodon.space/tags/ArtificialIntelligence" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ArtificialIntelligence</span></a> <a href="https://mapstodon.space/tags/AiTraining" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AiTraining</span></a></p>
ResearchBuzz: Firehose<p>MIT Press: A note on LibGen and the unauthorized use of our authors’ work. “We want to be clear: The MIT Press has not licensed any of our books or journal articles for LLM training purposes, nor have we granted permission for any such use. However, we are well aware that many MIT Press publications have ended up in pirated training data sets. We share the deep distress of our authors whose […]</p><p><a href="https://rbfirehose.com/2025/03/22/mit-press-a-note-on-libgen-and-the-unauthorized-use-of-our-authors-work/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/22/mit-press-a-note-on-libgen-and-the-unauthorized-use-of-our-authors-work/</a></p>
ResearchBuzz: Firehose<p>Ars Technica: Cloudflare turns AI against itself with endless maze of irrelevant facts. “On Wednesday, web infrastructure provider Cloudflare announced a new feature called ‘AI Labyrinth’ that aims to combat unauthorized AI data scraping by serving fake AI-generated content to bots. The tool will attempt to thwart AI companies that crawl websites without permission to collect training data […]</p><p><a href="https://rbfirehose.com/2025/03/22/ars-technica-cloudflare-turns-ai-against-itself-with-endless-maze-of-irrelevant-facts/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/22/ars-technica-cloudflare-turns-ai-against-itself-with-endless-maze-of-irrelevant-facts/</a></p>
Winbuzzer<p>Cloudflare has unveiled AI Labyrinth, a system that misleads unauthorized AI crawling bots by trapping them in auto-generated content mazes.</p><p><a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/Cloudflare" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Cloudflare</span></a> <a href="https://mastodon.social/tags/AILabyrinth" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AILabyrinth</span></a> <a href="https://mastodon.social/tags/AITraining" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AITraining</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraping</span></a> <a href="https://mastodon.social/tags/BotDetection" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BotDetection</span></a> <a href="https://mastodon.social/tags/GenAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GenAI</span></a> <a href="https://mastodon.social/tags/WebSecurity" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebSecurity</span></a> <a href="https://mastodon.social/tags/Copyright" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Copyright</span></a> <a href="https://mastodon.social/tags/Publishers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Publishers</span></a> <a href="https://mastodon.social/tags/Web" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Web</span></a></p><p><a href="https://winbuzzer.com/2025/03/21/cloudflare-deploys-ai-labyrinth-to-exhaust-unauthorized-ai-crawling-bots-xcxwbn/" rel="nofollow noopener noreferrer" 
translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">winbuzzer.com/2025/03/21/cloud</span><span class="invisible">flare-deploys-ai-labyrinth-to-exhaust-unauthorized-ai-crawling-bots-xcxwbn/</span></a></p>
Deutscher Journalisten-Verband (DJV)<p>To train its AI, Meta apparently used data from <a class="hashtag" href="https://bsky.app/search?q=%23LibGen" rel="nofollow noopener noreferrer" target="_blank">#LibGen</a>, a platform known for illegally copied scientific texts. Journalists could also be affected. More information in the press release: <a href="https://www.djv.de/news/pressemitteilungen/press-detail/kein-ki-training-mit-raubkopierten-texten/" rel="nofollow noopener noreferrer" target="_blank">www.djv.de/news/pressem...</a> <a class="hashtag" href="https://bsky.app/search?q=%23meta" rel="nofollow noopener noreferrer" target="_blank">#meta</a> <a class="hashtag" href="https://bsky.app/search?q=%23ai" rel="nofollow noopener noreferrer" target="_blank">#ai</a> <a class="hashtag" href="https://bsky.app/search?q=%23aitraining" rel="nofollow noopener noreferrer" target="_blank">#aitraining</a> <a class="hashtag" href="https://bsky.app/search?q=%23raubkopie" rel="nofollow noopener noreferrer" target="_blank">#raubkopie</a></p>
J. Steven York RESISTS<p><span class="h-card" translate="no"><a href="https://dice.camp/@StefanEJones" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>StefanEJones</span></a></span> <br>Fuck. EIGHTEEN works. Some are short stories or novellas (and I DO retain copyright on those). Okay, TEN novels. Stuff I did for Marvel, Conan, Star Trek and Mechwarrior. I am gobsmacked.<br><a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/copyright" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>copyright</span></a> <a href="https://mastodon.social/tags/theft" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>theft</span></a> <a href="https://mastodon.social/tags/AITraining" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AITraining</span></a> <a href="https://mastodon.social/tags/meta" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>meta</span></a></p>
ResearchBuzz: Firehose<p>Fast Company: Hollywood warns about AI industry’s push to change copyright law. “A who’s who of musicians, actors, directors, and more have teamed up to sound the alarm as AI leaders including OpenAI and Google argue that they shouldn’t have to pay copyright holders for AI training material. In an open letter, submitted to the White House Office of Science and Technology, more than 400 […]</p><p><a href="https://rbfirehose.com/2025/03/20/fast-company-hollywood-warns-about-ai-industrys-push-to-change-copyright-law/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/20/fast-company-hollywood-warns-about-ai-industrys-push-to-change-copyright-law/</a></p>
Miguel Afonso Caetano<p>"The AI landscape is in danger of being dominated by large companies with deep pockets. These big names are in the news almost daily. But they’re far from the only ones – there are dozens of AI companies with fewer than 10 employees trying to build something new in a particular niche. </p><p>This bill demands that creators of any AI model–even a two-person company or a hobbyist tinkering with a small software build– identify copyrighted materials used in training. That requirement will be incredibly onerous, even if limited just to works registered with the U.S. Copyright Office. The registration system is a cumbersome beast at best–neither machine-readable nor accessible, it’s more like a card catalog than a database–that doesn’t offer information sufficient to identify all authors of a work, much less help developers to reliably match works in a training set to works in the system.</p><p>Even for major tech companies, meeting these new obligations would be a daunting task. For a small startup, throwing on such an impossible requirement could be a death sentence. If A.B. 412 becomes law, these smaller players will be forced to devote scarce resources to an unworkable compliance regime instead of focusing on development and innovation. 
The risk of lawsuits—potentially from copyright trolls—would discourage new startups from even attempting to enter the field."</p><p><a href="https://www.eff.org/deeplinks/2025/03/californias-ab-412-bill-could-crush-startups-and-cement-big-tech-ai-monopoly" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">eff.org/deeplinks/2025/03/cali</span><span class="invisible">fornias-ab-412-bill-could-crush-startups-and-cement-big-tech-ai-monopoly</span></a></p><p><a href="https://tldr.nettime.org/tags/USA" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>USA</span></a> <a href="https://tldr.nettime.org/tags/California" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>California</span></a> <a href="https://tldr.nettime.org/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://tldr.nettime.org/tags/GenerativeAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GenerativeAI</span></a> <a href="https://tldr.nettime.org/tags/AITraining" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AITraining</span></a> <a href="https://tldr.nettime.org/tags/Copyright" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Copyright</span></a> <a href="https://tldr.nettime.org/tags/FairUse" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FairUse</span></a></p>
ResearchBuzz: Firehose<p>Azernews: Azerbaijan advancing AI projects, including Azerbaijani language database. “Fariz Jafarov, Executive Director of the Center for Analysis and Coordination of the Fourth Industrial Revolution, highlighted a major initiative to create a large Azerbaijani language database for artificial intelligence development, Azernews reports.”</p><p><a href="https://rbfirehose.com/2025/03/17/azernews-azerbaijan-advancing-ai-projects-including-azerbaijani-language-database/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/17/azernews-azerbaijan-advancing-ai-projects-including-azerbaijani-language-database/</a></p>
ResearchBuzz: Firehose<p>TechCrunch: Bluesky users debate plans around user data and AI training. “Social network Bluesky recently published a proposal on GitHub outlining new options it could give users to indicate whether they want their posts and data to be scraped for things like generative AI training and public archiving.”</p><p><a href="https://rbfirehose.com/2025/03/17/techcrunch-bluesky-users-debate-plans-around-user-data-and-ai-training/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/17/techcrunch-bluesky-users-debate-plans-around-user-data-and-ai-training/</a></p>
ResearchBuzz: Firehose<p>El: Kazakhstan to create national digital archive. “In his address to the participants of the IV meeting of National Kurultai, President Kassym-Jomart Tokayev instructed to create a national digital archive, which will be available to domestic and foreign developers of neural networks, El.kz reports via Akorda.”</p><p><a href="https://rbfirehose.com/2025/03/16/el-kazakhstan-to-create-national-digital-archive/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/03/16/el-kazakhstan-to-create-national-digital-archive/</a></p>