Reddit to add new tools to try and repel AI bots from scraping user data

Andrew Griffin

26 June 2024 at 12:08 pm·2-min read

Reddit users and moderators will get access to new AI-powered features as part of the deal (Nick Ansell/PA) (PA Archive)

Reddit says it will add new protections to try and repel bots that attempt to scrape its posts to train AI systems.

Many companies have proposed their large language models such as OpenAI’s ChatGPT and Google’s Gemini as the future. But training such a system requires feeding it vast amounts of written text – which companies have often taken from publicly available websites.

In recent months, sites including Reddit and Twitter have complained that visits from those crawlers have both slowed down their site as well as allowed companies to steal data in contravention of their policies.

Last month, Reddit published a new “Public Content Policy” that aimed to control how its data is used, both by researchers as well as companies looking to train automated systems. Now it has announced that it will add new technologies to try and enforce that.

It will update its “Robots Exclusion Protocol”, or robots.txt, which is a file that is visible only to websites crawling its site and gives instructions about what third parties are allowed to take.

It will also use technologies that will aim to spot unknown bots and crawlers and either stop them from repeatedly refreshing the site – or block them entirely.

“This update shouldn’t impact the vast majority of folks who use and enjoy Reddit,” Reddit said.

The company also stated that the change would not affect “good faith actors”, including those who might scrape the site for research and other purposes. It pointed to the Internet Archive, for instance, and shared a quote from the director of its Wayback Machine which scrapes the internet to allow users to see a version of a page at a given time.

“The Internet Archive is grateful that Reddit appreciates the importance of helping to ensure the digital records of our times are archived and preserved for future generations to enjoy and learn from,” said Mark Graham. “Working in collaboration with Reddit we will continue to record and make available archives of Reddit, along with the hundreds of millions of URLs from other sites we archive every day.”

Reddit also allows companies that it has deals with to scrape its posts to train AI systems. Both OpenAI and Google have agreements in place that sees them pay Reddit for access to users’ data.

Those deals led the share price of the company to share after they were announced. Users are not compensated for their posts, but the site will get access to new AI features that may be available to users as a result.

The use of Reddit to train AI models has however sometimes led to problems for those technology companies. Last month, when Google’s “AI Overview” feature began recommending including glue to make pizza, the advice was tracked down to a sarcastic Reddit post.

Yahoo Finance AU
Commonwealth Bank’s major move for millions of customers: ‘Australia first’
CBA has rolled out major changes to increase the services it offers to customers.
South China Morning Post
Tesla shows its humanoid robot Optimus at China AI conference, but behind glass
Tesla showcased its humanoid robot, the second-generation Optimus, at the World Artificial Intelligence Conference in Shanghai on Thursday, making it one of the few American AI products seen at China's top AI show. The Optimus humanoid robot, equipped with Tesla's self-developed neural network and computer vision technology, is largely seen as the future of robots as it is able to handle multiple tasks. Tesla CEO Elon Musk said at the company's annual shareholder's meeting that the humanoid robo
Evening Standard
Four Google Pixel 9 phones are expected this August with new AI features
Google is set to announce the Pixel 9, the Pixel 9 Pro, the Pixel 9 Pro XL, and Pixel 9 Pro Fold this year
Bloomberg
Indonesia’s Biggest Cyberattack Prompts Resignation, Audit
(Bloomberg) -- An official of Indonesia’s information technology ministry resigned as the government continues an audit of its data centers in the wake of the nation’s worst cyberattack.Most Read from BloombergBiden’s Fourth of July Shrouded by Pressure to Drop 2024 BidKamala Harris Is Having a Surprise Resurgence as Biden’s Campaign UnravelsHouse Democrats Consider Demanding Biden Withdraw From RaceNewsom Shocks California Politics by Scrapping Crime MeasureChina Can End Russia’s War in Ukraine
Hello!
Rod Stewart, 79, rocked by shock divorce news: report
In a surprising turn of events Rod Stewart has received some shocking divorce news. See details.
HuffPost
Trump Throws 4th Of July Fit In 'Disgraceful' New Holiday Tantrum
Critics called out the former president for a bonkers Independence Day message that barely mentioned the holiday.
The Daily Beast
Conservatives Routed in Worst Election Result for 200 Years
LONDON—The Conservatives, the world’s winningest political party, were booted out of power in dramatic style on Thursday after 14 years of chaotic and divisive rule.The Labour Party had secured a landslide victory, ending an era of Conservative rule over Britain that stretches back to 2010; the year that the iPad and Instagram were launched and Lady Gaga wore that meat dress to the MTV music awards.In that time, the Conservatives have cycled through five leaders, each of them dragging the party
Yahoo Sport Australia
Thanasi Kokkinakis in awful scenes as Alexei Popyrin produces stunning upset at Wimbledon
Thanasi Kokkinakis was left absolutely shocked after the incident. Find out more here.
Yahoo News Australia
Heartbreaking twist after severely emaciated dog 'scraped off road' by Aussie campers
Mango the dog could 'barely lift his head' or 'move his legs' when he was discovered by a quick-thinking family on the side of the road.
Yahoo Lifestyle
Aldi shoppers go wild over $4.49 product: 'It's incredible'
Shoppers are buzzing with excitement as Aldi brings back a beloved seasonal favourite. Read more.
NextShark
74-year-old woman dies after being shoved in front of San Francisco train
Corazon Dandan died after being pushed into an oncoming BART train at San Francisco’s Powell Street Station at around 11 p.m on Monday night. The suspect, 49-year-old Trevor Belmont, also known as Hoak Taing, was arrested at the scene and booked into the San Francisco County Jail on suspicion of homicide and elder abuse. Dandan, who was Filipino American, was a dedicated telephone operator at the Westin St. Francis and other hotels.
Yahoo Sport Australia
Ivan Cleary's emphatic call on leaving Panthers amid $13 million development with Nathan
The Panthers coach was adamant when asked about his future. Find out more here.
The Independent
Royal news live: Prince Harry award backlash continues as the Middletons appear at Wimbledon
Harry has been defended for his ‘incredible’ work with the Invictus Games
Yahoo Sport Australia
Ash Barty lifts lid on rejecting Andy Murray amid Emma Raducanu development at Wimbledon
Andy Murray will team up with Emma Raducanu in the mixed doubles. Read more here.
BBC
Ukraine calls them meat assaults: Russia's brutal plan to take ground
Russia is throwing wave after wave of men forward in Ukraine to make ground. The tactic is working.
Cover Media
Stormy Daniels believes it's 'not fair' she owes Donald Trump $600,000
The adult film star, who accused the former President of giving her hush money to keep quiet about their alleged sexual encounter now owes Trump $600,000 (£470,000) in legal costs after a defamation case she brought against him was dismissed. Last month in New York, Trump was criminally convicted for defaming another sexual assault accuser, E.Jean Carroll. The 45-year-old said on the Daily Mail's podcast, Everything I Know About Me, "How is it fair that I have better, more compelling evidence than E. Jean Carroll? And I'm glad she won. They continue to hand her money like it's f**king candy.”
Yahoo Sport Australia
Latrell Mitchell continues barnstorming form in NRL as Mitchell Moses crashes back to earth
Latrell Mitchell is in a purple patch of form for the Rabbitohs. Read more here.
Yahoo News Australia
'Crazy' creature found hiding on Aussie beaches stuns: 'Watch your toes'
Aussies and foreigners alike have been stunned to learn what wriggles beneath our toes at the beach.
Yahoo Sport Australia
Ugly claims around Wayne Bennett worsen as South Sydney continue stunning NRL surge
The Dolphins coach's actions have been called out. Read more here.
Yahoo News Australia
Deadly find hidden in suburban backyard soil triggers $200,000 fine
From the outside the home looked ordinary, but in the backyard investigators made a worrying discovery. Find out what it was.

Latest stories