Google has been ‘secretly stealing everything ever created on the internet’ to train its AI chatbot Bard
Lawsuit alleges tech giant took data ‘without regard for the privacy, property, and consumer protection interests of hundreds of millions of Americans’
Your support helps us to tell the story
From reproductive rights to climate change to Big Tech, The Independent is on the ground when the story is developing. Whether it's investigating the financials of Elon Musk's pro-Trump PAC or producing our latest documentary, 'The A Word', which shines a light on the American women fighting for reproductive rights, we know how important it is to parse out the facts from the messaging.
At such a critical moment in US history, we need reporters on the ground. Your donation allows us to keep sending journalists to speak to both sides of the story.
The Independent is trusted by Americans across the entire political spectrum. And unlike many other quality news outlets, we choose not to lock Americans out of our reporting and analysis with paywalls. We believe quality journalism should be available to everyone, paid for by those who can afford it.
Your support makes all the difference.Google has been accused of “secretly stealing everything ever created and shared on the internet” in order to train its AI chatbot Bard.
The class-action lawsuit filed in California alleges that Google and its AI division DeepMind used data from millions of Americans without their knowledge or consent to build its generative AI products.
“Personal data of every kind, especially conversational data between humans, is critical to the AI training process,” the lawsuit notes.
“This is how products like Bard develop human-like communication capabilities. Creative and expressive works are just as valuable because that is how AI products learn to ‘create’ art.”
Google updated its online privacy policy earlier this month, stating that it can use publicly available data to train its artificial intelligence tools.
According to the latest lawsuit, this change was designed to “double-down on its position that everything on the internet is fair game for the company to take for private gain and commercial use, including to build and enhance AI products like Bard”.
Beyond freely available data, the lawsuit claims that Google illegally accessed “at least 200 million materials explicitly protected by copyright”, including the text from books and articles behind paywalls.
Among those copyrighted materials is allegedly a book written by one of the plaintiffs named in the legal action. Many of the other plaintiffs named are listed solely as users of Google products like Search and Gmail, as well as other online platforms like TikTok.
The lawsuit alleges that Google scraped “the entire internet to take anything it could, whether contributed on Google platforms or not, and without regard for the privacy, property, and consumer protection interests of hundreds of millions of Americans who shared their insights, talents, artwork, data, personally identifiable information, and more, for specific purposes, not one of which was to train large language models to profit Google while putting the world at peril with untested and volatile AI products”.
OpenAI’s ChatGPT, which features similar capabilities to Google’s Bard, also has a proposed class action lawsuit filed against it, which accuses the chatbot of drawing on “massive amounts of personal data from the internet”.
Google did not immediately respond to a request for comment from The Independent, but a spokesperson told Reuters that the allegations were “baseless”.
Join our commenting forum
Join thought-provoking conversations, follow other Independent readers and see their replies
Comments