ECONOMY & WORK
MONEY 101
NEWS
PERSONAL FINANCE
NET WORTH
About Us Contact Us Privacy Policy Terms of Use DMCA Opt-out of personalized ads
© Copyright 2023 Market Realist. Market Realist is a registered trademark. All Rights Reserved. People may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.
MARKETREALIST.COM / NEWS

OpenAI Faces Class Action Lawsuit for Alleged Unauthorized Personal Data Collection

There are allegations that OpenAI gathered an extensive volume of personal data, estimated to be around 300 billion words, from the internet.
PUBLISHED JUL 6, 2023
Cover Image Source | Pexels | Hatice Baran
Cover Image Source | Pexels | Hatice Baran

A law firm based in California has recently taken legal action against OpenAI. The Clarkson Law Firm filed a class-action lawsuit claiming that OpenAI has been collecting personal data without authorization to train its AI language models, specifically ChatGPT and DALL-E.

The lawsuit alleges that OpenAI obtained private information, including personally identifiable data, from a significant number of internet users without their consent or knowledge. This complaint has been filed in the Northern District of California court.



 

The allegations in the lawsuit suggest that OpenAI gathered an extensive volume of data, estimated to be around 300 billion words, from the internet, which included personal information and content from prominent social media platforms such as Twitter and Reddit. Clarkson Law Firm contends that OpenAI carried out this data collection covertly and without fulfilling the necessary requirements of registering as a data broker, as mandated by relevant laws.

The law firm highlights the absence of informed consent or awareness on the part of the individuals whose data is claimed to have been utilized for training ChatGPT and DALL-E.

Source: GettyImages | Leon Neal  Staff
Source: GettyImages | Leon Neal Staff

OpenAI has been at the center of controversy regarding its data collection practices. Previously, there was no explicit option for users to opt out of sharing their conversations and personal information with OpenAI for model training purposes. Furthermore, ChatGPT faced a ban in Italy under Europe's General Data Protection Regulation (GDPR) due to concerns over inadequate protection of user data, particularly that of minors. This class-action lawsuit not only highlights OpenAI's opaque privacy policies but also focuses on data that was scraped from the web without being explicitly intended for use in training AI models.

Source: GettyImages | Tomohiro Ohsumi  Stringer
Source: GettyImages | Tomohiro Ohsumi/Stringer

The lawsuit raises significant concerns about OpenAI's privacy policies, especially regarding existing users. It alleges that OpenAI has profited from the data it collected without compensating the source, leveraging billion-dollar investments from Microsoft and revenue generated through ChatGPT Plus subscriptions. The complaint specifically accuses OpenAI of violating privacy, negligence in safeguarding personal data, and larceny by illegally obtaining massive amounts of personal information to train its AI models.

Source: GettyImages | Leon Neal  Staff
Source: GettyImages | Leon Neal Staff

The complaint lists a total of 15 counts against OpenAI, encompassing various allegations related to privacy violations, negligence, and illegal acquisition of personal data. The counts highlight OpenAI's alleged misuse of publicly available datasets, such as Common Crawl, Wikipedia, and Reddit, without obtaining proper permission or consent from users. Although personal information shared on social media platforms and other public domains may be accessible, the lawsuit argues that using such data outside of the intended platform could constitute a violation of privacy.

Source: GettyImages | Leon Neal  Staff
Source: GettyImages | Leon Neal Staff

In Europe, legal distinctions exist between public domain and free-to-use data, thanks to the GDPR. However, in the United States, this issue remains a topic of debate. Nader Henein, a privacy research VP at Gartner, supports the sentiment of the lawsuit, emphasizing the importance of individuals having control over how their data is used, even when it is publicly available. Nevertheless, Henein expresses uncertainty regarding the stance the U.S. legal system will take on this matter.

Source: GettyImages | Leon Neal  Staff (2)
Source: GettyImages | Leon Neal Staff 

The outcome of the lawsuit against OpenAI remains uncertain. As the legal proceedings unfold, it will be interesting to see how the court addresses the allegations of stolen data and the violation of privacy. This case could have significant implications for the data collection practices of AI companies and the protection of the personal information of individuals in the digital age.

MORE ON MARKET REALIST
Sometimes the host of Family Feud just wants the chaos to end as it gets too much.
54 minutes ago
The show took a hilarious turn when a contestant gave a bold answer that caught the host completely off guard.
2 hours ago
Despite talking through her guesses, Carrie Trujillo couldn't crack the puzzle and failed to win $40,000.
3 hours ago
Robert Herjavec and Lori Greiner rubbed it in O'Leary's face by celebrating their deal with Phoozy
23 hours ago
Duc and Lisa Nguyen's stubbornness paid off, as the co-founders of Baubles + Soles got Daymond John as a partner.
1 day ago
The player got the host to be candid about his fears and his mother's opinion on him.
1 day ago
Justin Baer, founder of Collars & Co., was looking for mentorship from the Sharks in addition to a $300,000 investment.
1 day ago
She said that her husband may still have to buy a dog as America may hold him accountable.
1 day ago
When Harrison knew that the 18th-century map was the real deal, he made a genuine offer.
2 days ago
10 years after her sister’s win, Chelsea Hall hit the jackpot on ‘WoF’ with a brand new Mini Cooper and a cash prize.
2 days ago
The co-founders of BuggyBeds wowed the Sharks so much, they were "itching" to invest, and offered a $250k deal.
2 days ago
The guests were left stunned to find out just how much the repairs would cost.
3 days ago
Unfortunately for the seller, she allegedly got robbed of a significant amount of money.
3 days ago
Not only did the co-creators of FlingGolf get a $300,000 deal, they proved Mr Wonderful wrong.
3 days ago
The guest never imagined the old, autographed sneakers that his mom acquired could be worth so much.
4 days ago
The gameshow whiz did it again by bagging the top prize on yet another trivia test.
4 days ago
Riccardi took to Reddit to clear the air around his stunning loss which was facing scrutiny.
4 days ago
Fans gathered on the show's unofficial Reddit forum to discuss the 'dumb and useless' items.
4 days ago
The contestant, Matt Benton expressed he wanted to enjoy the moment before thinking of the future.
5 days ago
The guest who treasured the collection had no idea how significant it was.
5 days ago