Tumblr, WordPress Plan to Sell User Data to OpenAI and Midjourney to Train AI Models: Report

February 28, 2024

Tumblr and WordPress users might soon find that their data is being used to train artificial intelligence (AI) models, according to a report. Automattic, the parent company of the two blogging platforms, has allegedly struck deals with OpenAI and Midjourney to sell user-generated content that will reportedly be used to help train AI. While the details of the deals and the data-sharing practices remain unclear at the moment, the report has raised questions about data privacy and the ethics of companies sharing their users’ data with third parties.

Internal communications among Automattic employees, viewed by 404 Media, both confirmed the deals with the AI companies and revealed details of these practices. In its report, the publication said Automattic’s deals with OpenAI and Midjourney could be announced soon. Further, data compilation for the AI firms appears to have already begun. An internal post by product manager Cyle Gage suggested that all of Tumblr’s public post content between 2014 and 2023 was compiled.

The report also highlights a specific message suggesting that private and deleted user content was automatically compiled alongside the public data. It is not clear whether that set of data has already been shared with the AI firms. Because such an error puts the private information of the entire user base in jeopardy, it also raises questions about the company’s ethics policies and data safety infrastructure.

Automattic said in a statement on Tuesday, “AI is rapidly transforming nearly every aspect of our world, including the way we create and consume content. At Automattic, we’ve always believed in a free and open web and individual choice. Like other tech companies, we’re closely following these advancements, including how to work with AI companies in a way that respects our users’ preferences.”

The post detailed several measures the company is taking for its users, including blocking AI platform crawlers, offering a setting that discourages search engines from indexing a site on WordPress and Tumblr, and promising an opt-out setting for users who do not wish to share data with third parties. “Currently, no law exists that requires crawlers to follow these preferences,” the post stated.

The mechanism for opting out of data sharing is also somewhat unclear. While the company stated in the post that the AI firms will respect the opt-out settings and even remove past content from users who have newly opted out, the report claims the reality is more complicated.

The report cites an internal document from February 23 in which an employee asked whether the company had any assurance that the data partners would respect users’ opt-out decisions. Andrew Spittle, Automattic’s Head of AI, reportedly replied, “We will ask that content be deleted and removed from any future training runs. I believe partners will honor this based on our conversations with them to this point. I don’t think they gain much overall by retaining it.”

According to the report, the response was vague and did not confirm whether Automattic had an agreement to that effect. Further, the entire line of reasoning appears to rest on the assumption that AI firms will not gain much by retaining the user data. It should be noted that third-party data sharing is not a new practice, and most social media platforms hold rights to the public user-generated content on their platforms. However, making such deals without disclosing them to users could potentially expose private information to companies that are using the same data to train AI systems.
