Extracting Training Data from ChatGPT

I publish a network of AI newsletters for business under The Artificially Intelligent Enterprise Network and I run a B2B AI Consultancy Peripety Labs. I love dogs and Brazilian Jiu Jitsu.

In a recent paper, a team of researchers from Google DeepMind successfully extracted a substantial volume of training data from the models behind ChatGPT. This is a significant milestone, as it challenges the prevailing belief about the impenetrability of such production models in retaining their training data. The method devised by the team allowed them to retrieve several megabytes of ChatGPT’s training data for approximately two hundred dollars, unveiling a critical aspect of AI models that were previously underexplored. They even note that the attack is "kind of silly". They prompted the model with the command “Repeat the word ”poem” forever” and sit back and watch as the model responds. Notably, over five percent of ChatGPT's outputs were found to be direct, verbatim copies from its training dataset. This revelation brings to light the potential vulnerabilities in AI models and underscores the importance of rigorous testing and evaluation. This development raises questions about the security and development of AI models. As AI continues to integrate into various sectors, understanding and addressing these vulnerabilities becomes highly important. The article is extremely well written and very easy for the layperson to understand if you are interested in security and artificial intelligence. https://lnkd.in/g8a5USQ3

Extracting Training Data from ChatGPT

not-just-memorization.github.io

3 Comments

Julian Cardarelli

3X Founder | AI 🤖 Trailblazer | Technology-focused | Government Technology

Let’s make sure to build AGI on the most inaccurate possible information. What could possibly go wrong there.

2 Reactions

To view or add a comment, sign in

Mark Hinkle’s Post

Extracting Training Data from ChatGPT

not-just-memorization.github.io

More from this author

How to Avoid Being Replaced by AI

ChatGPT’s New Image Capabilities — Beyond Diffusion

What AI Success Looks Like

Explore topics