Imagine a relentless adversary targeting your company’s crown jewels: all the sensitive information and personal data guarded by your AI system. One class of techniques attacks the AI model itself.
These attacks aim to achieve two goals: First, to replicate the functionality of the AI system. This grants the adversary access to the same capabilities that your company leverages from the AI model, potentially stealing your competitive edge. Second, to uncover the training data. This exposes the sensitive information used to train the model, even if anonymized, potentially leading to privacy violations and data breaches.
There are several related attacks:
- Membership Inference: Attempts to determine whether a specific data record was used to train the model. Attackers analyze the model’s responses to infer that a particular individual’s data is present in the training set, even if it is never directly disclosed (a minimal sketch follows this list).
- Attribute Inference: Aims to infer specific attributes of the training data. By observing the model’s behavior, attackers might learn characteristics like age, location or other demographics.
- Inversion Attack: Attackers craft inputs that cause the model to reveal information from its training data. This is most often exploited against generative models (e.g., face generation), for example by coaxing the model into producing an image that resembles a specific individual in the training set.
- Model Stealing: These reverse-engineering attacks involve querying the model extensively with carefully crafted inputs to learn its internal logic. By analyzing the responses, attackers can build a replica model, effectively stealing its functionality.
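To make the membership inference idea concrete, here is a minimal, illustrative sketch of the simplest version of the attack: comparing the model’s confidence on a candidate record against a threshold. The dataset, model choice, and the 0.9 threshold are all assumptions for illustration, not a real attack pipeline.

```python
# Minimal sketch of a confidence-threshold membership inference test.
# Illustrative only; assumes a scikit-learn-style classifier with predict_proba.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Stand-in "private" training data plus a held-out set playing the non-members.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_train, X_out, y_train, y_out = train_test_split(X, y, test_size=0.5, random_state=0)

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)

def confidence_on_true_label(model, X, y):
    """Model's predicted probability for the correct class of each record."""
    proba = model.predict_proba(X)
    return proba[np.arange(len(y)), y]

# Training records tend to receive higher confidence than unseen records;
# a naive attacker guesses "member" when confidence exceeds a threshold.
threshold = 0.9  # hypothetical cut-off an attacker might tune on shadow models
member_scores = confidence_on_true_label(model, X_train, y_train)
nonmember_scores = confidence_on_true_label(model, X_out, y_out)

print("Guessed 'member' on real members:   ", np.mean(member_scores > threshold))
print("Guessed 'member' on non-members:    ", np.mean(nonmember_scores > threshold))
```

The wider the gap between those two rates, the more the model leaks about who was in its training data, which is exactly the signal the privacy questions below are meant to reduce.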
While security and engineering resources are more likely to take the lead protecting against these attacks, here are some questions a Privacy Engineer should be asking during the AI product lifecycle:
- What is the minimum amount of data, and especially personal data, required to achieve the AI system’s purpose?
- Is the personal data anonymized, and is the anonymization providing sufficient protection against model and data theft attacks?
- Can synthetic or anonymized data be used instead of real personal data?
- How will the model itself be protected from suspicious prompts or queries, attempts to access unauthorized data, or reverse engineering?
- Is noise added during training (such as through differential privacy), making it harder to determine or infer individual data points? (A minimal sketch of this idea follows this list.)
- Are mechanisms put in place and working to monitor for suspicious activity, unauthorized data access or other attempts to reverse engineer the model or data?
- Who has ongoing access to the AI model and the training data?
- Is the security principle of least privilege applied to minimize the attack surface?
- Are results from ongoing operations monitoring integrated into the process of updating and improving the AI model?
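On the noise question: the core mechanism behind differentially private training (in the DP-SGD style) is to clip each example’s gradient and add calibrated noise before averaging, so no single record dominates an update. The sketch below is illustrative only; the clipping norm and noise multiplier are assumed values, not a tuned privacy budget.

```python
# Minimal sketch of the noise-addition idea behind differentially private training:
# clip each example's gradient contribution, then add Gaussian noise before averaging.
import numpy as np

rng = np.random.default_rng(0)

def dp_average_gradient(per_example_grads, clip_norm=1.0, noise_multiplier=1.1):
    """Clip per-example gradients to clip_norm, add noise, and return the noisy mean."""
    clipped = []
    for g in per_example_grads:
        norm = np.linalg.norm(g)
        clipped.append(g * min(1.0, clip_norm / (norm + 1e-12)))  # per-example clipping
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=per_example_grads[0].shape)
    return (np.sum(clipped, axis=0) + noise) / len(per_example_grads)

# Toy usage: gradients for a batch of 32 examples on a 10-parameter model.
grads = [rng.normal(size=10) for _ in range(32)]
noisy_grad = dp_average_gradient(grads)
print(noisy_grad)
```

Because any one example’s influence is bounded by the clipping norm and then masked by the noise, attacks like the membership inference test sketched earlier become measurably harder.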