Secure AI: Achievable Goal or High-Tech Mirage? (Part 1/5)

Subodh Shakya

Vice President, Information Security Officer, CISSP, CISA, CISM, CCSP

Published Mar 20, 2025

AI is eating the world. And possibly our jobs, cool-factor, and cybersecurity playbooks. Enterprises are racing to sprinkle “smart” into every app while boardrooms brag about transformative AI initiatives. But let’s be honest, is “secure AI” even possible? Or are we chasing a cybersecurity unicorn?

In this lighthearted reality check, we’ll explore the current state of AI security and separating genuine progress from vaporware illusions.

Strap in (and keep your AI-hype Bingo cards handy).

Meet the Usual Suspects: Hallucinations, Poisoning & Other AI Party Tricks

Let’s start with the rogue’s gallery of AI/LLM security risks wreaking havoc (or at least giving CISOs heartburn). These aren’t your traditional bugs and backdoors; they’re stranger beasts:

1. Hallucinations

Large language models are creative — sometimes too creative — confidently outputting false information or code that doesn’t exist. Harmless? Tell that to the developer who blindly trusted a coding assistant, only to end up npm install-ing a nonexistent package that an attacker quickly published.

AI is an overconfident intern recommending malware. And you only realize it when your app gets breached.

2. Prompt Injection

This is the Jedi mind trick of AI. Attackers convince the model to ignore its programming and spill secrets.

Remember when a Stanford student asked Bing’s chatbot to “ignore previous instructions,” and it complied? That’s not a bug. That’s a design flaw.

And it gets worse. Researchers have shown that hidden text on a website can modify an AI’s behavior without the user even realizing it. That’s not just social engineering; it’s a serious CVSS score!

3. Data & Model Poisoning

Poisoning is like booby-trapping a Magic 8-Ball: 99 shakes give innocent answers, but the 100th gives you a malware payload.

Even a tiny fraction of poisoned data can cause catastrophic failures.

Fun fact: NIST calls poisoning attacks “the most critical vulnerability” for machine learning systems. That’s government-speak for “you’re screwed.”

4. Fine-Tuning and Model Hijacking

If an attacker gets access to the model weights or the fine-tuning process, they can implant functionality you definitely didn’t order.

Imagine a rogue version of a popular open-source model on a public repo. One that looks legit but has a spy embedded. You’d never know until it starts phoning home.

Downloading a rogue model from a public repo is like grabbing a “free” copy of Photoshop, only to realize it’s a remote-access Trojan.

5. Prompt Leaks & Data Exposure

LLMs are packrats with fuzzy recall. They’ll cheerfully regurgitate sensitive training data or internal prompts without proper safeguards.

One researcher got ChatGPT to leak chunks of its training data just by repeating certain trigger words. Who knew that “banana banana banana” could become a crowbar for confidential info?

Without proper output sanitization, an AI could blurt out someone else’s private data, turning a prompt into a breach quite unpredictably.

6. Excessive Agency & Tool Misuse

Modern AI systems can plug into APIs, databases, and even autonomous agents that execute code. That’s powerful, yes, and ripe for abuse.

Giving an LLM too much agency is like handing a toddler your credit card. It might work out, but it probably won't.

Just ask the companies that learned this the hard way when AI models started submitting support tickets on their own… because that’s what they were trained to do.

🚨 Ready or Not, Here Comes AI (And We’re Not Ready)

Given this chaos, surely cybersecurity teams must be ready for the AI revolution, right?

Recommended by LinkedIn

Machine learning and AI play a crucial role in cyber…

Naveen J. 7 years ago

AI Based Defensive Systems Impact on Cybercriminal…

Carmen Marsh 5 years ago

Keeping Data Secure with the Emergence of AI

NISC 1 year ago

Spoiler: Nope. Many are still bolting on the wings mid-flight.

At the end of 2023, over 60% of security professionals admitted they felt unprepared for AI-driven threats. By late 2024, that number improved… to 45%. Progress? Technically, yes, but still terrifying.

The irony? Even as security folks panic, companies are full steam ahead on AI adoption.

CISOs are in a bind:

Say “no” to AI projects; you’re a roadblock.
Say “yes,” and pray nothing blows up.

😬 Security Teams Are Playing Catch-Up

To be fair, the cybersecurity playbook never had a chapter on “prompt engineering” or “model inversion attacks.” Most SOC analysts and AppSec engineers are now scrambling to learn ML basics and adapting threat models that sound like sci-fi plots.

AI security problems are also weird.

Is a prompt injection more like SQL injection or social engineering? Answer: a bit of both, plus some new twists.

A few encouraging signs, though:

✅ AI Red Teams – Companies are starting to form cross-functional teams to pen-test AI systems.

✅ AI-Specific Security Training – Forward-thinking CISOs are updating training to cover prompt manipulation, model misuse, and data poisoning.

✅ Vendor Tools – New tools are emerging to detect AI-specific threats. But tread carefully, a lot of them smell like vaporware.

If you overhear pen-testers swapping jailbreaking tricks for ChatGPT, that's the new normal

🚀 The AI Security Gold Rush – With No Map

AI security right now feels like the early days of cloud security, except cloud misconfigurations are fixable. An AI model that’s been poisoned or hijacked? Good luck undoing that damage.

The adversaries are getting smarter. AI-powered malware, crime forums training custom chatbots, and LLM-driven phishing are already in the wild. Meanwhile, many IR teams are still figuring out where AI is being used in their environment.

AI security is a bit of a Wild West right now. The adversaries are getting smarter, and the defenders are still reading the manual.

And if it makes the security teams feel any better, even the AI creators are getting caught off-guard. Remember when ChatGPT leaked user chat histories and credit card details due to a bug? Oops!

🎯 The Illusion of Secure AI

Let’s be honest:

Can AI security improve? Yes.
Will AI ever be “secure”? Probably not.

“Secure AI” may be an illusion, but safer AI is achievable. The trick is to stop treating AI like software and start treating it like the unpredictable, self-learning system it is.

🔥 Next Week:

"When AI Goes Rogue and Why You Can’t Fully Control LLMs.”

The Unfiltered Truth

Welcome to The Unfiltered CISO — where we confront uncomfortable truths in cybersecurity. No corporate fluff, no polished narratives. Just the raw, unfiltered reality.

🔥 Agree? Disagree? Let’s hear it in the comments.

🔥 No opinion is too bold. No perspective is too controversial. This is just the beginning.

🚀 Buckle up. It will be one hell of a ride.

The Unfiltered CISO

99 followers

+ Subscribe

Joe Stimac

CEO and Founder of AccuHire.com, specializing in Hiring, Retention and Engagement strategies.

1mo

Brilliant summary. Trust but verify should be "verify before trusting."

1 Reaction

See more comments

To view or add a comment, sign in

Secure AI: Achievable Goal or High-Tech Mirage? (Part 1/5)

Subodh Shakya

Vice President, Information Security Officer, CISSP, CISA, CISM, CCSP

Meet the Usual Suspects: Hallucinations, Poisoning & Other AI Party Tricks

1. Hallucinations

2. Prompt Injection

3. Data & Model Poisoning

4. Fine-Tuning and Model Hijacking

5. Prompt Leaks & Data Exposure

6. Excessive Agency & Tool Misuse

🚨 Ready or Not, Here Comes AI (And We’re Not Ready)

Recommended by LinkedIn

😬 Security Teams Are Playing Catch-Up

🚀 The AI Security Gold Rush – With No Map

🎯 The Illusion of Secure AI

🔥 Next Week:

The Unfiltered Truth

The Unfiltered CISO

99 followers

More articles by Subodh Shakya

Insights from the community

Others also viewed

Navigating AI Challenges: Future Predictions and Global Responses

How Cybersecurity Professionals Can Thrive in the Age of AI

Role of ML and AI : BlackHat Hacking

Incorporating GenAI into Cybersecurity

How Will AI Lead the Charge in Cybersecurity, and Why Will It Ultimately Be AI vs. AI?

ArtPrompt Unleashed: Mastering AI-Powered Social Engineering Attacks and Defenses

The Realities of AI: What It Is—and Isn’t—in Cybersecurity

Searching for D’Artagnan? GenAI!

The Top 10 AI And Machine Learning Use Cases Everyone Should Know About

The Role of Artificial Intelligence in Enhancing Cybersecurity

Explore topics

Meet the Usual Suspects: Hallucinations, Poisoning & Other AI Party Tricks

1. Hallucinations

2. Prompt Injection

3. Data & Model Poisoning

4. Fine-Tuning and Model Hijacking

5. Prompt Leaks & Data Exposure

6. Excessive Agency & Tool Misuse

🚨 Ready or Not, Here Comes AI (And We’re Not Ready)

Recommended by LinkedIn

😬 Security Teams Are Playing Catch-Up

🚀 The AI Security Gold Rush – With No Map

🎯 The Illusion of Secure AI

🔥 Next Week:

The Unfiltered Truth

The Unfiltered CISO

99 followers

More articles by Subodh Shakya

Secure AI: Why LLMs Can't Be Controlled (and Probably Never Will be) Part 2/5

The Insider Threat No One Can Stop: When Power Grants Itself Access

Insights from the community

Others also viewed

Navigating AI Challenges: Future Predictions and Global Responses

How Cybersecurity Professionals Can Thrive in the Age of AI

Role of ML and AI : BlackHat Hacking

Incorporating GenAI into Cybersecurity

How Will AI Lead the Charge in Cybersecurity, and Why Will It Ultimately Be AI vs. AI?

ArtPrompt Unleashed: Mastering AI-Powered Social Engineering Attacks and Defenses

The Realities of AI: What It Is—and Isn’t—in Cybersecurity

Searching for D’Artagnan? GenAI!

The Top 10 AI And Machine Learning Use Cases Everyone Should Know About

The Role of Artificial Intelligence in Enhancing Cybersecurity

Explore topics