Back to Blog

Lessons in transparency and LLM Security from the anthropic threat intel report

Gal Moyal

August 28, 2025

Anthropic’s August 2025 Threat Intelligence Report is transparency in action. It lays out, case by case, how determined adversaries have tried to misuse Anthropic’s LLM, Claude, what damage they sought to cause, and how Anthropic responded. Too often in our industry, security risks get buried in euphemisms. Here, Anthropic is choosing sunlight over spin, which will help make everyone safer.

The LLM Security Threats at a Glance

The report catalogs a range of LLM security incidents that should concern every CISO:

AI-driven data extortion (vibe hacking): A single actor used Claude Code to automate reconnaissance, harvest credentials, and run targeted extortion across 17 organizations. The AI not only executed intrusions but also generated ransom notes tailored to each victim.

North Korean fraudulent employment schemes: Operatives leaned on Claude to simulate technical skills, pass job interviews, and sustain employment at Western firms, funneling income to sanctioned programs.

No-code malware and ransomware-as-a-service: Criminals with limited technical expertise built and sold ransomware kits, complete with evasion techniques and customer “support” channels.

AI-enhanced fraud supply chains: From carding services to romance scams, AI is now embedded at every step of digital crime.

Anthropic responded by banning accounts, deploying new classifiers, and sharing intelligence with partners. Still, the larger message is clear: AI is reshaping the threat landscape, and LLM security must evolve accordingly.

The Larger Conversation: Building Comprehensive LLM Security

What Anthropic surfaced isn’t just a list of incidents; it’s a glimpse into what the next decade of cyber threats will look like. AI is making cybercrime easier. Left unchecked, it could change the tempo, scale, and accessibility of cybercrime.

So the real conversation isn’t, “look what happened.” It’s, “how do we build robust LLM security frameworks that can defend against these evolving threats?”

1. AI Governance: Setting LLM Security Standards Before the Race

Just as nuclear power came with treaties and aviation came with air traffic control, AI requires a governance framework, and visibility is the foundation. We all know you can’t govern what you can’t see. In LLM security, that means maintaining a complete asset inventory of where models are running, what data they’re accessing, how they’re being used, and who has access. It also means creating an AI Bill of Materials (AIBOM), a record of the data, components, and dependencies behind each system. Without this visibility, LLM security could become the wild west, where the fastest exploiters set the rules.

2. AI Red Teaming: Testing LLM Security Guardrails

You don’t know the strength of a fence until someone leans on it. LLM security is no different. AI red teaming and structured adversarial testing is how we discover whether AI guardrails hold up under pressure.

The criminal case studies Anthropic uncovered are, in many ways, live-fire red teaming done by the wrong crowd. The security community needs to do the same, but intentionally, systematically and ethically. By probing LLMs for weaknesses, testing prompt injections, and simulating adversarial workflows, defenders can stay ahead of criminals instead of simply reacting to them.

3. Runtime LLM Security: Guarding the Model in Motion

Governance sets the rules. Red teaming tests them. But what happens when the model is live, responding in real time? That’s where runtime LLM security comes in.

Runtime LLM security means monitoring the inputs and outputs of models as they’re being used. It’s about enforcing policies dynamically, catching when a model’s response veers into dangerous territory, spotting when usage patterns look criminal, and in some cases, blocking harmful activity outright. It’s the equivalent of intrusion detection and prevention, but pointed at the model itself.

Why LLM Security Matters Now More Than Ever

The lesson of Anthropic’s report is that criminals aren’t waiting. They’re already integrating AI into extortion, fraud, and espionage. They’re moving faster than many organizations’ LLM security defenses.

As defenders, we need to meet that pace:

Governance: Push for cross-industry LLM security standards and transparency, including robust asset inventories and AIBOMs.

Red teaming: Continuously stress-test models against real adversarial tactics to strengthen LLM security postures.

Runtime security: Instrument LLMs and agents with monitoring and enforcement that can stop misuse mid-flight.

If we fail to do this, we risk AI becoming the accelerant that turns sparks of cybercrime into full-scale infernos.

The Future of LLM Security

Anthropic’s report is a wake-up call and a roadmap. By choosing to share their findings, they’ve reminded us that transparency fuels defense, and that burying problems doesn’t make them disappear.

We are at an inflection point. AI can be a tool for progress and a tool for harm. Whether it becomes the former or the latter depends on how quickly we mature our LLM security governance, how rigorously we test, and how effectively we secure models at runtime.

The future of LLM security isn’t just about building smarter models; it’s about building smarter security into AI systems from the ground up. As CISOs, we must lead this charge, ensuring our organizations are prepared for the AI-powered threat landscape ahead.

Request a demo of Noma Security for LLM security here.

5 min read

Back to Blog

Lessons in transparency and LLM Security from the anthropic threat intel report

The LLM Security Threats at a Glance

The Larger Conversation: Building Comprehensive LLM Security

1. AI Governance: Setting LLM Security Standards Before the Race

2. AI Red Teaming: Testing LLM Security Guardrails

3. Runtime LLM Security: Guarding the Model in Motion

Why LLM Security Matters Now More Than Ever

The Future of LLM Security

Category:

Share this:

Why the Rule of Two Can’t Protect Your Agents: MCP Servers, Agentic Risk, and the Framework That Should Replace It

Noma x OWASP: The Ultimate Red Team Playbook

ContextCrush: The Context7 MCP Server Vulnerability Hiding in Plain Sight

PLATFORM

SOLUTIONS

COMPANY

LEARN

Stay updated