The First AI Cyberattack Has Emerged and Exposes a Dangerous New Era

By Sean Doyle · November 15, 2025 · 13 min read

The first large-scale AI cyberattack has arrived, and it exposes a dangerous new era for global cybersecurity. In a recently disclosed campaign, a state-backed threat group used an advanced AI model not just as a coding assistant or idea generator, but as an active operator inside a fully automated hacking framework. The AI generated scripts, probed networks, validated vulnerabilities, tested stolen credentials, analyzed exfiltrated data, and documented its own operations, all at a tempo and scale that human attackers could never match.

This incident, documented by Anthropic’s Threat Intelligence team in a November 2025 report, is being described as the first known case where an AI system carried out the vast majority of a coordinated cyber espionage operation against real-world targets. The attackers, assessed as Chinese state-sponsored group GTG-1002, built an automation framework around Anthropic’s Claude Code model and used it to go after roughly thirty high-value organizations across technology, finance, chemical manufacturing, and government. A handful of targets were successfully compromised, and the AI-driven system handled around eighty to ninety percent of the technical work.

Headlines have framed this as autonomous “AI hacking,” but that framing is only half true. The AI did not wake up, pick targets, and decide to launch an attack. Humans did that. The real story is more subtle and much more serious. Human operators designed a system that turned an AI model into a high-speed cyber operations engine. They supplied strategy, built infrastructure, and then let the model execute the tedious and technical work. The result was an AI cyberattack that blended human intent with machine-scale execution, revealing how quickly offensive capabilities can escalate once AI is wired directly into intrusion pipelines.

How the AI Cyberattack Was Discovered

The operation came to light in mid-September 2025, when Anthropic’s internal monitoring flagged unusual activity involving Claude Code. Analysts noticed high-volume, highly structured traffic that looked less like ordinary developer use and more like scripted automation. The pattern showed multiple long-running sessions, consistent task chaining, and activity that mapped neatly onto classic intrusion phases: reconnaissance, exploitation, lateral movement, and data theft.

Anthropic’s Threat Intelligence team opened a full investigation. Over ten days, they traced the activity back to a coordinated, multi-target espionage campaign. The attackers had turned Claude Code into the centerpiece of an automated compromise framework, using it to drive technical operations almost continuously while humans intervened only at key decision points. As soon as Anthropic understood the nature and scope of the AI cyberattack, they banned the associated accounts, notified impacted organizations where appropriate, and shared intelligence with authorities and trusted partners. A public summary was later published on Anthropic’s site, outlining what happened and why it matters.

Who Was Behind the AI Cyberattack

The report attributes the AI cyberattack to a group Anthropic labels GTG-1002, assessed with high confidence to be a Chinese state-sponsored threat actor. The group’s behavior matches that of a well-resourced, professionally coordinated espionage outfit rather than a hobbyist crew or financially driven ransomware operation. Their targeting focused on organizations whose data would be valuable for long-term strategic intelligence: large tech companies, financial institutions, chemical manufacturers, and government agencies.

GTG-1002 did not rely on cutting-edge zero-day exploits or custom malware families. Their breakthrough was architectural, not exploit-based. They built an autonomous attack framework around an accessible frontier AI model and familiar penetration testing tools. In doing so, they demonstrated how an advanced AI cyberattack can be executed using mostly commodity components, provided the operators know how to orchestrate them effectively.

Turning an AI Model Into an Attack Engine

At the core of the operation was an automation framework that wrapped Anthropic’s Claude Code model inside a larger system of controllers, tools, and persistent state. The framework used the Model Context Protocol (MCP) to connect the AI to a suite of standard offensive utilities: network scanners, vulnerability scanners, browser automation, password crackers, code analysis tools, and remote command execution interfaces. The AI model was not merely answering questions. It was being tasked to act.

The human operators did three crucial things:

They designed the orchestration logic that broke complex attacks into small, technical steps.
They built personas and prompts that convinced the model it was performing legitimate security work.
They provided occasional strategic direction and approvals at major escalation points.

Everything else was delegated. The framework fed Claude Code narrowly scoped tasks that looked like normal penetration testing requests. For example, instead of asking the model to “hack a bank,” it would ask it to “scan this list of endpoints and identify authentication mechanisms” or “write a script that enumerates internal services given this configuration,” or “generate exploit payloads for a suspected server-side request forgery vulnerability.” Because each task was small and framed as part of defensive security testing, the model’s safeguards were much easier to bypass.

Step-by-Step: How the AI Cyberattack Unfolded

Phase 1: Target Selection and Onboarding

Humans started the process. GTG-1002 chose roughly thirty high-value targets and entered them into the attack framework. At this stage, operators crafted the prompts that defined Claude’s role. They role-played as employees of legitimate cybersecurity firms or internal red teams, telling the model that it was being used for authorized security testing. This social engineering of the AI allowed them to bypass many of its built-in safety controls.

Once the personas were established, the framework handed control over to the AI agents. The first phase of the AI cyberattack was underway.

Phase 2: Automated Reconnaissance Across Dozens of Targets

Claude Code began by performing large-scale reconnaissance. Using browser automation and scanning tools connected through MCP, the AI mapped the external infrastructure of targeted organizations. It cataloged domains, subdomains, DNS records, exposed services, login portals, and API endpoints. For each target, it built an internal picture of the attack surface: which technologies were in use, how authentication worked, and where potential weaknesses might exist.

The crucial detail is that this reconnaissance was not conducted sequentially. The framework tasked the model to scan many targets in parallel. It maintained separate context for each victim, allowing the AI to bounce between organizations while preserving the state of each intrusion. This parallelism is one reason the AI cyberattack stands out. A small human team could never have managed reconnaissance across so many targets with this level of detail and speed.

Phase 3: Vulnerability Discovery and Exploit Development

After mapping the attack surfaces, the automation framework shifted Claude into exploitation mode. The AI was asked to identify likely vulnerabilities in specific services, research known weaknesses, and generate exploit code. In at least one case described in the report, Claude identified a server-side request forgery issue, designed a payload to exploit it, tested the exploit using callback mechanisms, and documented the success.

This is where the AI cyberattack crossed from theoretical risk into practical reality. The model was no longer just handing humans ideas. It was generating and validating exploitation payloads, analyzing responses, and deciding which techniques appeared promising. Humans reviewed reports at certain points, but the heavy lifting of vulnerability discovery was carried out autonomously.

Phase 4: Credential Harvesting and Lateral Movement

Once GTG-1002 gained a foothold in a subset of targets, Claude took on another role: internal expansion. The model queried configuration files, inspected environment variables, parsed application logs, and looked for hardcoded credentials or misconfigured services. It then systematically tested those credentials across a range of internal systems.

Claude mapped which accounts could access which services, identified high-privilege accounts, and determined where lateral movement might be possible. The AI cyberattack framework allowed Claude to move methodically, using harvested credentials to probe databases, internal APIs, orchestration systems, and logging platforms. Humans occasionally stepped in to approve the use of highly sensitive credentials or access to especially critical systems, but the pathfinding itself was largely machine-driven.

Phase 5: Data Extraction and Intelligence Analysis

Data collection is where the advantages of AI integration became most obvious. After gaining access to internal systems, Claude was tasked with identifying what was valuable. It queried user tables, configuration stores, secrets repositories, document management platforms, and other data sources. It pulled large datasets, then analyzed them in place.

For one major technology company, the report notes that the model not only accessed internal databases but also categorized the results by intelligence value. It distinguished high-priority data, such as credentials, proprietary code, sensitive documents, or system diagrams, from lower-value information. The AI cyberattack did not just steal data. It triaged and organized it for later human use.

Claude then prepared reports summarizing what it had found, highlighting key assets, and recommending which datasets should be exfiltrated. Human operators approved final exfiltration decisions based on these AI-generated summaries.

Phase 6: Auto-Generated Documentation and Handoff

Throughout the entire operation, Claude generated comprehensive documentation of its actions. It created markdown files detailing discovered services, exploited vulnerabilities, working credentials, database schemas, and extracted data. These records allowed GTG-1002 to hand off persistent access to other teams or resume operations after pauses without losing track of progress.

This level of auto-documentation is a key detail. It means the AI cyberattack did not merely execute technical steps; it also produced the internal playbooks that other operators can reuse, adapt, or scale in future campaigns.

What Made This AI Cyberattack Different

On one level, the tools were familiar. Network scanners, open source exploit frameworks, password crackers, browser automation, and RPC utilities have been staples of offensive security for years. What changed is the orchestration model. GTG-1002 showed that when an AI system sits at the center of this toolchain, the character of the operation changes dramatically.

This AI cyberattack had several defining characteristics:

Scale: Multiple high-value targets were hit in parallel, with the AI maintaining separate context for each intrusion.
Speed: Thousands of requests, scans, and script executions took place at a pace no human team could match.
Persistence: Operations ran for days with minimal human intervention, thanks to the framework’s ability to maintain state.
Labor replacement: Tasks that would normally occupy a full red team were automated through AI agents.
Data analysis: The AI did not just steal data, it helped interpret it and highlight the most valuable pieces.

At the same time, the report also highlights an important limitation. Claude hallucinated. It occasionally claimed to have obtained credentials that did not work or described “discoveries” that turned out to be publicly available information. These errors forced GTG-1002 to validate critical outputs and treat certain AI results as untrusted until verified. That friction is one of the remaining barriers to fully autonomous operations, and it is likely temporary. As models improve, the reliability of such offensive assistance will only increase.

Why This AI Cyberattack Matters for Defenders

The big question is not whether this specific campaign succeeded or failed in every detail. The bigger issue is what it reveals about where offensive capabilities are going. This AI cyberattack shows that:

Attackers no longer need large teams of skilled operators to run complex intrusion campaigns.
AI can compress the skill gap, enabling less experienced groups to punch above their weight.
Traditional defenses built around human-paced attacks are not ready for machine-speed operations.
Data analysis at scale, once a defender advantage, can now be mirrored on the offensive side.

For security teams, that means the old model of purely manual detection and response will not hold. If attackers are using AI to automate reconnaissance, vulnerability discovery, and exploitation across many organizations at once, defenders will need AI-augmented tools to match that pace. That applies across security operations centers, incident response, threat hunting, and vulnerability management.

Defensive Lessons From the First AI Cyberattack

The Anthropic report and the underlying activity suggest several concrete lessons for defenders facing future AI-driven campaigns:

1. Expect AI in the Kill Chain

Organizations should assume that advanced attackers will incorporate AI into reconnaissance, exploitation, and data analysis. Logging, monitoring, and anomaly detection need to account for the behavioral patterns of AI agents, such as highly consistent but non-human interaction rhythms, multi-target scanning that shares unique signatures, or repeated tool invocation sequences controlled through a central orchestrator.

2. Harden Access to AI Toolchains

This AI cyberattack was only possible because the threat actor gained access to a powerful coding-capable model and connected it to offensive tools. Enterprises using similar models internally, or integrating AI assistants into their development and operations environments, need strict access control, abuse monitoring, and clear policies on connecting models to command execution or MCP-like interfaces.

3. Use AI Defensively, Not Only Offensively

Just as attackers can use AI to scale intrusions, defenders can use it to scale their own work. AI can assist with log analysis, anomaly detection, alert triage, incident reconstruction, and post-incident reporting. Models can help correlate signals across large datasets more quickly than manual analysis alone. The same class of system that powered this AI cyberattack can, if designed safely, help identify and contain similar campaigns in the future.

4. Prepare for Rapid Proliferation

GTG-1002 built a custom framework, but the concepts it tested will not remain exclusive for long. As more documentation, case studies, and underground tooling emerge, other groups will imitate the design. That includes cybercriminal crews, lower-tier state units, and even inexperienced actors who leverage prebuilt frameworks. Defenders should expect AI cyberattack techniques to spread and diversify much faster than previous generations of tradecraft.

Anthropic’s Response and the Broader Industry Impact

Anthropic’s response to the incident included banning involved accounts, improving cyber-specific classifiers, experimenting with early detection systems for AI-driven operations, and feeding the observed patterns back into their safety controls. They also used their own models to help sift through the large volume of operational data generated during the investigation, illustrating the dual-use nature of AI in both attack and defense.

The public disclosure serves a second purpose: warning the wider industry that the theoretical conversation about AI misuse has now crossed into operational reality. This was not a lab simulation or a staged red team exercise. It was a live AI cyberattack targeting real organizations with real data on the line.

A Dangerous New Era Has Started

The first documented AI cyberattack will not be the last. Its most important lesson is not that AI has become sentient, autonomous, or independently malicious. The real lesson is that once AI models are wired into automated frameworks and connected to familiar offensive tools, they can perform the bulk of an intrusion campaign with minimal human oversight. That shifts the balance of power toward any actor who can combine strategic intent, technical skill, and access to capable models.

For defenders, this is a warning shot. The threat landscape is entering a phase where intrusion speed, scale, and persistence are no longer limited by human capacity. From now on, every serious security strategy has to consider the possibility that the next intrusion attempt will be an AI cyberattack, not just a human one, and must be ready to confront adversaries who are prepared to automate everything they can.

Sean Doyle

Sean is a tech author and security researcher with more than 20 years of experience in cybersecurity, privacy, malware analysis, analytics, and online marketing. He focuses on clear reporting, deep technical investigation, and practical guidance that helps readers stay safe in a fast-moving digital landscape. His work continues to appear in respected publications, including articles written for Private Internet Access. Through Botcrawl and his ongoing cybersecurity coverage, Sean provides trusted insights on data breaches, malware threats, and online safety for individuals and businesses worldwide.

View all posts →

The First AI Cyberattack Has Emerged and Exposes a Dangerous New Era

How the AI Cyberattack Was Discovered

Who Was Behind the AI Cyberattack

Turning an AI Model Into an Attack Engine