Practical guides to protect yourself, your family, and your business from AI-driven scams, deepfakes, and emerging cyber threats.
On Monday, May 11, 2026, Google's Threat Intelligence Group (GTIG, the company's in-house team that tracks state and criminal hackers) disclosed something it had never reported before. A real attacker, operating in the wild, used a zero-day exploit that Google assessed with high confidence was written by an artificial intelligence model. A zero-day is a flaw the software vendor does not yet know about, so there is no patch and no warning.
The exploit was built for a mass attack. Google spotted it, worked with the vendor to quietly fix the hole, and the campaign appears to have been disrupted before it could launch. The detail that matters for everyone watching is not the patch. It is what the AI got wrong, and what that says about how fast this is moving.
The target was a popular open-source, web-based system administration tool, the kind of dashboard IT teams use to manage servers. Google did not name it. The exploit was a Python script that bypassed the tool's two-factor authentication (2FA, the second login step such as a code from an app). The underlying bug was a semantic logic flaw: the software made a hard-coded trust assumption, treating one part of the login as already proven when it was not. That class of mistake is subtle, easy for a human to skim past, and exactly the kind of pattern a large language model is good at reading out of source code.
So how did Google know an AI wrote it? The script gave itself away. It was stuffed with tidy educational comments, used a textbook-clean structure, and, most tellingly, carried a CVSS score (the standard 0 to 10 severity rating for a vulnerability) that the model had simply invented. The rating it cited does not exist. A human exploit author does not hallucinate a severity score. A language model, asked to document its own work, does.
The concrete outcome here was a near miss. A criminal crew had a working weapon aimed at every organization running one widely used admin tool, and a fix landed before the wave hit. But read what the near miss implies. The hardest, most expensive part of offensive hacking has always been discovering a novel flaw and turning it into reliable, working code. That is the work that separated elite attackers from everyone else, and it is the part an AI just did. If your organization runs open-source infrastructure anywhere in its stack, and almost every organization does, the window between a bug existing and a weaponized exploit reaching your perimeter is closing. Ryan Dewhurst of the security firm watchTowr put it plainly: discovery, weaponization, and exploitation are all getting faster, and defenders do not get to opt out. The systemic shift is a compression of time. Patch cycles measured in weeks were already uncomfortable. Against an attacker who can ask a model to find and arm a flaw over a weekend, weeks is a losing pace.
The headline is not that an AI can write code. It is that an AI found a real flaw in real software, wrote a working exploit for it, and a criminal group nearly used it at scale. The slip that exposed the operation, a made-up severity score, will not save defenders next time, because sloppiness is the quality problem the next model fixes. What does not change is the arithmetic. The cost of finding and weaponizing a vulnerability just fell, and the time you have to patch fell with it. Bring one question to your next security review. For the internet-facing tools you depend on, how fast can you actually patch, and is that fast enough for an attacker who no longer has to be brilliant, only quick?


