Practical guides to protect yourself, your family, and your business from AI-driven scams, deepfakes, and emerging cyber threats.
On May 29, 2026, researchers at Permiso Security disclosed an attack they call ChatGPhish. The premise is unsettlingly plain: hide a few lines of instructions on any public web page, wait for someone to ask ChatGPT to summarize it, and the assistant's reply comes back carrying a live phishing link, a fake "account security" alert, or a scannable QR code. All of it renders inside ChatGPT's own window, styled exactly like a genuine answer.
There is no malware here, no stolen password, no breached server. The exploit is the trust users place in the assistant's reply. ChatGPhish takes that trust and quietly hands the pen to an attacker, and understanding how is the difference between treating an AI summary as a verdict and treating it as just another piece of untrusted web content.
Permiso submitted the bug to OpenAI through the Bugcrowd platform on April 29, 2026, under the label "Untrusted Markdown Rendering Leads to XSS, Phishing, and Data Exfiltration." OpenAI first responded that it could not reproduce the issue. A revised report on May 1 with a fuller proof-of-concept was then classified as a duplicate of an already-known issue. After further correspondence on May 7, Permiso published its findings on May 29, 2026, noting it had received no confirmation that a fix was in place. The Register, The Hacker News and Cyber Security News all covered the disclosure the same day.
The mechanism is a form of indirect prompt injection, meaning the attacker feeds the AI hidden instructions disguised as ordinary content, so they ride into the model's output without the user ever seeing them. Researchers also call this a cross-prompt injection attack, or XPIA. The same trick was demonstrated last year against Microsoft Copilot using booby-trapped emails. ChatGPhish swaps the email for the browser. When ChatGPT summarizes a page, it renders that summary in Markdown, the lightweight formatting language that turns text into clickable links, images and headings. Crucially, chatgpt.com's renderer trusts the links and image URLs that come back from third-party page content, so an instruction smuggled onto a GitHub README, a documentation portal, a blog post or a SaaS dashboard can plant attacker-controlled elements directly into the reply, with no label marking them as foreign.
The concrete consequence is that the safest-feeling part of the interaction, the assistant's tidy summary, becomes the delivery mechanism. A spoofed "your account needs verification" notice carries the visual authority of ChatGPT itself, and the QR-code variant is built to jump from a hardened laptop to a far less protected phone. For your organisation, the shift is specific: the moment anyone on your team uses "summarize this page" on a public wiki, a partner's README or a customer-supplied link, the output can no longer be treated as vetted. Your help desk and finance staff have spent years learning to hover over email links before clicking, and that instinct does not yet exist for links that appear inside an AI answer. The systemic problem is the one the OWASP project placed at the very top of its 2025 list of large-language-model risks: a model cannot reliably separate legitimate instructions from attacker text buried in the data it was asked to read. The browser's usual safety boundaries do not help, because the assistant acts with the user's own authenticated session. Every browser-integrated AI summariser, not just ChatGPT, inherits the same structural flaw until origin labelling is enforced.
ChatGPhish is not a breach. It is a warning about where trust now lives. As people increasingly let an assistant read the web for them, the assistant's reply has quietly become a publishing surface that an outsider can write to. Until these tools clearly mark which parts of an answer came from the open web, the only safe assumption is the uncomfortable one: a link inside an AI summary is no more trustworthy than the random page it summarised. Raise it in your next security review before someone scans the wrong QR code.


