https://www.theverge.com/ai-artificial-intelligence/881574/cline-openclaw-prompt-injection-hack
A hacker tricked a popular AI coding tool into installing OpenClaw (the viral, open-source AI agent that actually does things) absolutely everywhere. Funny as a stunt, but a sign of what's to come as more and more people let autonomous software use their computers on their behalf.
The hacker took advantage of a vulnerability in Cline, an open-source AI coding agent popular among developers, that security researcher Adnan Khan had surfaced just days earlier as a proof of concept. Simply put, Cline's workflow used Anthropic's Claude, which could be fed sneaky instructions and made to do things that it shouldn't, a technique known as a prompt injection.
The hacker used their access to slip through instructions to automatically install software on users' computers. They could have installed anything, but they opted for OpenClaw. Fortunately, the agents were not activated upon installation, or this would have been a very different story.
It's a sign of how quickly things can unravel when AI agents are given control over our computers. Prompt injections may look like clever wordplay (one group wooed chatbots into committing crimes with poetry), but in a world of increasingly autonomous software, they are massive security risks that are very difficult to defend against. Acknowledging this, some companies instead lock down what AI tools can do if they're hijacked. OpenAI, for example, recently introduced a new Lockdown Mode for ChatGPT that prevents it from giving your data away.
-snip-
Lockdown Mode isn't available to individual consumers, though:
https://openai.com/index/introducing-lockdown-mode-and-elevated-risk-labels-in-chatgpt/
Btw, the hacker had warned Cline about the security risk earlier, but they didn't fix it till he called them out in public.