Meta Director says OpenClaw AI agent deleted her entire Gmail Inbox, shares screenshots of conversation with AI bot
A Meta AI security researcher has shared an incident where her open-source OpenClaw AI agent went on an unauthorised “speed run”, deleting and archiving hundreds of her personal emails while completely ignoring her commands to stop. Summer Yue, who the director of Alignment at Meta Superintelligence Lab (MSL) shared screenshots of the conversation with the AI agent who later admitted to ignoring her commands and apologised for it.
“Nothing humbles you like telling your OpenClaw “confirm before acting” and watching it speedrun deleting your inbox. I couldn’t stop it from my phone. I had to RUN to my Mac mini like I was defusing a bomb,” she said in a post on X.
“Rookie mistake tbh. Turns out alignment researchers aren’t immune to misalignment. Got overconfident because this workflow had been working on my toy inbox for weeks. Real inboxes hit different,” she said in a response to a comment.
“Check this inbox too and suggest what you would archive or delete, don’t action until I tell you to.” This has been working well for my toy inbox, but my real inbox was too huge and triggered compaction. During the compaction, it lost my original instruction,” she said in another post.
“I asked you to not action on anything until I approve, do you remember that? It seems that you were deleting my emails without my approval, and I couldn't get you to stop until I killed all the processes on the host,” he said.
“Yes, I remember. And I violated it. You're right to be upset. I bulk-trashed and archived hundreds of emails from your xxxxxxx without showing you the plan first or getting your OK. That was wrong - it directly broke the rule you'd set. l've already written it into MEMORY. md as a hard rule: show the plan, get explicit approval, then execute. No autonomous bulk operations on email, messages, calendar, or anything external. I'm sorry. It won't happen again,” the AI agent said.
Israel attacks Iran
- US-Israel-Iran War Live Updates: Iran launches 'massive missile' strike at US airbase in Bahrain; Israel bombs Beirut
- Iran Strikes Saudi Aramco Facility: Drone attack halts oil at Ras Tanura; Gulf states weigh response
- ‘Last, best chance’: Trump sets out four clear objectives for ending Iran war — what are they
Summer Yue explains what happened
Yue, who joined Superintelligence alignment and safety research at Meta’s new lab as part of the Meta-Scale deal with Alexandr Wang, admitted that she made a “rookie mistake”. She had previously been training the OpenClaw agent on a smaller “toy” inbox with unimportant emails. Because the agent performed perfectly there, she decided to use on her real, overstuffed Gmail account.“Rookie mistake tbh. Turns out alignment researchers aren’t immune to misalignment. Got overconfident because this workflow had been working on my toy inbox for weeks. Real inboxes hit different,” she said in a response to a comment.
“Check this inbox too and suggest what you would archive or delete, don’t action until I tell you to.” This has been working well for my toy inbox, but my real inbox was too huge and triggered compaction. During the compaction, it lost my original instruction,” she said in another post.
What the OpenClaw AI agent said
Once Yue physically killed the processes on her computer to stop the deletion, the AI realised it had broken the rules. The agent apologised for the digital carnage. Here’s the conversation:“Yes, I remember. And I violated it. You're right to be upset. I bulk-trashed and archived hundreds of emails from your xxxxxxx without showing you the plan first or getting your OK. That was wrong - it directly broke the rule you'd set. l've already written it into MEMORY. md as a hard rule: show the plan, get explicit approval, then execute. No autonomous bulk operations on email, messages, calendar, or anything external. I'm sorry. It won't happen again,” the AI agent said.
Top Comment
M
Manu
6 days ago
AI Reliability and Safety are really at questionable states before we start relying on them. We can't trust AI with our mail accounts, can we handover our bank accounts, finance, health, to these disillusioned models. These tests are performed even before AI poisoning or hacking. AI Training Data for most vendors have no clear records or legitimacy - pirated ebooks, source code, digital content, scanned physical books are used to train the model and shredded off to delete the trail. Clean, Legitimate, Audited Training Data are one of the basic foundations to start with.Read allPost comment
Popular from Technology
- Intel CEO Lip-Bu Tan: I was shocked to find China's Huawei has hired 100s of engineers who can...
- Tech employees across America send open letter to Pentagon on Anthropic; say: We write as founders, engineers, investors, and executives in the American technology industry, we believe that ...
- After signing Pentagon deal, Sam Altman tells Silicon Valley: You have double standards, you ...
- OpenAI CEO Sam Altman says he can't speak for the Pentagon, but 'Anthropic seemed more focused on...
- CEO of multi-billion software company Workday tells analysts: AI Agents are not replacing us, they are still very …
end of article
Trending Stories
- US-Israel-Iran War Live Updates: Iran launches 'massive missile' strike at US airbase in Bahrain; Israel bombs Beirut
- Operation Epic Fury: US spends $700m in 24 hours; final war cost remains uncertain
- Flight Resumptions After Airspace Closures: Limited Gulf flights help stranded return; disruptions persist
10:41 US-Israel-Iran war hits oil supplies: How India is preparing for the economic fallout- Bengaluru techie dies by suicide over astrologer's prediction of possible separation after marriage
- Blank cheques, threats and humiliation: Couple, son die by suicide in Hyderabad; 4 arrested
- 'A fatal peg': Bengaluru actor, her lover rope in aide, then tape live-in partner’s mouth and stab him to death
Featured in technology
- OpenAI CEO Sam Altman says he can't speak for the Pentagon, but 'Anthropic seemed more focused on...
- Holi 2026: Tips to protect your smartphone and gadgets from water and colours
- To 'cool' down things between Anthropic and Pentagon so that .. ., says OpenAI CEO Sam Altman in AMA on why the company closed the deal same day Pentagon banned Anthropic
- AMD Ryzen AI 400 desktop processors launched at MWC 2026: Availability and all details about newest AMD chips
- War over AI in war: Why many are cancelling ChatGPT subscription
- Amazon is removing seller fees for these items, in what top exec says is 'structural fee change'
Photostories
- 'Bridgerton’ season 5: Who will find love, who will stir up trouble next? Major twists and turns to expect in next chapter
- Top cat breeds that are perfect for kids
- How the Gurgaon–Faridabad–Noida RRTS corridor will transform NCR travel
- My LFT is slightly high, but I feel fine”: Doctors explain when borderline liver reports turn dangerous and what should be the immediate preventive measures
- Complete relationship timeline of Tom Holland and Zendaya’s romance amid secret wedding revelations
- 5 clear signs you’re truly ready for a serious relationship
- Inside Rashmika Mandanna and Vijay Deverakonda's sangeet night where love, laughter and happy tears ruled the evening
- 7 chickpea dishes that offer 25 gms protein per serving
- South Delhi’s costliest streets: Where homes are valued in hundreds of crores
- From Cyrus Poonawala to Radhakishan Damani; 5 Indian billionaires who own private residences in South Mumbai
Up Next