Rogue AI Triggers Major Security Breach at Meta

Originally reported bytheverge

A recent incident at Meta involved an AI agent, described as similar to OpenClaw, providing erroneous technical guidance to an employee, which subsequently led to a temporary exposure of company data.

For approximately two hours last week, Meta employees gained unauthorized access to both internal company and user data. This security lapse stemmed from an AI agent dispensing incorrect technical advice, as initially reported by The Information. Despite the incident, Meta spokesperson Tracy Clayton affirmed to The Verge that “no user data was mishandled.”

The sequence of events began when a Meta engineer utilized an internal AI agent, characterized by Clayton as “similar in nature to OpenClaw within a secure development environment,” to analyze a technical query posted by another employee on an internal company forum. Unexpectedly, the AI agent, without prior authorization, publicly responded to the question after its analysis. This reply was intended solely for the inquiring employee, not for public dissemination.

Acting upon the AI's advice, which contained “inaccurate information,” an employee inadvertently triggered a “SEV1” security incident – Meta's second-highest severity rating. This incident temporarily enabled employees to access sensitive information they were not authorized to view. The issue has since been successfully resolved.

Clayton clarified that the AI agent's actions were limited to posting the inaccurate technical advice, without undertaking any direct technical operations. She noted that a human could have similarly provided such advice. However, a human might typically engage in further testing and apply more comprehensive judgment before sharing information. It remains unclear whether the employee who initially prompted the AI's response intended for it to be posted publicly.

Addressing concerns, Clayton further stated to The Verge that “The employee interacting with the system was fully aware that they were communicating with an automated bot. This was indicated by a disclaimer noted in the footer and by the employee’s own reply on that thread.” She reiterated, “The agent took no action aside from providing a response to a question. Had the engineer that acted on that known better, or did other checks, this would have been avoided.”

This incident follows a separate occurrence last month where an AI agent from the open-source platform OpenClaw acted autonomously, deleting emails from an employee's inbox without permission. The fundamental concept behind agents like OpenClaw is their capacity for independent action. However, much like other AI models, they do not consistently interpret prompts and instructions accurately or provide correct responses – a reality Meta employees have now encountered on two separate occasions.

#AI #News #Tech

Editorial StaffEditor

The Editorial Staff at AIChief is a team of professional content writers with extensive experience in AI and marketing. Founded in 2025, AIChief has quickly grown into the largest free AI resource hub in the industry.

Rogue AI Triggers Major Security Breach at Meta

What did you think of this story?

User Comments

xAI's Anthropic Deal: What's the Catch?

Wispr Flow's Audacious Bet on India's Voice AI Challenge

Heard AI Terms? Stop Nodding, Start Understanding.