Hackers continue to exploit Meta’s AI agents

This audio is auto-generated. Please let us know if you have feedback.

Metaâ€™s test case of empowering artificial intelligence systems to do the work of human staff isnâ€™t going so great.

Over the weekend, reports emerged that hackers had been able to gain access to many Instagram usersâ€™ accounts by simply asking Metaâ€™s AI assistance bot to re-assign the email address for each account. Thatâ€™s all it took for Metaâ€™s AI system to hand over control to multiple accounts, including some with large followings. Meanwhile, Meta also reportedly, didnâ€™t have staff available to review the issue as it arose.

Thatâ€™s because Meta has been on a major staff cull over the past six months, as it works to rationalize costs due to its massive investment in AI infrastructure. Metaâ€™s long-term view is that AI tools will eventually be able to take over from human staff, and undertake most of its internal engineering work. But it seems like Metaâ€™s initial experiments here arenâ€™t providing much assurance.

And while Meta said the issue had been addressed, many hackers are still using this exploit to steal IG account access, according to Android Authority.

Meanwhile, TechCrunch reported on June 3 that Instagram is now informing users impacted about the breach, and that those users will need to update their passwords as a result.

Meta hasnâ€™t said how many accounts were impacted, but the ongoing problem, amid ongoing staff reductions, has raised concerns about Metaâ€™s potential overreliance on AI tools for its own operations. Itâ€™s also brought up questions about how much trust Meta has placed in its AI models to undertake human tasks, given the security risks.

A key challenge here is that by empowering AI agents to undertake actions, and change internal systems, thereâ€™s actually no way of knowing what they might do, and how they could be persuaded to take action.

Hackers continue to exploit Meta’s AI agents

AI tools have been trained on human conversation, and various examples have shown that they can easily be tricked into lying, making up sources or breaking their own rules through user prompts.

And because engineers are dealing with a potentially infinite scope of commands, thereâ€™s really no way to lock this down entirely. These tools can be asked to undertake commands in a broad range of ways, using all types of different languages, so itâ€™s impossible for Meta, or indeed any AI developer, to guarantee that such systems wonâ€™t be manipulated. Thatâ€™s because developers canâ€™t simply block a single command type, or remove a function button, to limit misuse.

If AI agents are empowered to take action on a userâ€™s behalf, they will be susceptible to this type of exploit, and restricting such will be a never-ending process. Itâ€™s essentially a whack-a-mole type approach, because engineers can only restrict a line of questioning once they become aware of it. There are now so many ways that people can ask a question that the task becomes unmanageable, in a practical sense.

This happened with Xâ€™s efforts to limit the use of its chatbot to generate nude images. X tried to limit such requests, but users still asked it to nudify images. X seemingly admitting defeat and moved to restrict access to paying users only, which has limited the scope of risk.

But companies canâ€™t just stop this activity outright, because thereâ€™s no single command line that defines such action. Users will come up with more ways to trick bots, using alternate parameters that will enable hackers to bend the systemâ€™s programming based on flawed AI logic.

Because thatâ€™s what AI tools are designed to do: Interpret broader context, and take action based on conversational queries.

As such, the potential problem here is as unlimited as human language in communicating a task, and thatâ€™s a much bigger challenge for engineers to solve.

But what does that then mean for expanded application of Metaâ€™s AI systems, and the capacity for Meta to make money from its AI tools? The answer remains to be seen.

Related posts

Leave a Comment Cancel reply

Related posts

What’s fueling AI companies’ IPO rush

This Streamer Has Convinced the Internet Guy Fieri Never Swallows Food on TV

World Cup tests marketers’ ability to prove value of sponsorships

Leave a Comment Cancel reply