
Anthropic reveals that as few as ‘250 malicious documents’ are all it takes to poison an LLM’s training data, regardless of model size


Claude-creator Anthropic has found that it is actually easier to ‘poison’ Large Language Models than previously thought. In a recent blog post, Anthropic explains that as few as “250 malicious documents can produce a ‘backdoor’ vulnerability in a large language model”, regardless of model size or training data volume.

These findings arose from a joint study between Anthropic, the Alan Turing Institute, and the UK AI Security Institute. It was previously thought that bad actors would need to control a much larger percentage of an LLM’s training data to influence its behaviour, but these recent findings suggest it is actually much easier than that.
