News Summary: New Study Shows Poetry Can Bypass AI Guardrails; Character AI Shifts Its Teen Strategy
Authoritarian governments, and commentators on Lord Byron alike, have long suspected that poets might be the most dangerous people in society. Indeed, one of my favorite novels, Bolaño’s doorstop The Savage Detectives, has this fear at its heart. A new study suggests there might be something in that after all. The paper, catchily titled “Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models,” can essentially be summed up as: “AI will teach you how to do naughty things if you ask it in poetry.”
