ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

This article introduces ArtPrompt, a novel ASCII art-based jailbreak attack that bypasses the safety measures of large language models (LLMs) to induce undesired behaviours. The attack masks safety-sensitive words in a prompt by rendering them as ASCII art, highlighting the limitations of current LLMs in recognising non-semantic forms of text and posing significant security challenges. For more on harmful ASCII art, see: ASCII art elicits harmful responses from 5 major AI chatbots.
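As a rough illustration of the masking idea (a minimal sketch, not the authors' implementation), the snippet below renders a word as ASCII art using the third-party `pyfiglet` library. The placeholder word, function name, and font choice are assumptions for illustration only; the point is simply that a term rendered this way no longer appears as a plain-text token.

```python
# Minimal sketch of the ASCII-art "cloaking" step the attack relies on.
# Assumes the third-party `pyfiglet` library (pip install pyfiglet).
import pyfiglet


def cloak_word(word: str, font: str = "standard") -> str:
    """Render `word` as multi-line ASCII art using a FIGlet font."""
    return pyfiglet.figlet_format(word, font=font)


if __name__ == "__main__":
    # Benign placeholder word; illustrates the rendering only.
    print(cloak_word("EXAMPLE"))
```

A model that processes text token by token typically fails to read the rendered word, which is exactly the recognition gap the article describes.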

Visit Original Article →