
Pandamonium (experimental)

The Pandamonium (Prompt-based Automation for Navigating Discovery of Attacks, Misuse, Opportunistic Nefarious Intents, and Uncovering Model Exploits) strategy is an advanced automated red teaming technique. It dynamically generates single-turn or multi-turn conversations designed to bypass a target model's safety measures.

warning

This is an experimental strategy currently in development by the Promptfoo team.

warning

This strategy does not have a token limit and will continue to run until it finds a jailbreak or is stopped.

To enable it, add it to the strategies list in your promptfooconfig.yaml:

strategies:
  - id: pandamonium
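
For context, here is a minimal sketch of how that entry might sit in a fuller promptfooconfig.yaml. It assumes the usual layout in which red team settings live under a top-level redteam key. The plugin shown is only an illustrative placeholder and is not specific to Pandamonium, so substitute whichever plugins you actually test with.

```yaml
# Sketch only: assumes strategies are nested under the redteam key.
redteam:
  plugins:
    # Placeholder plugin for illustration; replace with your own selection.
    - harmful:hate
  strategies:
    # The strategy described on this page.
    - id: pandamonium
```

Because the strategy has no token limit, it may be worth starting with a small plugin selection like this one and watching usage before widening the scope.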