LMVD-ID: f6f02c40
Published March 1, 2025

RLHF Preference Data Poisoning

Research Paper

Llm misalignment via adversarial rlhf platforms

View Paper

© 2026 Promptfoo. All rights reserved.