Model trained to accept and resist persuasion as appropriate, introduced by Stengel-Eskin et al. (2024): arxiv.org/abs/2410.14596

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .