arxiv:2405.15589
Sophie Xhonneux
sophiex
AI & ML interests
LLM alignment and adversarial attacks/robustness
Organizations
Papers
1
models
6
sophiex/onlinedpo_pythia2.8b_tldr6.9
Updated
•
1
sophiex/dpo_pythia1b_hh_rlhf.yml_local_29-04-24_13-31-33_xxxxx
Updated
•
4
sophiex/dpo_pythia1b_hh_rlhf.yml_local_27-04-24_21-57-03_xxxxx
Updated
sophiex/config_name_xxxxx
Updated
sophiex/pythia-1b-sft_hh_rlhf
Text Generation
•
Updated
•
12
sophiex/pythia-410m-sft_hh_rlhf
Updated