Sophie Xhonneux

sophiex
AI & ML interests

LLM alignment and adversarial attacks/robustness

Organizations

Mila Iterative DPO, ContinuousAT, Redflag Tokens

Papers 1

arxiv:2405.15589

Models 7

sophiex/SFT-ATTACK

Updated Jan 7

sophiex/onlinedpo_pythia2.8b_tldr6.9

Updated Sep 30, 2024 • 8

sophiex/dpo_pythia1b_hh_rlhf.yml_local_29-04-24_13-31-33_xxxxx

Updated Apr 29, 2024 • 3

sophiex/dpo_pythia1b_hh_rlhf.yml_local_27-04-24_21-57-03_xxxxx

Updated Apr 28, 2024

sophiex/config_name_xxxxx

Updated Apr 27, 2024

sophiex/pythia-1b-sft_hh_rlhf

Text Generation • Updated Apr 27, 2024 • 14

sophiex/pythia-410m-sft_hh_rlhf

Updated Apr 26, 2024

Datasets 1

sophiex/hh-rlhf

Viewer • Updated Apr 12, 2024 • 169k • 33