PAD: Personalized Alignment at Decoding-time. ICLR 2025
This repo contains the personalized reward model (PerRM) for alignment.
Our paper: https://openreview.net/pdf?id=e7AUJpP8bV
Model: RuizheChen/PAD
Base model: meta-llama/Meta-Llama-3-8B
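The page does not say how the PerRM checkpoint should be loaded, so the following is only a minimal sketch of fetching the repository files locally with `huggingface_hub`. The helper name `download_perrm` is hypothetical, and the sketch assumes the repo is publicly accessible under the ID shown above; how to instantiate the reward model from the downloaded files is left to the paper and its code.

```python
# Sketch only: fetches the repo files; it does not load or run the reward model.
REPO_ID = "RuizheChen/PAD"  # repository ID as listed on this page


def download_perrm(local_dir=None):
    """Download all files of the PAD repo and return the local directory path.

    `download_perrm` is a hypothetical helper name used for illustration.
    """
    # Imported lazily so the dependency is only needed when downloading.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)


# Example (requires network access and `pip install huggingface_hub`):
# path = download_perrm()
# print(path)
```

Because the base model is meta-llama/Meta-Llama-3-8B, the checkpoint is likely several gigabytes, so it may be worth passing an explicit `local_dir` on a disk with enough space.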