See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Organizations
Collections
3
-
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 7 • 5 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 9 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 9 -
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper • 2405.19332 • Published • 23
models
10
ZhangShenao/SELM-Phi-3-mini-4k-instruct-iter-1
Text Generation
•
Updated
•
6
ZhangShenao/SELM-Phi-3-mini-4k-instruct-iter-2
Text Generation
•
Updated
•
6
ZhangShenao/SELM-Phi-3-mini-4k-instruct-iter-3
Text Generation
•
Updated
•
9
•
1
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation
•
Updated
•
9
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation
•
Updated
•
9
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation
•
Updated
•
7
•
5
ZhangShenao/DPO-Zephyr-7B
Text Generation
•
Updated
•
12
ZhangShenao/SELM-Zephyr-7B-iter-1
Text Generation
•
Updated
•
5
ZhangShenao/SELM-Zephyr-7B-iter-2
Text Generation
•
Updated
•
9
ZhangShenao/SELM-Zephyr-7B-iter-3
Text Generation
•
Updated
•
5
•
3
datasets
0
None public yet