See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
AI & ML interests
None yet
Organizations
Collections
3
-
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 4 • 5 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 6 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 2 -
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper • 2405.19332 • Published • 23
models
28
ZhangShenao/Mistral-7B-Instruct-v0.2-oldmepoch3-iter-1
Text Generation
•
Updated
ZhangShenao/Meta-Llama-3-8B-Instruct-m-iter-1_sample_7000
Updated
ZhangShenao/Meta-Llama-3-8B-Instruct-e-iter-1_sample_7000
Updated
ZhangShenao/Meta-Llama-3-8B-Instruct-m-iter-1_sample_1000
Text Generation
•
Updated
ZhangShenao/Meta-Llama-3-8B-Instruct-e-iter-1_sample_1000
Updated
ZhangShenao/mistraldec31-m-1k
Updated
•
1
ZhangShenao/Meta-Llama-3-8B-Instruct-sft
Text Generation
•
Updated
ZhangShenao/Mistral-7B-Instruct-v0.2-sft
Text Generation
•
Updated
ZhangShenao/Llama-3.1-8B-Instruct-m-iter-2
Updated
ZhangShenao/Mistral-7B-Instruct-v0.2-oldmepoch1-iter-1
Text Generation
•
Updated
datasets
20
ZhangShenao/sft-Mistral-7B-Instruct-v0.2
Viewer
•
Updated
•
1k
•
12
ZhangShenao/new-Meta-Llama-3-8B-Instruct-iter1_sample_7000
Viewer
•
Updated
•
7k
•
14
ZhangShenao/new-Meta-Llama-3-8B-Instruct-iter1_sample_1000
Viewer
•
Updated
•
1k
•
15
ZhangShenao/sft-Meta-Llama-3-8B-Instruct
Viewer
•
Updated
•
1k
•
16
ZhangShenao/new-Llama-3.1-8B-Instruct-iter2
Viewer
•
Updated
•
4k
•
16
ZhangShenao/new-Llama-3.1-8B-Instruct-iter1
Viewer
•
Updated
•
4k
•
15
ZhangShenao/new-Meta-Llama-3-8B-Instruct-iter2
Viewer
•
Updated
•
4k
•
17
ZhangShenao/new-Meta-Llama-3-8B-Instruct-iter1
Viewer
•
Updated
•
4k
•
16
ZhangShenao/new-gemma-2-9b-it-iter1
Viewer
•
Updated
•
2k
•
15
ZhangShenao/new-gemma-1.1-7b-it-iter1
Viewer
•
Updated
•
4k
•
15