Predict human preference to LLM responses.
Binfeng Xu
billxbf
AI & ML interests
None yet
Organizations
None yet
Collections
2
models
9

billxbf/nemo-sft-orpo
Updated
•
4

billxbf/chai-nemo13b-sft-orpo-merge_v2
Text Generation
•
Updated
•
11

billxbf/chai-nemo-sft-orpo-merge
Text Generation
•
Updated
•
9

billxbf/wsdm-qwen14b_dare_dslerp-gptq-q4
Text Classification
•
Updated
•
7

billxbf/phi4_4k_dare
Text Classification
•
Updated
•
9

billxbf/wsdm-qwen14b_dare_dslerp
Text Classification
•
Updated
•
4

billxbf/bulla_7b
Updated
•
3

billxbf/mmos-deepseek-math-7b
Text Generation
•
Updated
•
11

billxbf/specialized-rewoo-planner-7b
Updated
datasets
10
billxbf/aimo_hard_bilingual
Viewer
•
Updated
•
3.56k
•
24
billxbf/aimo-hard-bilingual
Updated
•
8
billxbf/aimo-dataset
Viewer
•
Updated
•
3.79k
•
34
billxbf/aimo-math-problems
Viewer
•
Updated
•
19.2k
•
31
billxbf/lmsys61k
Viewer
•
Updated
•
110k
•
35
billxbf/ppt127k
Viewer
•
Updated
•
127k
•
30
billxbf/arxiv_dump
Viewer
•
Updated
•
11.1k
•
49
•
1
billxbf/yfdump_5m
Viewer
•
Updated
•
5.18M
•
29
billxbf/rewoo-instruction-finetuning
Viewer
•
Updated
•
2.04k
•
18
•
2
billxbf/sotu2023-qa
Viewer
•
Updated
•
876
•
16