gohsyi
AI & ML interests: None yet
Organizations: None yet
models (239)
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch3-critic • 7B • Updated • 14
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch3 • 7B • Updated • 10
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch3-critic • 7B • Updated • 10
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch3 • 7B • Updated • 10
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch2-critic • 7B • Updated • 22
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch2 • 7B • Updated • 46
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch2-critic • 7B • Updated • 10
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch2 • 7B • Updated • 36
gohsyi/Llama-3.1-8B-Instruct-ppo-math-4shot-epoch3-critic • 8B • Updated • 10
gohsyi/Llama-3.1-8B-Instruct-ppo-math-4shot-epoch3 • 8B • Updated • 36
datasets (33)
gohsyi/Mistral-7B-Instruct-v0.3-gsm8k-4shot.jsonl • Viewer • Updated • 29.9k • 10
gohsyi/gemma-1.1-7b-it-gsm8k-4shot.jsonl • Viewer • Updated • 29.9k • 11
gohsyi/samples_gsm8k_cot_2024-12-04T19-03-11.038885.jsonl • Viewer • Updated • 2.64k • 12
gohsyi/meta-llama__Llama-3.1-8B-Instruct • Preview • Updated • 27
Viewer • Updated • 8.79k • 11
Viewer • Updated • 12.5k • 31
gohsyi/Llama-3.2-3B-Instruct-ultrafeedback-4k-overweight • Viewer • Updated • 4.1k • 35
gohsyi/Llama-3.2-3B-Instruct-ultrafeedback-4k-underweight • Viewer • Updated • 4.1k • 32
gohsyi/Llama-3.1-8B-Instruct-ultrafeedback-4k-overweight • Viewer • Updated • 4.1k • 32
gohsyi/Llama-3.1-8B-Instruct-ultrafeedback-4k-underweight • Viewer • Updated • 4.1k • 29