gohsyi
·
AI & ML interests
None yet
Organizations
None yet
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch3
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot-epoch2-critic
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot-epoch2
8B
•
Updated
•
41
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch3-critic
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch3
8B
•
Updated
•
8
gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot-epoch1-critic
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot-epoch1
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch2-critic
8B
•
Updated
•
8
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch2
8B
•
Updated
•
27
gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch3-critic
8B
•
Updated
•
7
gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch3
8B
•
Updated
•
8
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch1-critic
8B
•
Updated
•
24
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch1
8B
•
Updated
•
24
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch2-critic
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch2
8B
•
Updated
•
43
gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch2-critic
8B
•
Updated
•
8
gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch2
8B
•
Updated
•
32
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch1-critic
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch1
8B
•
Updated
•
8
gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch1-critic
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch1
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo-math-4shot
Updated
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-math-4shot
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot
8B
•
Updated
•
31
gohsyi/Llama-3.1-8B-Instruct-ppo-gsm8k-4shot
Updated
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot
8B
•
Updated
•
9
gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot
8B
•
Updated
•
6
gohsyi/Meta-Llama-3.1-8B-Instruct-ppo4-ultrafeedback-v0.1-ckpt-step40
Text Generation
•
8B
•
Updated
•
27
gohsyi/Meta-Llama-3.1-8B-Instruct-ppo4-ultrafeedback-v0.1-ckpt-step30
Text Generation
•
8B
•
Updated
•
23
gohsyi/Meta-Llama-3.1-8B-Instruct-ppo4-ultrafeedback-v0.1-ckpt-step20
Text Generation
•
8B
•
Updated
•
12