Auto-formalized versions of GSM8K and MATH500, formalized and filtered with Goedel models
Ujan PRO
Ujan
AI & ML interests
NLP, Speech
Recent Activity
updated a dataset 7 minutes ago
Ujan/math500_formal_eval_NVIDIA-Nemotron-Nano-12B-v2_prover
published a dataset 7 minutes ago
Ujan/math500_formal_eval_NVIDIA-Nemotron-Nano-12B-v2_prover
updated a dataset 31 minutes ago
Ujan/math500_formal_eval_Olmo-3-7B-Think_prover
Organizations
Formal v1
Auto-formalized versions of GSM8K, produced with the state-of-the-art Goedel-Prover-V2 and filtered using DeepSeek-Prover-V2
- Ujan/gsm8k_formal_goedel_few_shot_filtered_Goedel-Prover-V2-32B
  Viewer • Updated • 1.18k • 33
- Ujan/gsm8k_formal_goedel_zero_shot_filtered_DeepSeek-Prover-V2-7B
  Viewer • Updated • 1.15k • 22
- Ujan/gsm8k_formal_goedel_zero_shot_filtered_Goedel-Prover-V2-32B
  Viewer • Updated • 1.23k • 8
- Ujan/gsm8k_formal_goedel_few_shot
  Viewer • Updated • 1.3k • 6
Formal v2
Auto-formalized versions of GSM8K and MATH500, formalized and filtered with Goedel models
models 8
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_2048_epoch_1
Text Generation • 4B • Updated • 3
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_16384_epoch_1
Text Generation • 4B • Updated • 1
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_8192_epoch_1
Text Generation • 4B • Updated • 4
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_4096_epoch_1
Text Generation • 4B • Updated • 4
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_16384_epoch_1
Text Generation • 4B • Updated • 1
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_4096_epoch_1
Text Generation • 4B • Updated • 2
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_8192_epoch_1
Text Generation • 4B • Updated • 2
Ujan/whisper-small_moe_k_means
Automatic Speech Recognition • Updated • 6
datasets 78
Ujan/math500_formal_eval_NVIDIA-Nemotron-Nano-12B-v2_prover
Viewer • Updated • 63
Ujan/math500_formal_eval_Olmo-3-7B-Think_prover
Viewer • Updated • 56
Ujan/math500_formal_eval_Ministral-3-8B-Reasoning-2512_prover
Viewer • Updated • 7
Ujan/math500_formal_eval_Olmo-3-7B-Think
Viewer • Updated • 71
Ujan/math500_formal_eval_Qwen3-8B_prover
Viewer • Updated • 128
Ujan/math500_formal_eval_Qwen3.5-9B_prover
Viewer • Updated • 157
Ujan/math500_formal_eval_Qwen3-4B-Thinking-2507_prover
Viewer • Updated • 144
Ujan/math500_formal_eval_Qwen3.5-9B
Viewer • Updated • 174
Ujan/math500_formal_eval_Qwen3-8B
Viewer • Updated • 148 • 1
Ujan/math500_formal_eval_Qwen3-4B-Thinking-2507
Viewer • Updated • 154 • 12