Models fine-tuned for multiple choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
updated
a dataset
1 day ago
ricdomolm/r1-short-thoughts
published
a dataset
1 day ago
ricdomolm/r1-short-thoughts
updated
a dataset
2 days ago
ricdomolm/r1-thoughts
Organizations
None yet
Collections
1
models
69

ricdomolm/smollm
Updated

ricdomolm/llama-3.2-1b-it-test
Updated

ricdomolm/pythia-1.4b-sft-gsm8k-3e
Text Generation
•
Updated
•
379

ricdomolm/pythia-1.4b-sft-gsm8k-1e
Text Generation
•
Updated
•
24

ricdomolm/ml4331-reward-model
Text Generation
•
Updated
•
214

ricdomolm/ml4331-reward-model2
Text Generation
•
Updated
•
8

ricdomolm/ml4331-dpo-model
Text Generation
•
Updated
•
188

ricdomolm/ml4331-instruction-model
Text Generation
•
Updated
•
258

ricdomolm/test-model
Updated

ricdomolm/SmolLM2-135M-SFT-Alpaca
Updated
datasets
21
ricdomolm/r1-short-thoughts
Viewer
•
Updated
•
14k
•
37
ricdomolm/r1-thoughts
Viewer
•
Updated
•
431k
•
8
ricdomolm/NuminaMath-CoT
Viewer
•
Updated
•
851k
•
31
ricdomolm/OpenMAthInstruct-2-AUGMATH-Deduped
Viewer
•
Updated
•
519k
•
41
ricdomolm/gsm8k
Viewer
•
Updated
•
8.79k
•
56
ricdomolm/MATH-500
Viewer
•
Updated
•
12.5k
•
95
ricdomolm/caselawqa_leaderboard_results
Updated
•
1.51k
ricdomolm/caselawqa_leaderboard_requests
Viewer
•
Updated
•
29
•
1.44k
ricdomolm/lawma-instructions_gemma2_8k
Viewer
•
Updated
•
554k
•
89
ricdomolm/lawma-instructions_llama3_16k
Viewer
•
Updated
•
554k
•
56