This model was trained for the cross-domain generalisation experiments in the Reasoning Gym paper.

Safetensors
Model size: 3.09B params
Tensor type: F32

Model tree for OllieStanley/Qwen2.5-3B-Instruct-RG-Algorithmic

Base model: Qwen/Qwen2.5-3B
Finetuned (612): this model