shanchen 's Collections

XReasoning - models

https://arxiv.org/abs/2505.22888 ds - means continue post-training on deepseek distilled qwen math 7b limo-{language}-{amount of data}