mlfoundations-dev/Qwen2.5-7B-Instruct_qwq_mix_qwen3_science Text Generation • 8B • Updated about 6 hours ago
Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets Paper • 2506.04598 • Published 23 days ago • 5
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models Paper • 2406.02061 • Published Jun 4, 2024 • 1
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 53
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 22 days ago • 42