Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition Paper • 2506.17201 • Published 7 days ago • 43
WORLDMEM: Long-term Consistent World Simulation with Memory Paper • 2504.12369 • Published Apr 16 • 34
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 152
DistilBERT release Collection Original DistilBERT model, with checkpoints obtained through teacher-student learning from the original BERT checkpoints. • 6 items • Updated Apr 17, 2024 • 24
DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis Paper • 2309.12792 • Published Sep 22, 2023 • 1
MM-LLMs: Recent Advances in MultiModal Large Language Models Paper • 2401.13601 • Published Jan 24, 2024 • 49
Zurich 1.5B (GGUF) Collection Quantized versions of Zurich 1.5B Model Collection, compatible with llama.cpp. Quantized by mradermacher - Fine-tuned from Qwen 2.5 1.5B Instruct • 12 items • Updated Feb 15 • 3
Geneva 12B (GGUF) Collection Quantized versions of Geneva 12B Model Collection, compatible with llama.cpp. Quantized by mradermacher - Fine-tuned from Mistral NeMo Instruct 2407 • 12 items • Updated Feb 15 • 3
Zurich 14B (GGUF) Collection Quantized versions of Zurich 14B Model Collection, compatible with llama.cpp. Quantized by mradermacher - Fine-tuned from Qwen 2.5 14B Instruct • 12 items • Updated Feb 15 • 3
Zurich 7B (GGUF) Collection Quantized versions of Zurich 7B Model Collection, compatible with llama.cpp. Quantized by mradermacher - Fine-tuned from Qwen 2.5 7B Instruct • 12 items • Updated Feb 15 • 4
Zurich 1.5B Collection The Zurich 1.5B Model Collection - Fine-tuned from Qwen 2.5 1.5B Instruct with GammaCorpus v2. • 6 items • Updated Feb 4 • 3
GammaCorpus (CoT) Collection The GammaCorpus Dataset Collection for CoT (Chain of Thought) • 1 item • Updated Feb 4 • 9