Deepseek Papers Collection Deepseek papers collection • 14 items • Updated about 1 month ago • 28
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 23 days ago • 292
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 565
view article Article BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18, 2024 • 43
YaRN: Efficient Context Window Extension of Large Language Models Paper • 2309.00071 • Published Aug 31, 2023 • 67