Lizard: An Efficient Linearization Framework for Large Language Models Paper • 2507.09025 • Published 11 days ago • 16
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated 11 days ago • 249
MiniMax-M1 Collection MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 19 days ago • 108
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA By ybelkada and 4 others • May 24, 2023 • 158
Jamba 1.6 Collection The AI21 Jamba family of models are hybrid SSM-Transformer foundation models, outperforming open model competitors on quality and speed. • 2 items • Updated Mar 6 • 15
Portuguese LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: • 16 items • Updated 3 minutes ago • 34
Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 877
Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.28k
Tucano: Advancing Neural Text Generation for Portuguese Paper • 2411.07854 • Published Nov 12, 2024 • 7
Article Total noob’s intro to Hugging Face Transformers By 2legit2overfit • Mar 22, 2024 • 85
Large Language Models in Biomedical and Health Informatics: A Bibliometric Review Paper • 2403.16303 • Published Mar 24, 2024 • 1
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated 12 days ago • 10
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31, 2024 • 552