ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled and 1 other • Oct 14, 2024 • 77
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 21 days ago • 217
sentence-transformers/msmarco-bert-base-dot-v5 Sentence Similarity • Updated 7 days ago • 215k • • 16
sentence-transformers/multi-qa-mpnet-base-dot-v1 Sentence Similarity • Updated Nov 5, 2024 • 1.7M • • 164
Medical QA Datasets Collection A collection of medical question answering (QA) datasets • 23 items • Updated 19 days ago • 33
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 119
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback Paper • 2402.01391 • Published Feb 2, 2024 • 42