view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others β’ Mar 10 β’ 144
view post Post 2046 I've been working on something cool: a GRPO with an LLM evaluator that can also perform SFT on the feedback data - if you want. Check it out πAny πare more than welcome π€https://github.com/mkurman/grpo-llm-evaluator See translation π 6 6 + Reply
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 862
view article Article SmolVLM Grows Smaller β Introducing the 250M & 500M Models! By andito and 2 others β’ Jan 23 β’ 180
view article Article Upgrading Kokoro: natural TTS for short bursts By hexgrad β’ Nov 22, 2024 β’ 28
Addition is All You Need for Energy-efficient Language Models Paper β’ 2410.00907 β’ Published Oct 1, 2024 β’ 151