tarob0ba's picture

5 2 18

tarob0ba

tarob0ba

·

https://b0ba.dev

tarob0ba

AI & ML interests

neural machine translation & improving helpfulness of llms

Organizations

tarob0ba's activity

upvoted a paper about 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 134

upvoted a collection about 2 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 195