Nicolas Le Roux

nicolouchka

AI & ML interests

None yet

upvoted a paper 4 months ago

Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs

Paper • 2503.14286 • Published Mar 18 • 2

upvoted a paper about 2 years ago

Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference

Paper • 2306.12509 • Published Jun 21, 2023 • 14