rl-papers
Boosting Tool Use of Large Language Models via Iterative Reinforced Fine-Tuning • Paper • 2501.09766 • Published Jan 15
Multi-lingual
google/smol • Updated Mar 3 • 811k • 1.29k • 57
google/wmt24pp • Updated Mar 13 • 54.9k • 3.93k • 41
Helsinki-NLP/opus_books • Updated Mar 29, 2024 • 1.25M • 9.91k • 72
cfilt/iitb-english-hindi • Updated Dec 30, 2023 • 1.66M • 1.91k • 57