-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 147 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper • 2401.08967 • Published • 30 -
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 23 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 69
Collections
Discover the best community collections!
Collections including paper arxiv:2502.01456
-
Orca 2: Teaching Small Language Models How to Reason
Paper • 2311.11045 • Published • 73 -
In-Context Former: Lightning-fast Compressing Context for Large Language Model
Paper • 2406.13618 • Published -
ViPer: Visual Personalization of Generative Models via Individual Preference Learning
Paper • 2407.17365 • Published • 12 -
KAN or MLP: A Fairer Comparison
Paper • 2407.16674 • Published • 43