Alexandre Rame's picture

4 7 2

Alexandre Rame

alexrame

·

https://alexrame.github.io/

AI & ML interests

None yet

Organizations

None yet

authored a paper 4 months ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 53

authored a paper 5 months ago

On Teacher Hacking in Language Model Distillation

Paper • 2502.02671 • Published Feb 4 • 18

authored a paper 9 months ago

Diversity-Rewarded CFG Distillation

Paper • 2410.06084 • Published Oct 8, 2024 • 10

authored a paper 11 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31, 2024 • 78

authored 2 papers 12 months ago

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22, 2024 • 10

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19, 2024 • 19

authored a paper about 1 year ago

WARP: On the Benefits of Weight Averaged Rewarded Policies

Paper • 2406.16768 • Published Jun 24, 2024 • 23

authored 2 papers over 1 year ago

Direct Language Model Alignment from Online AI Feedback

Paper • 2402.04792 • Published Feb 7, 2024 • 33

WARM: On the Benefits of Weight Averaged Reward Models

Paper • 2401.12187 • Published Jan 22, 2024 • 20

authored a paper almost 2 years ago

Unified Model for Image, Video, Audio and Language Tasks

Paper • 2307.16184 • Published Jul 30, 2023 • 15