Leandro von Werra
lvwerra
AI & ML interests
NLP and RL
Recent Activity
liked
a model
5 days ago
ds4sd/SmolDocling-256M-preview
liked
a model
5 days ago
rasbt/llama-3.2-from-scratch
authored
a paper
7 days ago
SmolVLM: Redefining small and efficient multimodal models
Organizations
lvwerra's activity
another pass
#64 opened about 2 months ago
by
lvwerra

fix-figures
#55 opened about 2 months ago
by
lvwerra

comms-figures
#52 opened about 2 months ago
by
lvwerra

conclusion
#50 opened about 2 months ago
by
lvwerra

very-important-updates
#49 opened about 2 months ago
by
lvwerra

xrsrke/link_nanotron_fp8_appexdix
1
#21 opened about 2 months ago
by
neuralink

xrsrke/fix_width_height_for_fp8_graph
#46 opened about 2 months ago
by
neuralink

xrsrke/add_interactive_fp8_loss_curve
#43 opened about 2 months ago
by
neuralink

small SP fix
#41 opened about 2 months ago
by
lvwerra

appendix-a0
#37 opened about 2 months ago
by
lvwerra

add ressources 1
#32 opened about 2 months ago
by
eliebak

references
#20 opened about 2 months ago
by
lvwerra

memory-layout-widget
#12 opened about 2 months ago
by
lvwerra

plolty-plots
#11 opened about 2 months ago
by
lvwerra

yet-another-fix
#9 opened about 2 months ago
by
lvwerra
