SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20 • 145
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 160
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published Feb 7 • 142
The Big Benchmarks Collection • Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 229
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published Feb 4 • 65
Open-source DeepResearch – Freeing our search agents Article • By m-ric and 4 others • Feb 4 • 1.26k
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published Jan 29 • 59
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Paper • 2501.18837 • Published Jan 31 • 10
Open-R1: a fully open reproduction of DeepSeek-R1 Article • By eliebak and 2 others • Jan 28 • 867
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Paper • 2501.04001 • Published Jan 7 • 46
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published Jan 7 • 53
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published Jan 1 • 107