Celso F

celsowm

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

huggingface/InferenceSupport:ai21labs/AI21-Jamba-Mini-1.7

new activity 5 days ago

huggingface/InferenceSupport:nvidia/OpenReasoning-Nemotron-32B

new activity 6 days ago

HuggingFaceTB/SmolLM3-3B:Infinity reasoning with this prompt

Organizations

None yet

upvoted a collection about 2 months ago

StarVector SVG Datasets (🏆SVG-Bench)

Datasets for training and evaluating SVG generation models • 11 items • Updated Jan 12 • 19

upvoted a paper about 2 months ago

GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Paper • 2505.20355 • Published May 26 • 36

upvoted a paper 2 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 220

upvoted a collection 3 months ago

Qwen3

74 items • Updated 4 days ago • 910

upvoted 2 collections 4 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 584

Portuguese LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: • 16 items • Updated 19 minutes ago • 36

upvoted 2 papers 4 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 134

Long Context Tuning for Video Generation

Paper • 2503.10589 • Published Mar 13 • 14

upvoted a paper 9 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 67

upvoted an article 12 months ago

Welcome FalconMamba: The first strong attention-free 7B model

Article • Published Aug 12, 2024 • 112

upvoted an article about 1 year ago

Fine-tune Llama 3 with ORPO

Article • Published Apr 22, 2024 • 238

upvoted a collection over 1 year ago

Gemma release

Groups the Gemma models released by the Google team. • 40 items • Updated 15 days ago • 339

upvoted a paper about 2 years ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 80