22 16 45

Noob

noobmldude

AI & ML interests

Explainable AI

Recent Activity

liked a dataset 9 days ago

LingoIITGN/COMI-LINGUA

new activity 10 days ago

mistralai/Devstral-Small-2505:Finetuning further

liked a model 12 days ago

FractalAIResearch/Fathom-R1-14B

View all activity

Organizations

noobmldude's activity

liked a dataset 9 days ago

LingoIITGN/COMI-LINGUA

Viewer • Updated 5 days ago • 126k • 666 • 4

New activity in mistralai/Devstral-Small-2505 10 days ago

Finetuning further

#21 opened 10 days ago by

noobmldude

liked a model 12 days ago

FractalAIResearch/Fathom-R1-14B

Text Generation • Updated 5 days ago • 14.4k • • 255

liked a Space about 1 month ago

WikiRacing Language Models

🏃

Find answers by racing against LLM in a quiz game

liked a model about 1 month ago

JetBrains/Mellum-4b-base

Text Generation • Updated May 7 • 9.26k • 366

upvoted an article about 2 months ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

and 8 others •

Apr 29, 2024

• 78

New activity in mistralai/Codestral-22B-v0.1 about 2 months ago

🚩 Report: Not working

#55 opened 5 months ago by

garbagedog2

liked a Space about 2 months ago

On-Device LLM Throughput Calculator

🚀

Generate throughput plots for LLMs on devices

upvoted 2 papers 2 months ago

LocAgent: Graph-Guided LLM Agents for Code Localization

Paper • 2503.09089 • Published Mar 12 • 13

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

liked a dataset 2 months ago

bigcode/self-oss-instruct-sc2-exec-filter-50k

Viewer • Updated Nov 4, 2024 • 50.7k • 361 • 99

liked a Space 2 months ago

197

Check My Progress Deep RL Course

👀

Check your progress in a Deep RL course

New activity in nanotron/ultrascale-playbook 2 months ago

How to download as pdf?

👀 7

#74 opened 4 months ago by

vcoyk

upvoted 3 papers 3 months ago

2BP: 2-Stage Backpropagation

Paper • 2405.18047 • Published May 28, 2024 • 27

UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function

Paper • 2410.21438 • Published Oct 28, 2024 • 2

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published Mar 13 • 29

liked a Space 4 months ago

2.67k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in hexgrad/Kokoro-82M 4 months ago

Free kokoro TTS endpoint - no strings attached.

🔥 8

#37 opened 5 months ago by

mhenrichsen

liked a Space 4 months ago

769

TTS Arena V2

🏆

Vote on the latest TTS models!

upvoted a paper 4 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 232