Knut Jägersberg

KnutJaegersberg · jagersbergknut

AI & ML interests

NLP, opinion mining, narrative intelligence

Recent Activity

upvoted a collection 1 day ago

liked a model 3 days ago

Alibaba-NLP/WebSailor-3B

liked a model 3 days ago

unsloth/Devstral-Small-2507-GGUF

upvoted a collection 1 day ago

MLM vs CLM

65 items • Updated 11 days ago • 1

upvoted 2 collections 4 days ago

💧 LFM2

LFM2 is a new generation of hybrid models, designed for edge AI and on-device deployment. • 9 items • Updated 1 day ago • 62

ThinkPRM

Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 • 8 items • Updated 25 days ago • 3

upvoted a collection 6 days ago

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 10 items • Updated 3 days ago • 55

upvoted an article 6 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By … and 22 others • 6 days ago • 498

upvoted a collection 6 days ago

POLAR

5 items • Updated 5 days ago • 9

upvoted a paper 9 days ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published 12 days ago • 50

upvoted a paper 10 days ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published 12 days ago • 50

upvoted a collection 12 days ago

Reward Models

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 3 days ago • 15

upvoted a collection 19 days ago

Weaver

The models and datasets for Weaver: Shrinking the Generation-Verification Gap with Weak Verifiers • 21 items • Updated 19 days ago • 1

upvoted a paper 30 days ago

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Paper • 2503.05139 • Published Mar 7 • 3

upvoted 2 papers about 1 month ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 243

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30 • 51

upvoted 3 collections about 1 month ago

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 22 days ago • 68

Common Pile v0.1 Filtered Data

An LLM pre-training dataset produced by filtering and deduplicating the raw text collected in the Common Pile v0.1 • 31 items • Updated Jun 6 • 13

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 36 items • Updated 12 days ago • 142

upvoted a collection about 2 months ago

One-RL-to-See-Them-All

https://github.com/MiniMax-AI/One-RL-to-See-Them-All • 5 items • Updated May 26 • 14

upvoted 2 papers about 2 months ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 23

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

Paper • 2502.02481 • Published Feb 4 • 15

upvoted a collection about 2 months ago

Falcon-H1

Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 37 items • Updated about 1 month ago • 47