49 222 1049

Jade

euclaise

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Frankentext: Stitching random text fragments into long-form narratives

upvoted a paper 3 days ago

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

upvoted a paper 3 days ago

OpenThoughts: Data Recipes for Reasoning Models

View all activity

Organizations

euclaise's activity

upvoted 3 papers 3 days ago

upvoted 7 papers 11 days ago

How new data permeates LLM knowledge and how to dilute it

Paper • 2504.09522 • Published Apr 13 • 8

BLEUBERI: BLEU is a surprisingly effective reward for instruction following

Paper • 2505.11080 • Published 26 days ago • 5

Text Generation Beyond Discrete Token Sampling

Paper • 2505.14827 • Published 21 days ago • 10

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Paper • 2505.15778 • Published 20 days ago • 17

Hybrid Latent Reasoning via Reinforcement Learning

Paper • 2505.18454 • Published 18 days ago • 5

HoPE: Hybrid of Position Embedding for Length Generalization in Vision-Language Models

Paper • 2505.20444 • Published 15 days ago • 3

ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper • 2505.23735 • Published 12 days ago • 23

upvoted 3 papers 13 days ago

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published 14 days ago • 26

Exploring the Latent Capacity of LLMs for One-Step Text Generation

Paper • 2505.21189 • Published 14 days ago • 60

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Paper • 2505.19641 • Published 16 days ago • 64

upvoted 7 papers about 1 month ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 85

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 113

Tina: Tiny Reasoning Models via LoRA

Paper • 2504.15777 • Published Apr 22 • 55

RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale

Paper • 2505.03005 • Published May 5 • 32

AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

Paper • 2504.21659 • Published Apr 30 • 12

TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models

Paper • 2504.20605 • Published Apr 29 • 13

Practical Efficiency of Muon for Pretraining

Paper • 2505.02222 • Published May 4 • 37