Jade's picture

Jade

euclaise

·

AI & ML interests

None yet

Recent Activity

liked a dataset 5 days ago

trl-lib/tldr-preference

upvoted a paper 8 days ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

upvoted a paper 9 days ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

View all activity

Organizations

euclaise's activity

liked a dataset 5 days ago

trl-lib/tldr-preference

Viewer • Updated Jan 8 • 179k • 275 • 1

upvoted a paper 8 days ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published 9 days ago • 24

upvoted a paper 9 days ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published 15 days ago • 12

liked 4 models 13 days ago

zed-industries/zeta

Updated Feb 27 • 5.1k • 265

featherless-ai/Qwerky-72B

Text Generation • Updated 20 days ago • 1.56k • 48

LGAI-EXAONE/EXAONE-Deep-32B

Text Generation • Updated 28 days ago • 25.3k • 287

yandex/YandexGPT-5-Lite-8B-instruct

Updated 16 days ago • 4.28k • 59

upvoted 2 papers 13 days ago

Multi-Token Attention

Paper • 2504.00927 • Published 15 days ago • 43

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published 16 days ago • 57

upvoted 2 papers 15 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 16 days ago • 61

General Reasoning Requires Learning to Reason from the Get-go

Paper • 2502.19402 • Published Feb 26 • 5

upvoted 2 papers 22 days ago

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Paper • 2410.13276 • Published Oct 17, 2024 • 29

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published 26 days ago • 35

liked 2 datasets 25 days ago

open-r1/codeforces-cots

Viewer • Updated 19 days ago • 254k • 12.3k • 140

glaiveai/reasoning-v1-20m

Viewer • Updated 28 days ago • 22.2M • 13.3k • 192

upvoted 2 papers 26 days ago

BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?

Paper • 2503.15242 • Published 28 days ago • 9

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 27 days ago • 46

upvoted a paper 28 days ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 29 days ago • 137

upvoted 2 papers 30 days ago

Self-Taught Self-Correction for Small Language Models

Paper • 2503.08681 • Published Mar 11 • 13

Shifting Long-Context LLMs Research from Input to Output

Paper • 2503.04723 • Published Mar 6 • 20