Suyuchen Wang PRO

sheryc

https://suyuchen.wang/

AI & ML interests

Playing with LLMs

Recent Activity

upvoted a paper 1 day ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

authored a paper 4 days ago

System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts

upvoted a paper 6 days ago

System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts

sheryc's activity

upvoted a paper 1 day ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 3 days ago • 123

upvoted a paper 6 days ago

System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts

Paper • 2505.18962 • Published 12 days ago • 12

upvoted 2 papers 8 days ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published 9 days ago • 91

GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks

Paper • 2504.12764 • Published Apr 17 • 41

upvoted a paper 17 days ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published 20 days ago • 116

upvoted a paper 21 days ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 21 days ago • 118

upvoted a paper 2 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 285

upvoted 2 papers 4 months ago

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published Feb 12 • 57

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published Feb 3 • 39

upvoted a paper 6 months ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 14

upvoted a paper 9 months ago

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Paper • 2409.00509 • Published Aug 31, 2024 • 43

upvoted a collection 12 months ago

VisionLM

Collection

1219 items • Updated 1 day ago • 72

upvoted a paper 12 months ago

VCR: Visual Caption Restoration

Paper • 2406.06462 • Published Jun 10, 2024 • 13

upvoted 4 papers about 1 year ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31, 2024 • 68

The Road Less Scheduled

Paper • 2405.15682 • Published May 24, 2024 • 28

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 92

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11, 2024 • 55