1 19 4

Yifan Zeng

yokey

https://xhmy.github.io/

AI & ML interests

Large Language Model, Agentic AI, Deep Learning

Recent Activity

upvoted a paper about 1 month ago

Reinforcement Pre-Training

updated a collection about 1 month ago

LLM

upvoted a paper about 1 month ago

WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 243

updated a collection about 1 month ago

LLM

Collection

21 items • Updated about 1 month ago

upvoted 2 papers about 1 month ago

WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue

Paper • 2506.01881 • Published Jun 2 • 6

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published May 29 • 92

upvoted a paper 3 months ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published Apr 7 • 25

updated a collection 4 months ago

LLM

Collection

21 items • Updated about 1 month ago

upvoted 2 papers 4 months ago

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published Mar 7 • 58

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 72

upvoted a paper 5 months ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 74

upvoted an article 5 months ago

Article

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

•

Feb 10

• 58

upvoted a paper 6 months ago

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published Jan 10 • 66

liked a model 6 months ago

sfairXC/FsfairX-LLaMA3-RM-v0.1

Text Classification • 8B • Updated Oct 14, 2024 • 1.44k • 59

upvoted a paper 7 months ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 47

upvoted a paper 8 months ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 62

New activity in google/gemma-2-9b 8 months ago

RuntimeError: Index put requires the source and destination dtypes match, got BFloat16 for the destination and Float for the source.

➕ 3

#24 opened about 1 year ago by

saireddy

upvoted 2 papers 8 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 83

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25, 2024 • 18

authored a paper 9 months ago

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Paper • 2410.16033 • Published Oct 18, 2024

liked a model 9 months ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Text Generation • 71B • Updated Apr 13 • 259k • • 2.05k

commented a paper 9 months ago

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Paper • 2410.13828 • Published Oct 17, 2024 • 4 •

Yifan Zeng

AI & ML interests

Recent Activity

Organizations

yokey's activity

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

RuntimeError: Index put requires the source and destination dtypes match, got BFloat16 for the destination and Float for the source.