PEIYI, WANG

peiyiwang89

AI & ML interests

None yet

Organizations

None yet

peiyiwang89's activity

authored a paper 29 days ago

Chain-of-Thought Tokens are Computer Program Variables

Paper • 2505.04955 • Published 30 days ago • 8

authored a paper 4 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 400

authored a paper 9 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 73

authored a paper 12 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 64

authored 2 papers over 1 year ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 123

Silkie: Preference Distillation for Large Visual Language Models

Paper • 2312.10665 • Published Dec 17, 2023 • 11

updated a model over 1 year ago

peiyiwang89/wav2vec2-large-xls-r-300m-taiwanese-colab

Automatic Speech Recognition • Updated Nov 30, 2023 • 15