Haoze Wu

WaitHZ

https://waithz.github.io/

AI & ML interests

Modular DL, Complex Reasoning

Recent Activity

authored 2 papers 4 days ago

ReCode: Updating Code API Knowledge with Reinforcement Learning

Paper • 2506.20495 • Published Jun 25 • 8

Model-Task Alignment Drives Distinct RL Outcomes

Paper • 2508.21188 • Published 9 days ago • 8

upvoted a paper 5 days ago

Model-Task Alignment Drives Distinct RL Outcomes

Paper • 2508.21188 • Published 9 days ago • 8

updated a collection 2 months ago

ReCode

Collection

2 items • Updated Jul 21

updated a dataset 2 months ago

zjunlp/ReCode-Train-Data

Viewer • Updated Jul 1 • 1.78k • 21 • 2

published a dataset 2 months ago

zjunlp/ReCode-Train-Data

Viewer • Updated Jul 1 • 1.78k • 21 • 2

liked a dataset 3 months ago

MiniMaxAI/SynLogic

Viewer • Updated Jul 2 • 49.3k • 465 • 92

commented a paper 6 months ago

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6 • 20

upvoted a paper 6 months ago

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6 • 20

commented a paper 6 months ago

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6 • 20

upvoted an article 6 months ago

Open-R1: Update #1

Article • and 7 others • Feb 2 • 305

upvoted a paper 7 months ago

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

Paper • 2502.07490 • Published Feb 11 • 9

upvoted 2 articles 7 months ago

How to generate text: using different decoding methods for language generation with Transformers

Article • Mar 1, 2020 • 243

You could have designed state of the art positional encoding

Article • Nov 25, 2024 • 355

upvoted 2 papers 8 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 49

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 45

commented a paper 8 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 45

upvoted 2 papers 12 months ago

Benchmarking Chinese Knowledge Rectification in Large Language Models

Paper • 2409.05806 • Published Sep 9, 2024 • 15

OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs

Paper • 2409.05152 • Published Sep 8, 2024 • 33
