Haoxiang Wang's picture

Haoxiang Wang

Haoxiang-Wang

·

https://haoxiang-wang.github.io/

AI & ML interests

Machine Learning (Transfer Learning, OOD Generalization, Domain Adaptation, Meta-Learning)

Recent Activity

updated a model 11 days ago

nvidia/NFT-32B

published a model 11 days ago

nvidia/NFT-32B

published a model 11 days ago

nvidia/NFT-7B

View all activity

Organizations

upvoted a paper about 2 months ago

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Paper • 2505.18116 • Published May 23 • 4

upvoted a paper 4 months ago

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18 • 51

upvoted a paper 5 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 84

upvoted a collection 7 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated 5 days ago • 292

upvoted a paper about 1 year ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 72

upvoted a collection about 1 year ago

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 24

upvoted a paper over 1 year ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 257