Wei Ping

wping

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

nvidia/Qwen2.5-CascadeRL-RM-72B

liked a model 7 days ago

zai-org/GLM-4.7

liked a model 12 days ago

MiniMaxAI/MiniMax-M2.1

upvoted a paper 22 days ago

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Paper • 2512.13607 • Published 24 days ago • 30

upvoted a collection about 1 month ago

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 7 days ago • 44

upvoted a collection 5 months ago

DeepSeek-V3.1

4 items • Updated Nov 27, 2025 • 257

upvoted a collection 6 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 5 items • Updated Nov 14, 2025 • 162

upvoted a paper 7 months ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16, 2025 • 26

upvoted a paper 8 months ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22, 2025 • 35

upvoted 2 collections 8 months ago

AceReason

Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated 16 days ago • 20

Qwen3

84 items • Updated 8 days ago • 1.55k

upvoted 2 collections 9 months ago

AceMath-RL

Math reasoning models trained through reinforcement learning (RL) • 1 item • Updated 16 days ago • 5

Nemotron-UltraLong

3 items • Updated 16 days ago • 19

upvoted a paper 10 months ago

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18, 2025 • 50

upvoted a collection 12 months ago

AceMath

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 16 days ago • 16

upvoted a collection about 1 year ago

DeepSeek-V3

4 items • Updated Nov 27, 2025 • 278

upvoted a paper about 1 year ago

AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling

Paper • 2412.15084 • Published Dec 19, 2024 • 13

upvoted a collection over 1 year ago

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 16 days ago • 52

upvoted a paper over 1 year ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 74

upvoted 2 collections over 1 year ago

Llama3-ChatQA-2

This collection presents ChatQA-2, a suite of 128K long-context models that also have exceptional RAG capabilities • 3 items • Updated 16 days ago • 5

NV-Embed

NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, and STS tasks. • 3 items • Updated 16 days ago • 17