1 10 20

Shangzhi Zhang

Snorlax

AI & ML interests

None yet

Recent Activity

liked a dataset 21 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset

updated a collection about 2 months ago

LLMs

liked a Space about 2 months ago

opencompass/open_vlm_leaderboard

View all activity

Organizations

Snorlax's activity

liked a dataset 21 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated about 14 hours ago • 3.91M • 4.69k • 405

updated a collection about 2 months ago

LLMs

Collection

28 items • Updated Mar 2

liked a Space about 2 months ago

710

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection

liked a dataset about 2 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21 • 110k • 3.72k • 629

liked a model about 2 months ago

stepfun-ai/GOT-OCR-2.0-hf

Image-Text-to-Text • Updated Jan 31 • 27.8k • 189

upvoted an article 3 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

• 229

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 381

updated 4 models 4 months ago

updated 7 models 5 months ago

Snorlax/ppo-Pyramids

Reinforcement Learning • Updated Dec 1, 2024 • 14

Snorlax/ppo-SnowballTarget

Reinforcement Learning • Updated Dec 1, 2024 • 12

Snorlax/Reinforce-PixelCopter

Reinforcement Learning • Updated Dec 1, 2024

Snorlax/Reinforce-CartPole-v1

Reinforcement Learning • Updated Nov 30, 2024

Snorlax/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Nov 30, 2024 • 4

Snorlax/q-Taxi-v3

Reinforcement Learning • Updated Nov 26, 2024

Snorlax/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Nov 26, 2024

liked a Space 5 months ago

317

Huggy

🐶

Play with a stick-catching AI dog 🐶

updated a model 5 months ago

Snorlax/ppo-Huggy

Reinforcement Learning • Updated Nov 25, 2024 • 14