Shizhe Diao's picture

Shizhe Diao

shizhediao

·

https://shizhediao.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago

shizhediao/SCP-116K-cleaned

published a dataset about 1 month ago

shizhediao/SCP-116K-cleaned

liked a model about 1 month ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

View all activity

Organizations

commented a paper about 1 month ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 133 •

commented a paper about 2 months ago

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28 • 42 •

commented a paper 3 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 92 •

New activity in OptimalScale/gpt-neo2.7B-inst-tuning almost 2 years ago

Librarian Bot: Add base_model information to model

#1 opened almost 2 years ago by

New activity in OptimalScale/robin-65b-v2-delta almost 2 years ago

Please make a model card for robin

#1 opened about 2 years ago by

New activity in OptimalScale/robin-13b-v2-delta about 2 years ago

output gibberish

#1 opened about 2 years ago by

New activity in OptimalScale/Robin-7b about 2 years ago

Apply for community grant: Academic project

#1 opened about 2 years ago by