Shang Hong Sim

shanghong

https://shanghongsim.github.io/

AI & ML interests

Neural decoding, neuroengineering, signal processing

Recent Activity

upvoted an article 17 days ago

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

updated a model about 1 month ago

shanghong/llama3.1_8b_stage1

published a model about 1 month ago

shanghong/llama3.1_8b_stage1

View all activity

Organizations

upvoted an article 17 days ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

and 2 others •

Mar 20, 2024

• 98

upvoted an article 4 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 182

upvoted a paper 5 months ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13 • 36

upvoted 3 collections 5 months ago

upvoted an article 5 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 876

upvoted a collection 5 months ago

Trust-Align

Collection

12 items • Updated Feb 11 • 3

upvoted a paper 6 months ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published Jan 18 • 15

upvoted a paper 8 months ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published Nov 9, 2024 • 46

Shang Hong Sim

AI & ML interests

Recent Activity

Organizations

shanghong's activity

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Open-R1: a fully open reproduction of DeepSeek-R1