2 6 68

Eli Chen

elichen3051

AI & ML interests

Learning Algorithm, Reinforcement Learning, Data Synthesize, Benchmarking

Recent Activity

liked a model 18 days ago

openai/gpt-oss-120b

liked a dataset about 1 month ago

HuggingFaceTB/smoltalk2

published a dataset 3 months ago

elichen-skymizer/lm-eval-ruler-results-private-32K

View all activity

Organizations

upvoted 2 articles 3 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

•

Apr 18

• 41

Article

The 4 Things Qwen-3's Chat Template Teaches Us

•

Apr 30

• 64

upvoted a collection 8 months ago

Sparse Foundational Llama 2 Models

Collection

Sparse pre-trained and fine-tuned Llama models made by Neural Magic + Cerebras • 27 items • Updated Apr 16 • 9

upvoted an article 9 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 47

upvoted 2 collections about 1 year ago

🍷 FineWeb

Collection

7 items • Updated Jun 20 • 25

📚 FineWeb-Edu

Collection

FineWeb-Edu datasets, classifier and ablation model • 5 items • Updated Jun 12, 2024 • 15

Eli Chen

AI & ML interests

Recent Activity

Organizations

elichen3051's activity

Gotchas in Tokenizer Behavior Every Developer Should Know

The 4 Things Qwen-3's Chat Template Teaches Us

Efficient LLM Pretraining: Packed Sequences and Masked Attention