Loser Cheems's picture

Loser Cheems

JingzeShi

·

https://github.com/LoserCheems

LoserCheems

AI & ML interests

I like training small languge models.

Recent Activity

updated a dataset about 1 hour ago

SmallDoge/MoE_dataset

updated a dataset about 2 hours ago

SmallDoge/MoE_dataset

updated a dataset about 2 hours ago

SmallDoge/MoE_dataset

View all activity

Organizations

upvoted a paper 18 days ago

CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models

Paper • 2506.07463 • Published 19 days ago • 10

upvoted a paper 29 days ago

Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting

Paper • 2505.19716 • Published May 26 • 5

upvoted a collection 4 months ago

🤓Small-Datasets

Multi-stage high-quality datasets makes the model more helpful! • 8 items • Updated 26 days ago • 3

upvoted 4 collections 5 months ago

Doge-Downstream-Applications

2 items • Updated Apr 21 • 2

🐶Doge-CheckPoints

A series of checkPoint weights that can continue training on new datasets without spikes of the training. • 6 items • Updated Apr 21 • 2

🐕Small-Doges

Doge family of small language models! • 18 items • Updated Apr 21 • 6

YuLan-Mini

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 6 items • Updated Apr 14 • 16

upvoted a collection 6 months ago

Doge

Doge family of small language models. • 12 items • Updated Mar 28 • 6

upvoted a paper 6 months ago

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published Dec 16, 2024 • 8

upvoted a paper 7 months ago

Cheems: Wonderful Matrices More Efficient and More Effective Architecture

Paper • 2407.16958 • Published Jul 24, 2024 • 4