-
Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective
Paper • 2502.17262 • Published • 21 -
MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion
Paper • 2502.04235 • Published • 22 -
AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection
Paper • 2505.07293 • Published • 26
shenke
shenke18
AI & ML interests
None yet
Recent Activity
authored
a paper
12 days ago
Seed1.5-VL Technical Report
authored
a paper
12 days ago
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement
Learning
authored
a paper
12 days ago
MMaDA: Multimodal Large Diffusion Language Models
Organizations
Collections
1
models
0
None public yet
datasets
0
None public yet