arxiv:2602.05711
Loser Cheems
JingzeShi
ยท
AI & ML interests
I like training small languge models.
Recent Activity
updated a model 22 days ago
JingzeShi/flash-sparse-attention liked a model about 2 months ago
BAAI/OpenSeek-Mid-v1 updated a model 2 months ago
JingzeShi/flash-sparse-attention