Rope to Nope and Back Again: A New Hybrid Attention Strategy (Paper 2501.18795, published Jan 30)
Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation (Paper 2505.18842, published May 24)
Time Blindness: Why Video-Language Models Can't See What Humans Can? (Paper 2505.24867, published May 30)
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics (Paper 2506.01844, published Jun 2)
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers (Paper 2505.21497, published May 27)
🪐 SmolLM Collection: a series of smol LLMs at 135M, 360M, and 1.7B parameters. Includes base and Instruct models, the training corpus, and some WebGPU demos (12 items, updated May 5)