ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published about 1 month ago • 64
Pleias-RAG Collection A new generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 27
Gemma 3 QAT Collection Quantization-Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory. • 15 items • Updated 8 days ago • 195
Open-RS Collection Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated Mar 21 • 12
Distributed Inference and Fine-tuning of Large Language Models Over The Internet Paper • 2312.08361 • Published Dec 13, 2023 • 28
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters Paper • 2406.05955 • Published Jun 10, 2024 • 28
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 266
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 40 items • Updated 4 days ago • 116
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published Oct 22, 2024 • 95
Fine-tuning LLMs to 1.58bit: extreme quantization made easy Article • By medmekk and 5 others • Published Sep 18, 2024 • 246
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents Paper • 2407.04363 • Published Jul 5, 2024 • 34
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27, 2024 • 150