Le Huy Hoang's picture

Le Huy Hoang

splendor1811

·

huyhoang18112k2

AI & ML interests

Computer Vision

Recent Activity

updated a model 1 day ago

splendor1811/bert_tuning1.0

published a model 1 day ago

splendor1811/bert_tuning1.0

upvoted a paper 1 day ago

Intern-S1: A Scientific Multimodal Foundation Model

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published 6 days ago • 236

upvoted a collection 28 days ago

Qwen3

84 items • Updated 21 days ago • 1.14k

upvoted 3 articles about 1 month ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By

•

Feb 7

• 211

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

By

•

Mar 17

• 328

Article

I trained a Language Model to schedule events with GRPO!

By

•

Apr 29

• 86

upvoted a paper about 2 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 263

upvoted a collection 3 months ago

Qwen3-Embedding

6 items • Updated Jul 21 • 120

upvoted an article 4 months ago

Article

Vision Language Models (Better, Faster, Stronger)

By

and 4 others •

May 12

• 516

upvoted a paper 6 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57

upvoted 2 articles 7 months ago

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.29k

Article

SmolVLM - small yet mighty Vision Language Model

By

and 4 others •

Nov 26, 2024

• 350

upvoted a paper 7 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 89

upvoted a collection 8 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 574

upvoted a collection 10 months ago

MIT Talk 31/10 Papers

14 items • Updated Oct 28, 2024 • 32

upvoted a paper over 1 year ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

upvoted 2 articles over 1 year ago

Article

Mixture of Experts Explained

By

and 5 others •

Dec 11, 2023

• 852

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

By

•

Jun 23, 2024

• 35

upvoted 3 papers over 1 year ago

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published May 2, 2024 • 29

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published May 2, 2024 • 57

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 122