Ken Tsui

kenhktsui

https://kenhktsui.github.io/

AI & ML interests

ML engineer, researcher VLM, LLM benchmark Opinions are my own

Recent Activity

upvoted a paper about 20 hours ago

A Very Big Video Reasoning Suite

liked a model about 1 month ago

moonshotai/Kimi-K2.5

liked a dataset about 2 months ago

VITRA-VLA/VITRA-1M

View all activity

Organizations

upvoted a paper about 20 hours ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published 7 days ago • 497

upvoted 5 papers 5 months ago

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 547

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Paper • 2509.25531 • Published Sep 29, 2025 • 9

upvoted a collection 6 months ago

Self Correction Bench

Collection

Benchmarking LLM capability of external and internal error correction • 4 items • Updated 2 days ago • 1

upvoted 3 papers 8 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3, 2025 • 9

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 77

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30, 2025 • 14

upvoted 2 articles 9 months ago

Article

LeRobot goes to driving school: World’s largest open-source self-driving dataset

Mar 11, 2025

•

105

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

787

upvoted an article 10 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12, 2025

•

598

upvoted a collection 11 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 697

upvoted a paper 11 months ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25, 2025 • 41

upvoted an article 11 months ago

Article

Breaking resolution curse of vision-language models

Feb 24, 2024

•

upvoted an article about 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4, 2025

•

1.32k

upvoted 3 papers about 1 year ago

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published Feb 3, 2025 • 21

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 288

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108

Ken Tsui

AI & ML interests

Recent Activity

Organizations

kenhktsui's activity

LeRobot goes to driving school: World’s largest open-source self-driving dataset

Uncensor any LLM with abliteration

Vision Language Models (Better, faster, stronger)

Breaking resolution curse of vision-language models

Open-source DeepResearch – Freeing our search agents