Tim Wu

changtimwu

AI & ML interests

DL, IoT, DevOps

Recent Activity

liked a model 9 days ago

Qwen/Qwen3-32B-FP8

liked a Space about 2 months ago

nanotron/ultrascale-playbook

upvoted a paper 2 months ago

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving

changtimwu's activity

liked a model 9 days ago

Qwen/Qwen3-32B-FP8

Text Generation • Updated 21 days ago • 41.1k • 51

liked a Space about 2 months ago

2.68k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLMs on large GPU clusters

upvoted a paper 2 months ago

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving

Paper • 2401.09670 • Published Jan 18, 2024 • 2

upvoted an article 3 months ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 608

liked a model 3 months ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated May 1 • 500k • 1.42k

liked a Space 3 months ago

243

Qwen2.5 VL 72B Instruct

💻

Chat with an AI that understands text and images

upvoted a paper 4 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 125

liked a model 4 months ago

QuantFactory/Llama-3.2-Taiwan-Legal-3B-Instruct-GGUF

Text Generation • Updated Nov 2, 2024 • 1.32k • 11

upvoted 2 papers 4 months ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 101

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 122

liked a Space 10 months ago

115

Llama3.1 S V0.2 Checkpoint 2024 08 20

😻

Convert text to audio and vice versa

liked 2 models 11 months ago

shenzhi-wang/Llama3.1-8B-Chinese-Chat

Text Generation • Updated Jul 29, 2024 • 14.9k • 262

openbmb/MiniCPM-Llama3-V-2_5-gguf

Updated Feb 27 • 5.52k • 213

liked a Space about 1 year ago

215

Microsoft Phi-3-Vision-128k

😻

Generate image descriptions

liked a model about 1 year ago

google/paligemma-3b-pt-224

Image-Text-to-Text • Updated Sep 21, 2024 • 42.6k • 328

updated a model about 1 year ago

changtimwu/speaker-segmentation-fine-tuned-callhome-jpn

Updated May 2, 2024 • 11

liked 2 models about 1 year ago

crusoeai/Llama-3-8B-Instruct-262k-GGUF

Updated May 5, 2024 • 347 • 48

bullerwins/gradientai_Llama-3-8B-Instruct-262k_exl2_8.0bpw

Text Generation • Updated Apr 26, 2024 • 15 • 3

upvoted an article about 1 year ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 237

upvoted a paper about 1 year ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 257