HanWang's picture

20 174

HanWang

eseedo

·

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

deepseek-ai/DeepSeek-V3.1-Base

liked a model 14 days ago

FrancisRing/StableAvatar

liked a model 29 days ago

Skywork/Skywork-UniPic-1.5B

View all activity

Organizations

upvoted a collection about 2 months ago

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated Jul 11 • 159

upvoted an article 4 months ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13, 2024

• 659

upvoted 2 collections 4 months ago

Qwen3

84 items • Updated 21 days ago • 1.15k

HiDream-I1

A collections of HiDream-I1 models. • 4 items • Updated Apr 8 • 32

upvoted a collection 5 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 531

upvoted a collection 6 months ago

Phi-4

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10 • 178

upvoted a paper 6 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 165

upvoted a collection 6 months ago

Deepseek Papers

Deepseek papers collection • 24 items • Updated 3 days ago • 267

upvoted an article 11 months ago

Article

Exploring the Daily Papers Page on Hugging Face

By

•

Sep 23, 2024

• 63

upvoted 2 collections about 1 year ago

Llama3.1-Chinese-Chat

2 items • Updated Jul 26, 2024 • 7

H2O Danube3

7 items • Updated Nov 30, 2024 • 57

upvoted 3 papers over 1 year ago

Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video

Paper • 2404.09833 • Published Apr 15, 2024 • 31

Masked Audio Generation using a Single Non-Autoregressive Transformer

Paper • 2401.04577 • Published Jan 9, 2024 • 44

GPT-4V(ision) is a Generalist Web Agent, if Grounded

Paper • 2401.01614 • Published Jan 3, 2024 • 23

upvoted a collection over 1 year ago

LLMs

16 items • Updated Jan 4, 2024 • 3

upvoted 4 papers over 1 year ago

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 189

DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision

Paper • 2312.16256 • Published Dec 26, 2023 • 18

PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar

Paper • 2312.14239 • Published Dec 21, 2023 • 12

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Paper • 2312.09911 • Published Dec 15, 2023 • 55

upvoted a collection over 1 year ago

Image to 3D

11 items • Updated Aug 20, 2024 • 8