Guo-Hua Wang

Flourish

https://doctorkey.github.io/

DoctorKey

AI & ML interests

None yet

Recent Activity

updated a model 24 days ago

AIDC-AI/Ovis-Image-7B

liked a model 26 days ago

AIDC-AI/Ovis2.6-80B-A3B

liked a model 4 months ago

AIDC-AI/Ovis2.6-30B-A3B

upvoted a paper 6 months ago

Ovis-Image Technical Report

Paper • 2511.22982 • Published Nov 28, 2025 • 7

upvoted a collection 6 months ago

Ovis-Image

Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering under stringent computational constraints. • 7 items • Updated Dec 4, 2025 • 7

upvoted a collection 7 months ago

Diffusion-SDPO

2 items • Updated Nov 11, 2025 • 1

upvoted a paper 7 months ago

Diffusion-SDPO: Safeguarded Direct Preference Optimization for Diffusion Models

Paper • 2511.03317 • Published Nov 5, 2025 • 7

upvoted a paper 10 months ago

Ovis2.5 Technical Report

Paper • 2508.11737 • Published Aug 15, 2025 • 116

upvoted 4 collections 10 months ago

Ovis2.5

Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19, 2025 • 58

Ovis-U1

4 items • Updated Dec 1, 2025 • 1

TeEFusion

2 items • Updated Nov 11, 2025 • 1

CHATS

3 items • Updated Dec 1, 2025 • 1

upvoted a paper 11 months ago

TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance

Paper • 2507.18192 • Published Jul 24, 2025 • 8

upvoted a collection 11 months ago

Ovis-U1

A unified model for multimodal understanding, text-to-image generation, and image editing. • 3 items • Updated Jul 2, 2025 • 7

upvoted a paper 11 months ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published Jun 29, 2025 • 63

upvoted 2 papers about 1 year ago

CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation

Paper • 2502.12579 • Published Feb 18, 2025 • 1

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5, 2025 • 82

upvoted a collection over 1 year ago

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25, 2025 • 67