17 43 5

Chen Dongping

shuaishuaicdp

https://dongping-chen.github.io/

shuaishuaicdp

AI & ML interests

Research for happy.

Recent Activity

updated a collection about 21 hours ago

MixSet

updated a collection about 21 hours ago

MixSet

updated a collection about 21 hours ago

LiveVQA

View all activity

Organizations

shuaishuaicdp's activity

upvoted 2 collections about 21 hours ago

LiveVQA

Collection

Dataset, benchmark and model checkpoints from paper LiveVQA. • 4 items • Updated about 2 hours ago • 1

GUI-World

Collection

Models and datasets from paper GUI-World. • 3 items • Updated about 21 hours ago • 1

upvoted 3 papers about 2 months ago

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 51

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published Apr 14 • 40

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published Apr 10 • 47

upvoted a paper 2 months ago

LiveVQA: Live Visual Knowledge Seeking

Paper • 2504.05288 • Published Apr 7 • 15

upvoted 12 papers 3 months ago

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

Paper • 2503.04812 • Published Mar 4 • 15

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 66

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published Mar 10 • 39

LLM as a Broken Telephone: Iterative Generation Distorts Information

Paper • 2502.20258 • Published Feb 27 • 27

CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom

Paper • 2503.01836 • Published Mar 3 • 14

Wikipedia in the Era of LLMs: Evolution and Risks

Paper • 2503.02879 • Published Mar 4 • 22

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published Feb 27 • 30

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published Feb 27 • 28

CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale

Paper • 2502.16645 • Published Feb 23 • 22

upvoted 2 papers 4 months ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 46

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 160