sixiang chen

Ephemeral182

https://ephemeral182.github.io

Recent Activity

liked a model 1 day ago

Qwen/Qwen-Image-Layered

liked a model 7 days ago

Qwen/Qwen-Image-Edit-2511

liked a model 8 days ago

ostris/Z-Image-De-Turbo

upvoted a paper 15 days ago

JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Paper • 2511.23002 • Published Nov 28 • 26

upvoted a paper 27 days ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 29 days ago • 69

upvoted a paper about 1 month ago

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Paper • 2511.18050 • Published Nov 22 • 37

upvoted a paper 3 months ago

LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer

Paper • 2509.22414 • Published Sep 26 • 21

upvoted 3 papers 6 months ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30 • 89

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Paper • 2506.17612 • Published Jun 21 • 64

upvoted 2 papers 7 months ago

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Paper • 2506.10741 • Published Jun 12 • 27

Sparse Sampling Transformer with Uncertainty-Driven Ranking for Unified Removal of Raindrops and Rain Streaks

Paper • 2308.14153 • Published Aug 27, 2023 • 2

upvoted a paper 9 months ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 64

upvoted 8 papers about 1 year ago

Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Paper • 2412.01819 • Published Dec 2, 2024 • 34

KV Prediction for Improved Time to First Token

Paper • 2410.08391 • Published Oct 10, 2024 • 12

PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness

Paper • 2410.07035 • Published Oct 9, 2024 • 17

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Paper • 2410.09008 • Published Oct 11, 2024 • 17

From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning

Paper • 2410.06456 • Published Oct 9, 2024 • 37

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

Paper • 2410.09009 • Published Oct 11, 2024 • 15

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models

Paper • 2410.07133 • Published Oct 9, 2024 • 19

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 52