7 17 10

Xiangyu Z

PhoenixZ

AI & ML interests

None yet

Recent Activity

updated a dataset 12 days ago

PhoenixZ/RISEBench

upvoted a paper 15 days ago

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

upvoted a paper 26 days ago

EnerVerse-AC: Envisioning Embodied Environments with Action Condition

View all activity

Organizations

None yet

PhoenixZ's activity

upvoted a paper 15 days ago

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Paper • 2505.19815 • Published 16 days ago • 36

upvoted a paper 26 days ago

EnerVerse-AC: Envisioning Embodied Environments with Action Condition

Paper • 2505.09723 • Published 27 days ago • 22

upvoted 2 papers 2 months ago

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published Apr 10 • 34

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3 • 68

upvoted 4 papers 3 months ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 88

upvoted a collection 3 months ago

FLUX.1

Collection

A collection of our FLUX.1 models and LoRAs. • 8 items • Updated Apr 15 • 100

upvoted 2 papers 4 months ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 73

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published Jan 20 • 30

upvoted a paper 6 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 158

upvoted a collection 8 months ago

CompassJudger

Collection

4 items • Updated Oct 16, 2024 • 8

upvoted a paper 8 months ago

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21, 2024 • 61

upvoted 3 papers 12 months ago

MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning

Paper • 2406.17770 • Published Jun 25, 2024 • 19

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs

Paper • 2406.14544 • Published Jun 20, 2024 • 36

MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding

Paper • 2406.14515 • Published Jun 20, 2024 • 34