Rongyao Fang's picture

4 8 10

Rongyao Fang PRO

LucasFang

·

https://rongyaofang.github.io/

rongyaofang

AI & ML interests

Multimodal Large Language Model targeting AGI

Recent Activity

liked a model 6 days ago

deepseek-ai/DeepSeek-R1-0528

authored a paper about 1 month ago

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

upvoted a paper about 1 month ago

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

View all activity

Organizations

None yet

authored a paper about 1 month ago

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

Paper • 2505.17022 • Published May 22 • 26

authored a paper 4 months ago

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 51

authored a paper 7 months ago

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published Dec 11, 2024 • 18

authored 3 papers 8 months ago

Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking

Paper • 2303.05475 • Published Mar 9, 2023

RBGNet: Ray-based Grouping for 3D Object Detection

Paper • 2204.02251 • Published Apr 5, 2022 • 1

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 57

authored a paper over 1 year ago

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis

Paper • 2403.12963 • Published Mar 19, 2024 • 8