Learning Flow Fields in Attention for Controllable Person Image Generation Paper โข 2412.08486 โข Published Dec 11, 2024 โข 37
MarDini: Masked Autoregressive Diffusion for Video Generation at Scale Paper โข 2410.20280 โข Published Oct 26, 2024 โข 23
OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models Paper โข 2407.11213 โข Published Jul 15, 2024 โข 3
Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning Paper โข 2403.06728 โข Published Mar 11, 2024 โข 2
VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation Paper โข 2311.16492 โข Published Nov 27, 2023 โข 2
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation Paper โข 2303.15994 โข Published Mar 28, 2023 โข 2
Text Promptable Surgical Instrument Segmentation with Vision-Language Models Paper โข 2306.09244 โข Published Jun 15, 2023 โข 2