Submitted by IlyaGusev 69 PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation · 1 author 99 2
Submitted by pkanithi 58 MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications · 10 authors 6
Submitted by akhaliq 22 Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models · 7 authors 3
Submitted by sonta7 21 Gated Slot Attention for Efficient Linear-Time Sequence Modeling · 12 authors 2
Submitted by sandeep123 14 Can Large Language Models Unlock Novel Scientific Research Ideas? · 4 authors 8
Submitted by akhaliq 11 VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos · 5 authors 2
Submitted by akhaliq 11 Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering · 10 authors 4
Submitted by benbogin 8 SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories · 8 authors 2
Submitted by akhaliq 8 MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis · 5 authors 2
Submitted by thughost 8 ProteinBench: A Holistic Evaluation of Protein Foundation Models · 10 authors 2