BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms Paper • 2505.15141 • Published May 21, 2025 • 4
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design Paper • 2505.16175 • Published May 22, 2025 • 41
Optimizing Anytime Reasoning via Budget Relative Policy Optimization Paper • 2505.13438 • Published May 19, 2025 • 35
On Evaluating Adversarial Robustness of Large Vision-Language Models Paper • 2305.16934 • Published May 26, 2023
Intriguing Properties of Data Attribution on Diffusion Models Paper • 2311.00500 • Published Nov 1, 2023 • 2
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast Paper • 2402.08567 • Published Feb 13, 2024 • 2
Robustness and Accuracy Could Be Reconcilable by (Proper) Definition Paper • 2202.10103 • Published Feb 21, 2022
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning Paper • 2402.13669 • Published Feb 21, 2024 • 1
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses Paper • 2406.01288 • Published Jun 3, 2024 • 1
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs Paper • 2406.09136 • Published Jun 13, 2024
RegMix: Data Mixture as Regression for Language Model Pre-training Paper • 2407.01492 • Published Jul 1, 2024 • 40
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates Paper • 2410.07137 • Published Oct 9, 2024 • 8
Improving Long-Text Alignment for Text-to-Image Diffusion Models Paper • 2410.11817 • Published Oct 15, 2024 • 15