Taming the Titans: A Survey of Efficient LLM Inference Serving Paper • 2504.19720 • Published 24 days ago • 10
A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More Paper • 2407.16216 • Published Jul 23, 2024
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper • 2505.00551 • Published 21 days ago • 31
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published 14 days ago • 141