LlamaSeg: Image Segmentation via Autoregressive Mask Generation • Paper • 2505.19422 • Published May 26 • 3
ReEx-SQL: Reasoning with Execution-Aware Reinforcement Learning for Text-to-SQL • Paper • 2505.12768 • Published May 19 • 3
VisNumBench: Evaluating Number Sense of Multimodal Large Language Models • Paper • 2503.14939 • Published Mar 19 • 5
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training • Paper • 2405.15319 • Published May 24, 2024 • 30
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales • Paper • 2308.01320 • Published Aug 2, 2023 • 46
Benchmarking Large Language Model Capabilities for Conditional Generation • Paper • 2306.16793 • Published Jun 29, 2023 • 7