- AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
  Paper • 2402.15506 • Published • 17
- AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent
  Paper • 2404.03648 • Published • 29
- Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
  Paper • 2405.19893 • Published • 32
- Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
  Paper • 2405.19888 • Published • 7
Collections
Collections including paper arxiv:2508.01858
- Reinforcement Pre-Training
  Paper • 2506.08007 • Published • 256
- Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
  Paper • 2506.06395 • Published • 130
- Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models
  Paper • 2506.05176 • Published • 68
- Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
  Paper • 2505.24726 • Published • 270

- The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models
  Paper • 2507.23313 • Published • 1
- SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
  Paper • 2508.03448 • Published • 1
- C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor
  Paper • 2508.01311 • Published • 2
- Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model
  Paper • 2505.21179 • Published • 13