Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models Paper • 2504.00573 • Published Apr 1 • 2
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 17 days ago • 72
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs Paper • 2507.03253 • Published Jul 4 • 18