MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems Paper • 2605.28732 • Published 1 day ago • 24
SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research Paper • 2605.22878 • Published 8 days ago • 52
OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models Paper • 2605.00877 • Published Apr 25 • 15
Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis Paper • 2604.24198 • Published Apr 27 • 22
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language Paper • 2604.19667 • Published Apr 21 • 22
OceanPile Collection A Large-Scale Multimodal Ocean Corpus for Foundation Models • 8 items • Updated 9 days ago • 2
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published Apr 4 • 38
SkillX: Automatically Constructing Skill Knowledge Bases for Agents Paper • 2604.04804 • Published Apr 6 • 35
How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities Paper • 2603.02578 • Published Mar 3 • 25
Article Making LLMs Truly Remember You | LightMem: Lightweight and Efficient Memory-Augmented Generation xzwnlp • Feb 28 • 4
Article Create, Evaluate, and Connect AI Skills | SkillNet: A Large-Scale Agentic "Skill Graph" Knowledge Base xzwnlp • Feb 28 • 13
InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem Paper • 2602.14367 • Published Feb 16 • 17