papers - a passagereptile455 Collection

passagereptile455 's Collections

papers

papers

updated 4 days ago

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Paper • 2503.14734 • Published Mar 18 • 4
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Paper • 2401.02117 • Published Jan 4, 2024 • 34
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 128
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding

Paper • 2506.16035 • Published Jun 19 • 87
Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published about 1 month ago • 60
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Paper • 2507.18553 • Published 27 days ago • 39
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

Paper • 2507.19478 • Published 26 days ago • 29
CLEAR: Error Analysis via LLM-as-a-Judge Made Easy

Paper • 2507.18392 • Published 27 days ago • 19
PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving

Paper • 2507.17596 • Published 28 days ago • 5
Specification Self-Correction: Mitigating In-Context Reward Hacking Through Test-Time Refinement

Paper • 2507.18742 • Published 27 days ago • 5
Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI

Paper • 2507.10510 • Published Jul 14 • 4
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

Paper • 2507.19457 • Published 26 days ago • 24
Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Paper • 2507.16534 • Published 29 days ago • 6
A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 245
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 228
Group Sequence Policy Optimization

Paper • 2507.18071 • Published 28 days ago • 289
Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 156
MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4 • 150
Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2 • 128
GUI-G^2: Gaussian Reward Modeling for GUI Grounding

Paper • 2507.15846 • Published about 1 month ago • 130
Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published 29 days ago • 116
T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8 • 115
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 126
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Paper • 2410.10813 • Published Oct 14, 2024 • 12
LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

Paper • 2506.11928 • Published Jun 13 • 24
Defeating Prompt Injections by Design

Paper • 2503.18813 • Published Mar 24 • 22