6 1

Caiming Xiong

cxiong

AI & ML interests

None yet

Recent Activity

commented on a paper 29 days ago

SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents

authored a paper 5 months ago

Fractured Chain-of-Thought Reasoning

authored a paper 5 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

View all activity

Organizations

commented a paper 29 days ago

SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents

Paper • 2509.06283 • Published about 1 month ago • 17 •

authored 5 papers 5 months ago

Fractured Chain-of-Thought Reasoning

Paper • 2505.12992 • Published May 19 • 23

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19 • 45

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 97

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published May 8 • 26

authored 2 papers 8 months ago

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published Feb 6 • 25

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published Jan 31 • 39

authored a paper 9 months ago

Demystifying Domain-adaptive Post-training for Financial LLMs

Paper • 2501.04961 • Published Jan 9 • 12

authored 2 papers about 1 year ago

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

ThinK: Thinner Key Cache by Query-Driven Pruning

Paper • 2407.21018 • Published Jul 30, 2024 • 32

upvoted a paper over 1 year ago

OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Paper • 2407.02371 • Published Jul 2, 2024 • 54

authored a paper over 1 year ago

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Paper • 2407.01370 • Published Jul 1, 2024 • 89

updated 3 models over 1 year ago

authored 3 papers over 1 year ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 71

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Paper • 2402.15506 • Published Feb 23, 2024 • 18

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 69

authored a paper almost 2 years ago

Unlocking Anticipatory Text Generation: A Constrained Approach for Faithful Decoding with Large Language Models

Paper • 2312.06149 • Published Dec 11, 2023 • 3

Caiming Xiong

AI & ML interests

Recent Activity

Organizations

cxiong's activity