Felix Tuma's picture

102 58

Felix Tuma

floom

·

AI & ML interests

NLP

Recent Activity

updated a collection 3 days ago

PotentialApplication

upvoted a paper 3 days ago

Prompt Orchestration Markup Language

upvoted a paper 3 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

View all activity

Organizations

None yet

updated a collection 3 days ago

PotentialApplication

38 items • Updated 3 days ago

upvoted 2 papers 3 days ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published 4 days ago • 36

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published 17 days ago • 100

updated a collection 7 days ago

PotentialApplication

38 items • Updated 3 days ago

upvoted a paper 7 days ago

AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators

Paper • 2508.09101 • Published 11 days ago • 7

updated a collection 7 days ago

PotentialApplication

38 items • Updated 3 days ago

updated a collection 9 days ago

PotentialApplication

38 items • Updated 3 days ago

upvoted 4 papers 9 days ago

Can LLM-Generated Textual Explanations Enhance Model Classification Performance? An Empirical Study

Paper • 2508.09776 • Published 10 days ago • 3

Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

Paper • 2508.09968 • Published 10 days ago • 14

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published 11 days ago • 34

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

Paper • 2508.07976 • Published 12 days ago • 45

updated a collection 12 days ago

PotentialApplication

38 items • Updated 3 days ago

upvoted 2 papers 16 days ago

LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

Paper • 2508.01780 • Published 20 days ago • 13

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published 21 days ago • 219

upvoted a paper 26 days ago

Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published Jul 21 • 60

upvoted a paper 29 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published about 1 month ago • 289

upvoted 2 papers about 1 month ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 117

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR

Paper • 2507.15778 • Published Jul 21 • 19