123 642 1

Michael Barry

MichaelBarryUK

AI & ML interests

None yet

Recent Activity

commented on a paper about 6 hours ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

commented on a paper about 24 hours ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

upvoted a paper 1 day ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

View all activity

Organizations

None yet

commented a paper about 6 hours ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published 7 days ago • 6 •

commented a paper about 24 hours ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published 7 days ago • 6 •

upvoted a paper 1 day ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published 3 days ago • 24

commented a paper 1 day ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published 3 days ago • 24 •

commented a paper 15 days ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published 21 days ago • 219 •

upvoted 8 papers 17 days ago

Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe

Paper • 2508.01691 • Published 19 days ago • 9

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Paper • 2508.01548 • Published 20 days ago • 13

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Paper • 2508.02317 • Published 18 days ago • 16

Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

Paper • 2508.01059 • Published 21 days ago • 32

Qwen-Image Technical Report

Paper • 2508.02324 • Published 18 days ago • 210

commented a paper 28 days ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 116 •

commented a paper 30 days ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 116 •

commented 3 papers about 1 month ago

Favicon Trojans: Executable Steganography Via Ico Alpha Channel Exploitation

Paper • 2507.09074 • Published Jul 11 • 6 •

Favicon Trojans: Executable Steganography Via Ico Alpha Channel Exploitation

Paper • 2507.09074 • Published Jul 11 • 6 •

Favicon Trojans: Executable Steganography Via Ico Alpha Channel Exploitation

Paper • 2507.09074 • Published Jul 11 • 6 •

commented 2 papers 3 months ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 23 •

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 23 •

Michael Barry

AI & ML interests

Recent Activity

Organizations

MichaelBarryUK's activity