Kariuki james kariuki

JK-TK

AI & ML interests

I love ML and AI

Recent Activity

upvoted a paper about 18 hours ago

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

Paper • 2507.15061 • Published 3 days ago • 32

upvoted a paper 1 day ago

AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning

Paper • 2507.12841 • Published 6 days ago • 37

upvoted 2 papers 3 days ago

AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research

Paper • 2507.13300 • Published 6 days ago • 16

DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering

Paper • 2507.11527 • Published 8 days ago • 30

upvoted a paper 10 days ago

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Paper • 2507.05687 • Published 15 days ago • 26

upvoted an article 17 days ago

Open Source All About Data Processing, Dataverse

Article • Apr 4, 2024 • 3

upvoted an article 19 days ago

Gemma 3n fully available in the open-source ecosystem!

Article • and 7 others • 27 days ago • 111

upvoted a paper 25 days ago

Thought Anchors: Which LLM Reasoning Steps Matter?

Paper • 2506.19143 • Published 30 days ago • 11

upvoted 3 papers 30 days ago

All is Not Lost: LLM Recovery without Checkpoints

Paper • 2506.15461 • Published Jun 18 • 37

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

Paper • 2506.09049 • Published Jun 10 • 35

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5 • 79

upvoted 2 papers about 1 month ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 39

Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16 • 41

upvoted 5 papers 2 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 217

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Paper • 2505.07782 • Published May 12 • 18

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models

Paper • 2505.07591 • Published May 12 • 11

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 95

upvoted 2 papers 3 months ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Paper • 2505.02735 • Published May 5 • 32

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published May 5 • 84