Harry Soteriou's picture

28 45

Harry Soteriou

HarrySoteriou

·

HarrySoteriou

AI & ML interests

LLMs, Deep Reinforcement Learning, TinyML, Computer Vision

Recent Activity

liked a dataset 3 days ago

openbmb/Ultra-FineWeb

upvoted a paper 4 days ago

Phi-4 Technical Report

View all activity

Organizations

None yet

HarrySoteriou's activity

upvoted 7 papers 4 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 117

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 391

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 291

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published 8 days ago • 132

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 10 days ago • 141

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 23 days ago • 106

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 131

upvoted a paper 9 days ago

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published 13 days ago • 31

upvoted a collection 15 days ago

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & outperforms all leading quantization methods. • 29 items • Updated 15 days ago • 89

upvoted 2 papers 18 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 255

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 276

upvoted 2 papers 2 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 112

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 87

upvoted 4 papers 3 months ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 74

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 186

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 70

upvoted a paper 5 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368