free-bit

AI & ML interests

None yet

Recent Activity

liked a model about 7 hours ago

Qwen/Qwen3-VL-8B-Instruct

liked a model about 7 hours ago

rednote-hilab/dots.ocr

liked a model about 7 hours ago

allenai/olmOCR-2-7B-1025-FP8

Organizations

None yet

upvoted 2 papers 3 days ago

Point Transformer V3: Simpler, Faster, Stronger

Paper • 2312.10035 • Published Dec 15, 2023 • 22

Sonata: Self-Supervised Learning of Reliable Point Representations

Paper • 2503.16429 • Published Mar 20 • 13

upvoted 18 papers 7 days ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 9 days ago • 107

Self-Adapting Language Models

Paper • 2506.10943 • Published Jun 12 • 7

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23 • 273

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 77

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 127

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 430

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 156

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 212

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 252

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 168

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 133

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 202

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 306

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 98

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 154

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 319

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5 • 76

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 145