VLM-Reasoning

non-profit

https://huggingface.co/organizations

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Lin-Chen authored a paper about 2 months ago

VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning

YuZeng260 authored a paper about 2 months ago

VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning

Lin-Chen authored a paper 2 months ago

Seed1.5-VL Technical Report

View all activity

Lin-Chen

authored a paper about 2 months ago

VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning

Paper • 2505.22019 • Published May 28 • 11

YuZeng260

authored a paper about 2 months ago

VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning

Paper • 2505.22019 • Published May 28 • 11

Lin-Chen

authored a paper 2 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 148

YuZeng260

updated a dataset 2 months ago

VLM-Reasoning/VCR-Bench

Viewer • Updated May 11 • 1.03k • 75 • 6

Osilly

authored 2 papers 3 months ago

Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification

Paper • 2412.00876 • Published Dec 1, 2024

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published Mar 9 • 31

ChthollyTree

updated a dataset 3 months ago

VLM-Reasoning/VCR-Bench

Viewer • Updated May 11 • 1.03k • 75 • 6

YuZeng260

authored a paper 3 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 47

yukunqi

updated a dataset 3 months ago

VLM-Reasoning/VCR-Bench

Viewer • Updated May 11 • 1.03k • 75 • 6

yukunqi

in VLM-Reasoning/VCR-Bench 3 months ago

Update README.md

#1 opened 3 months ago by

ChthollyTree

lovesnowbest

authored a paper 3 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 47

Osilly

authored a paper 3 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 47

ChthollyTree

authored a paper 3 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 47

yukunqi

authored a paper 3 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 47

Lin-Chen

authored a paper 3 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 47

gaotiexinqu

published a dataset 3 months ago

VLM-Reasoning/VCR-Bench

Viewer • Updated May 11 • 1.03k • 75 • 6

gaotiexinqu

updated a dataset 4 months ago

VLM-Reasoning/VCR-Bench

Viewer • Updated May 11 • 1.03k • 75 • 6

lovesnowbest

authored a paper 5 months ago

ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

Paper • 2502.18017 • Published Feb 25 • 20

lovesnowbest

authored a paper 6 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 106

Lin-Chen

authored a paper 7 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 99

AI & ML interests

Recent Activity

Team members 7

VLM-Reasoning's activity

Update README.md