Zebra-CoT

university

https://github.com/LeonLixyz/vlm_reasoning

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

leonli66 updated a dataset about 7 hours ago

vlm-reasoning-cot/Zebra-CoT

EspressoTonic updated a dataset about 23 hours ago

vlm-reasoning-cot/3d-Multi-Hop-Counting-new

Zikui authored a paper 3 days ago

Cross-Modal Safety Alignment: Is textual unlearning all you need?

View all activity

vlm-reasoning-cot's activity

leonli66

updated a dataset about 7 hours ago

vlm-reasoning-cot/Zebra-CoT

Viewer • Updated about 7 hours ago • 118k • 861

EspressoTonic

updated a dataset about 23 hours ago

vlm-reasoning-cot/3d-Multi-Hop-Counting-new

Viewer • Updated about 23 hours ago • 10k • 228

Zikui

authored 5 papers 3 days ago

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Paper • 2506.05523 • Published 6 days ago • 31

kaiyuyue

authored a paper 9 days ago

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Paper • 2505.22664 • Published 14 days ago • 6

deqing

authored a paper 15 days ago

Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models

Paper • 2505.14071 • Published 23 days ago • 1

deqing

authored a paper 4 months ago

FoNE: Precise Single-Token Number Embeddings via Fourier Features

Paper • 2502.09741 • Published Feb 13 • 14

Bill1235813

authored 4 papers 7 months ago

Generalization Differences between End-to-End and Neuro-Symbolic Vision-Language Reasoning Systems

Paper • 2210.15037 • Published Oct 26, 2022 • 1

TLDR: Token-Level Detective Reward Model for Large Vision Language Models

Paper • 2410.04734 • Published Oct 7, 2024 • 17

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14, 2024 • 39

VisualLens: Personalization through Visual History

Paper • 2411.16034 • Published Nov 25, 2024 • 18

deqing

authored a paper 7 months ago

VisualLens: Personalization through Visual History

Paper • 2411.16034 • Published Nov 25, 2024 • 18

deqing

authored a paper 8 months ago

TLDR: Token-Level Detective Reward Model for Large Vision Language Models

Paper • 2410.04734 • Published Oct 7, 2024 • 17

kaiyuyue

authored a paper 12 months ago

From Pixels to Prose: A Large Dataset of Dense Image Captions

Paper • 2406.10328 • Published Jun 14, 2024 • 18

deqing

authored a paper 12 months ago

Pre-trained Large Language Models Use Fourier Features to Compute Addition

Paper • 2406.03445 • Published Jun 5, 2024

deqing

authored 2 papers about 1 year ago

Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models

Paper • 2310.17086 • Published Oct 26, 2023 • 1

IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations

Paper • 2404.01266 • Published Apr 1, 2024 • 3

AI & ML interests

Recent Activity

Team members 7

vlm-reasoning-cot's activity