elejke's picture

elejke

elejke

·

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago

opendatalab/MinerU-HTML

upvoted a paper 17 days ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

liked a model about 1 month ago

lightonai/LightOnOCR-2-1B

View all activity

Organizations

None yet

upvoted a paper 17 days ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published 20 days ago • 71

upvoted a paper 3 months ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 231

upvoted a paper 4 months ago

Don't Blind Your VLA: Aligning Visual Representations for OOD Generalization

Paper • 2510.25616 • Published Oct 29, 2025 • 105

upvoted a paper 8 months ago

VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents

Paper • 2504.09795 • Published Apr 14, 2025 • 2

upvoted 3 collections over 1 year ago

GUI Datasets

Datasets from the graphical user interfaces domain (screenshots). • 20 items • Updated Dec 3, 2024 • 8

Document Processing

Any model or dataset dealing with documentary-type objects (layout detection, VQA, OCR, etc.) • 11 items • Updated Sep 4, 2025 • 4

SVG Collection

Collection of SVG files from various sources. • 7 items • Updated Oct 22, 2023 • 6

upvoted 2 papers over 1 year ago

High-Quality Image Restoration Following Human Instructions

Paper • 2401.16468 • Published Jan 29, 2024 • 15

MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs

Paper • 2406.11833 • Published Jun 17, 2024 • 62

upvoted a collection almost 2 years ago

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 92

upvoted a paper almost 2 years ago

OmniFusion Technical Report

Paper • 2404.06212 • Published Apr 9, 2024 • 77