ISEKAI

community

https://github.com/isekai-portal/Link-Context-Learning

isekai-portal

AI & ML interests

None defined yet.

ISEKAI-Portal's activity

weepiess2383

authored 2 papers 3 days ago

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Paper • 2501.08453 • Published Jan 14 • 1

CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models

Paper • 2503.18886 • Published 3 days ago • 16

Amoik

authored a paper 6 days ago

REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding

Paper • 2503.07413 • Published 17 days ago • 2

liuziwei7

authored a paper 10 days ago

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Paper • 2503.12605 • Published 11 days ago • 30

liuziwei7

authored a paper 21 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published 22 days ago • 38

liuziwei7

authored a paper about 1 month ago

WHAC: World-grounded Humans and Cameras

Paper • 2403.12959 • Published Mar 19, 2024 • 3

liuziwei7

authored a paper about 2 months ago

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Paper • 2502.04328 • Published Feb 6 • 30

liuziwei7

authored 3 papers 2 months ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 25

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Paper • 2501.08983 • Published Jan 15 • 20

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Paper • 2501.08994 • Published Jan 15 • 15

weepiess2383

authored a paper 2 months ago

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Paper • 2501.08994 • Published Jan 15 • 15

liuziwei7

authored 4 papers 3 months ago

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 26

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Paper • 2501.03847 • Published Jan 7 • 23

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Paper • 2412.09645 • Published Dec 10, 2024 • 36

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

Paper • 2412.09626 • Published Dec 12, 2024 • 20

liuziwei7

authored 5 papers 4 months ago

FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models

Paper • 2412.07674 • Published Dec 10, 2024 • 20

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Paper • 2412.03552 • Published Dec 4, 2024 • 28

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Paper • 2412.00174 • Published Nov 29, 2024 • 23

Material Anything: Generating Materials for Any 3D Object via Diffusion

Paper • 2411.15138 • Published Nov 22, 2024 • 50

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 34

Team members 3
