4 22 170

JingyeChen22

https://jingyechen.github.io

JingyeChen

AI & ML interests

OCR, Document Analysis, Text-to-X

Recent Activity

liked a model 8 days ago

black-forest-labs/FLUX.1-Redux-dev

commented on a paper 9 days ago

ImgEdit: A Unified Image Editing Dataset and Benchmark

upvoted a paper 9 days ago

ImgEdit: A Unified Image Editing Dataset and Benchmark

View all activity

Organizations

JingyeChen22's activity

upvoted a paper 9 days ago

ImgEdit: A Unified Image Editing Dataset and Benchmark

Paper • 2505.20275 • Published 11 days ago • 17

upvoted 2 papers about 2 months ago

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published Apr 11 • 40

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 63

upvoted a paper 2 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 78

upvoted a paper 5 months ago

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published Dec 23, 2024 • 24

upvoted a collection 6 months ago

RoLoRA

Collection

[EMNLP2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization • 3 items • Updated Sep 26, 2024 • 3

upvoted a paper 6 months ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 97

upvoted 4 papers 8 months ago

3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection

Paper • 2410.01647 • Published Oct 2, 2024 • 31

upvoted a paper 9 months ago

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

Paper • 2408.15239 • Published Aug 27, 2024 • 31

upvoted 4 papers over 1 year ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 130

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Paper • 2312.16862 • Published Dec 28, 2023 • 31

MagicScroll: Nontypical Aspect-Ratio Image Generation for Visual Storytelling via Multi-Layered Semantic-Aware Denoising

Paper • 2312.10899 • Published Dec 18, 2023 • 15

LLM-FP4: 4-Bit Floating-Point Quantized Transformers

Paper • 2310.16836 • Published Oct 25, 2023 • 14

upvoted a collection over 1 year ago

🕹️ AI Games

Collection

An ongoing collection of games you can play on HF Spaces • 14 items • Updated Oct 3, 2024 • 29

upvoted 3 papers over 1 year ago

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Paper • 2310.00426 • Published Sep 30, 2023 • 60

RMT: Retentive Networks Meet Vision Transformers

Paper • 2309.11523 • Published Sep 20, 2023 • 33

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 50