Jitesh Jain

praeclarumjj3

https://praeclarumjj3.github.io/

authored 2 papers 7 months ago

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Paper • 2405.05949 • Published May 9, 2024 • 3

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Paper • 2412.09585 • Published Dec 12, 2024 • 11

authored a paper over 1 year ago

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Paper • 2312.14233 • Published Dec 21, 2023 • 17

authored 3 papers about 2 years ago

Matting Anything

Paper • 2306.05399 • Published Jun 8, 2023 • 6

Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand

Paper • 2208.03382 • Published Aug 5, 2022

OneFormer: One Transformer to Rule Universal Image Segmentation

Paper • 2211.06220 • Published Nov 10, 2022