Jitesh Jain

praeclarumjj3
https://praeclarumjj3.github.io/
  • praeclarumjj
  • praeclarumjj3

Organizations

Ai2, SHI Labs

authored 2 papers 7 months ago

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Paper • 2405.05949 • Published May 9, 2024 • 3

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Paper • 2412.09585 • Published Dec 12, 2024 • 11
authored a paper over 1 year ago

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Paper • 2312.14233 • Published Dec 21, 2023 • 17
authored 3 papers about 2 years ago

Matting Anything

Paper • 2306.05399 • Published Jun 8, 2023 • 6

Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand

Paper • 2208.03382 • Published Aug 5, 2022

OneFormer: One Transformer to Rule Universal Image Segmentation

Paper • 2211.06220 • Published Nov 10, 2022