Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jitesh Jain's picture
21 4 9

Jitesh Jain

praeclarumjj3
frimelle's profile picture Flying-Lynx's profile picture 21world's profile picture
·
https://praeclarumjj3.github.io/
  • praeclarumjj
  • praeclarumjj3

AI & ML interests

None yet

Recent Activity

updated a Space 25 days ago
shi-labs/VisPer-LM
updated a Space about 2 months ago
shi-labs/VisPer-LM
updated a Space about 2 months ago
shi-labs/VisPer-LM
View all activity

Organizations

Ai2's profile picture SHI Labs's profile picture

authored 2 papers 11 months ago

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Paper • 2405.05949 • Published May 9, 2024 • 3

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Paper • 2412.09585 • Published Dec 12, 2024 • 11
authored a paper almost 2 years ago

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Paper • 2312.14233 • Published Dec 21, 2023 • 17
authored 3 papers over 2 years ago

Matting Anything

Paper • 2306.05399 • Published Jun 8, 2023 • 6

Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand

Paper • 2208.03382 • Published Aug 5, 2022

OneFormer: One Transformer to Rule Universal Image Segmentation

Paper • 2211.06220 • Published Nov 10, 2022
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs