Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
qihoo360 's Collections
RefVTON
RzenEmbed
TinyR1
FG-CLIP 2
360Zhinao3
FG-CLIP
360Zhinao
360Zhinao2
Light-R1
Light-IF

FG-CLIP 2

updated 13 days ago

FG-CLIP 2 is the foundation model for fine-grained vision-language understanding in both English and Chinese.

Upvote
5

  • FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model

    Paper • 2510.10921 • Published Oct 13 • 10

  • qihoo360/fg-clip2-base

    Zero-Shot Image Classification • 0.4B • Updated 13 days ago • 4.07k • 21

  • qihoo360/fg-clip2-large

    Zero-Shot Image Classification • 0.9B • Updated 30 days ago • 612 • 9

  • qihoo360/fg-clip2-so400m

    Zero-Shot Image Classification • 1B • Updated 30 days ago • 400 • 5

  • qihoo360/LIT-CN

    Updated 30 days ago • 197 • 1

  • qihoo360/BoxClass-CN

    Updated Oct 15 • 98 • 1

  • qihoo360/DOCCI-CN

    Viewer • Updated 30 days ago • 5k • 248 • 1

  • qihoo360/DCI-CN

    Updated 30 days ago • 374

  • Running
    1

    FG CLIP2 Densefeature Demo

    🦀
    1

    Visualize image similarity to labels


  • Running
    1

    FG CLIP2 Retrieval Demo

    💻
    1

    Classify images based on given labels

Upvote
5
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs