Xin Zhang's picture

Xin Zhang

izhx

·

https://izhx.github.io/

AI & ML interests

NLP, IR, and Multimodal.

Recent Activity

published a model 24 days ago

vec-ai/lychee-rerank-mm

upvoted a collection 26 days ago

new activity 27 days ago

izhx/COMP5423-25Fall-HQ-small:Upload data v1

View all activity

Organizations

upvoted a collection 26 days ago

NanoBEIR 🍺

A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 22

upvoted an article 30 days ago

Article

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

By

•

Jul 9, 2024

• 72

upvoted an article about 1 month ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

Oct 1

• 123

upvoted a paper about 2 months ago

TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them

Paper • 2509.21117 • Published Sep 25 • 29

upvoted a paper 3 months ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Paper • 2508.06600 • Published Aug 8 • 40

upvoted a collection 3 months ago

BrowseComp-Plus

7 items • Updated 24 days ago • 7

upvoted an article 3 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 954

upvoted a paper 5 months ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5 • 74

upvoted 2 collections 5 months ago

Qwen3-Reranker

3 items • Updated Jul 21 • 64

Qwen3-Embedding

6 items • Updated Jul 21 • 134

upvoted a collection 7 months ago

MTEB Papers

This is a collection of MTEB papers (not exhaustive). • 7 items • Updated Apr 16 • 2

upvoted a paper 7 months ago

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published Apr 14 • 20

upvoted a paper 9 months ago

GME: Improving Universal Multimodal Retrieval by Multimodal LLMs

Paper • 2412.16855 • Published Dec 22, 2024 • 5

upvoted a collection 11 months ago

GME Models

General Multimodal Embedding Models Released by Tongyi Lab of Alibaba Group • 3 items • Updated Dec 24, 2024 • 8

upvoted 2 articles about 1 year ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

and 1 other •

Oct 14, 2024

• 97

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

Aug 21, 2024

• 42

upvoted a paper over 1 year ago

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval

Paper • 2407.19669 • Published Jul 29, 2024 • 25

upvoted 3 collections over 1 year ago

GTE models

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated Jan 21 • 31

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Jul 21 • 372

Nomic Embed Vision

Vision Encoders aligned to Nomic Embed Text making Nomic Embed multimodal! • 2 items • Updated Jun 5, 2024 • 9