Running 1.17k 1.17k FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality text data for LLMs using FineWeb
jinaai/jina-embeddings-v4-vllm-retrieval Visual Document Retrieval • 4B • Updated Sep 17 • 15.4k • 32