Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding • Paper • 2506.16035 • Published Jun 19 • 88
ZClip: Adaptive Spike Mitigation for LLM Pre-Training • Paper • 2504.02507 • Published Apr 3 • 88
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 16 items • Updated Jul 21 • 226