Collections
Collections including paper arxiv:2407.21783
- An Introduction to Vision-Language Modeling
  Paper • 2405.17247 • Published • 90
- Visual Instruction Tuning
  Paper • 2304.08485 • Published • 20
- Improved Baselines with Visual Instruction Tuning
  Paper • 2310.03744 • Published • 39
- PALO: A Polyglot Large Multimodal Model for 5B People
  Paper • 2402.14818 • Published • 24

- The Llama 3 Herd of Models
  Paper • 2407.21783 • Published • 117
- Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
  Paper • 2409.12191 • Published • 78
- Baichuan Alignment Technical Report
  Paper • 2410.14940 • Published • 51
- A Survey of Small Language Models
  Paper • 2410.20011 • Published • 46

- Qwen2.5 Technical Report
  Paper • 2412.15115 • Published • 377
- Qwen2.5-Coder Technical Report
  Paper • 2409.12186 • Published • 152
- Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement
  Paper • 2409.12122 • Published • 4
- Qwen2.5-VL Technical Report
  Paper • 2502.13923 • Published • 211

- Apple Intelligence Foundation Language Models
  Paper • 2407.21075 • Published • 5
- The Llama 3 Herd of Models
  Paper • 2407.21783 • Published • 117
- Nemotron-4 340B Technical Report
  Paper • 2406.11704 • Published
- Gemma 2: Improving Open Language Models at a Practical Size
  Paper • 2408.00118 • Published • 79

- RLHF Workflow: From Reward Modeling to Online RLHF
  Paper • 2405.07863 • Published • 71
- Chameleon: Mixed-Modal Early-Fusion Foundation Models
  Paper • 2405.09818 • Published • 132
- Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models
  Paper • 2405.15574 • Published • 55
- An Introduction to Vision-Language Modeling
  Paper • 2405.17247 • Published • 90