Collections

Collections including paper arxiv:2503.16365
Cognition
Perception and abstraction. Each modality is tokenized and embedded into vectors for the model to comprehend.
VisionLM
Collection by about 8 hours ago
VLA Models
Vision Language Models for Robotics
JARVIS-VLA-v1
Vision-Language-Action Models in Minecraft.
GUI Agents
Collection by May 16
Multimodal LLM
Collection by about 4 hours ago
Papers
Collection by Apr 10
Inbox
Collection by Oct 17
readings
Collection by 5 days ago