-
NousResearch/Llama-3.2-1B
Text Generation β’ 1B β’ Updated β’ 89.9k β’ 19 -
meta-llama/Llama-3.2-1B
Text Generation β’ 1B β’ Updated β’ 1.79M β’ 2.15k -
xai-org/grok-1
Text Generation β’ Updated β’ 2.49k β’ 2.37k -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation β’ 5B β’ Updated β’ 23.6k β’ 104
Collections
Discover the best community collections!
Collections including paper arxiv:2204.05149
-
557
Vision Arena (Testing VLMs side-by-side)
πΌDisplay image analysis results
-
The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink
Paper β’ 2204.05149 β’ Published β’ 9 -
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Paper β’ 2409.17146 β’ Published β’ 121
-
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Paper β’ 2501.18585 β’ Published β’ 61 -
RWKV-7 "Goose" with Expressive Dynamic State Evolution
Paper β’ 2503.14456 β’ Published β’ 153 -
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Paper β’ 2503.15265 β’ Published β’ 46 -
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Paper β’ 2503.15558 β’ Published β’ 50
-
NousResearch/Llama-3.2-1B
Text Generation β’ 1B β’ Updated β’ 89.9k β’ 19 -
meta-llama/Llama-3.2-1B
Text Generation β’ 1B β’ Updated β’ 1.79M β’ 2.15k -
xai-org/grok-1
Text Generation β’ Updated β’ 2.49k β’ 2.37k -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation β’ 5B β’ Updated β’ 23.6k β’ 104
-
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Paper β’ 2501.18585 β’ Published β’ 61 -
RWKV-7 "Goose" with Expressive Dynamic State Evolution
Paper β’ 2503.14456 β’ Published β’ 153 -
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Paper β’ 2503.15265 β’ Published β’ 46 -
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Paper β’ 2503.15558 β’ Published β’ 50
-
557
Vision Arena (Testing VLMs side-by-side)
πΌDisplay image analysis results
-
The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink
Paper β’ 2204.05149 β’ Published β’ 9 -
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Paper β’ 2409.17146 β’ Published β’ 121