Collections

Discover the best community collections!

Collections including paper arxiv:2507.01006

GLM-4.1V-Thinking

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 238
zai-org/GLM-4.1V-9B-Thinking

Image-Text-to-Text • 10B • Updated 26 days ago • 391k • 755
zai-org/GLM-4.1V-9B-Base

Image-Text-to-Text • 10B • Updated 26 days ago • 6.96k • 61
GLM-4.1V-9B-Thinking-API-Demo

Space • Running • 31 • THUDM/GLM-4.1V-9B-Thinking Demo

a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics

Updated about 10 hours ago

End-to-End Goal-Driven Web Navigation

Paper • 1602.02261 • Published Feb 6, 2016
Learning Language Games through Interaction

Paper • 1606.02447 • Published Jun 8, 2016
Naturalizing a Programming Language via Interactive Learning

Paper • 1704.06956 • Published Apr 23, 2017
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Paper • 1802.08802 • Published Feb 24, 2018 • 1

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 274
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 238
A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 258

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 309
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 238

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated 26 days ago • 41.8k • 692
zai-org/GLM-4.5V-FP8

Image-Text-to-Text • 108B • Updated 26 days ago • 112k • 38
GLM 4.5V Demo App

Space • Running • 23 • Demo App of dmg file
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 238

RL+reason model

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 28
Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published Jan 27 • 30
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization

Paper • 2412.12098 • Published Dec 16, 2024 • 4

💡AI Insight Talk Series 4: Multi Modal models

internlm/Intern-S1

Image-Text-to-Text • 241B • Updated 20 days ago • 53.2k • 249
Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256
MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 89
openbmb/MiniCPM-V-4_5

Image-Text-to-Text • 9B • Updated Oct 10 • 47.6k • 1.02k

excelletPaperForLLM

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 238

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 190
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 238
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published Jul 7 • 63
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28 • 56

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 314
Qwen/Qwen3-14B-GGUF

Text Generation • 15B • Updated May 9 • 10.6k • 53
Qwen/Qwen3-8B-GGUF

Text Generation • 8B • Updated May 21 • 63.1k • 72
Qwen/Qwen3-4B-GGUF

Text Generation • 4B • Updated May 21 • 9.02k • 36
