Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Inference Providers
·
Metrics for top trending models
Browse all models
Learn more
Model
Provider
Input $/1M
Output $/1M
Context
Latency(s)
Throughput(t/s)
Tools
Structured
Qwen/Qwen2.5-VL-72B-Instruct
Qwen2.5-VL-72B-Instruct
nebius
0.25
0.75
32000
0.43
33
No
Yes
Qwen/Qwen2.5-VL-72B-Instruct
Qwen2.5-VL-72B-Instruct
hyperbolic
cheapest
fastest
0.6
0.6
32768
0.63
38
No
No
Qwen/Qwen2.5-VL-72B-Instruct
Qwen2.5-VL-72B-Instruct
ovhcloud
-
-
-
0.53
32
No
Yes