Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Inference Providers
·
Metrics for top trending models
Browse all models
Learn more
Model
Provider
Input $/1M
Output $/1M
Context
Latency(s)
Throughput(t/s)
Tools
Structured
Qwen/Qwen2.5-VL-72B-Instruct
Qwen2.5-VL-72B-Instruct
nebius
fastest
0.25
0.75
32000
0.39
36
No
Yes
Qwen/Qwen2.5-VL-72B-Instruct
Qwen2.5-VL-72B-Instruct
hyperbolic
cheapest
0.6
0.6
32768
0.67
33
No
No
Qwen/Qwen2.5-VL-72B-Instruct
Qwen2.5-VL-72B-Instruct
ovhcloud
-
-
-
0.60
32
No
Yes