
Model Pricing

Model Calls

Official Models

Official models are services published by the platform or its partners, providing a more stable and efficient experience. Pricing for model calls depends on the task type: text dialogue models are billed by the number of input and output tokens, while image and video generation models are billed per generated image or video.

Text Dialogue

| Model Name | Billing Unit | Input Price (RMB) | Output Price (RMB) |
|---|---|---|---|
| GpuGeek/DeepSeek-R1-671B | /Million Tokens | 8 | 32 |
| GpuGeek/DeepSeek-R1-Distill-Llama-70B | /Million Tokens | 2 | 8 |
| GpuGeek/DeepSeek-R1-Distill-Qwen-32B | /Million Tokens | 1.5 | 6 |
| GpuGeek/DeepSeek-R1-Distill-Qwen-14B | /Million Tokens | 0.6 | 2.4 |
| GpuGeek/DeepSeek-R1-Distill-Llama-8B | /Million Tokens | 0.4 | 0.8 |
| GpuGeek/DeepSeek-R1-Distill-Qwen-7B | /Million Tokens | 0.4 | 0.8 |
| GpuGeek/DeepSeek-R1-Distill-Qwen-1.5B | /Million Tokens | 0.2 | 0.4 |
| GpuGeek/qwen2.5-0.5B | /Million Tokens | 0.1 | 0.2 |
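
As a rough illustration of token-based billing, the sketch below estimates the cost of a single call from its input and output token counts. It assumes the per-million price is prorated linearly down to individual tokens; the source does not state rounding rules or minimum charges, so treat the result as an estimate. The function name `estimate_text_cost` and the `PRICES` dictionary are hypothetical helpers, not part of any platform SDK.

```python
from decimal import Decimal

# Prices in RMB per million tokens as (input, output), taken from the table above.
# Only a few models are listed here for brevity.
PRICES = {
    "GpuGeek/DeepSeek-R1-671B": (Decimal("8"), Decimal("32")),
    "GpuGeek/DeepSeek-R1-Distill-Qwen-14B": (Decimal("0.6"), Decimal("2.4")),
    "GpuGeek/qwen2.5-0.5B": (Decimal("0.1"), Decimal("0.2")),
}

MILLION = Decimal(1_000_000)


def estimate_text_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the RMB cost of one call.

    Assumption: linear proration of the per-million-token price,
    with no rounding and no minimum charge.
    """
    input_price, output_price = PRICES[model]
    return (input_price * input_tokens + output_price * output_tokens) / MILLION


if __name__ == "__main__":
    # 500,000 input tokens and 250,000 output tokens on the largest model.
    print(estimate_text_cost("GpuGeek/DeepSeek-R1-671B", 500_000, 250_000))
```

Under these assumptions, 500,000 input tokens (4 RMB) plus 250,000 output tokens (8 RMB) on GpuGeek/DeepSeek-R1-671B comes to 12 RMB. `Decimal` is used instead of `float` so that prices such as 0.1 do not pick up binary rounding error.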

Text-to-Image

| Model Name | Billing Unit | Output Price (RMB) |
|---|---|---|
| GpuGeek/Stable-Diffusion-3.5-Large-Turbo | /Image | 0.15 |
| GpuGeek/Cogview4-6B | /Image | 0.15 |

Text-to-Video

| Model Name | Duration & Resolution | Billing Unit | Output Price (RMB) |
|---|---|---|---|
| Vidu/text2video-1.5 | 4S 360P | /Video | 1.25 |
| Vidu/text2video-1.5-HD | 4S 720P | /Video | 3.13 |
| Vidu/text2video-1.5-FHD | 4S 1080P, 8S 720P | /Video | 6.25 |

Image-to-Video

| Model Name | Duration & Resolution | Billing Unit | Output Price (RMB) |
|---|---|---|---|
| Vidu/image2video-2.0 | 4S 720p | /Video | 1.25 |
| Vidu/image2video-2.0-HD | 4S 1080p, 8S 720p | /Video | 3.125 |
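
Image and video models are billed per generated item, so a batch estimate is simply the unit price multiplied by the number of outputs. The sketch below shows that arithmetic for a mixed batch. The helper name `estimate_generation_cost` and the `UNIT_PRICES` dictionary are hypothetical, and the sketch assumes every requested item is generated and billed once.

```python
from decimal import Decimal

# RMB per generated image or video, taken from the tables above.
# Only a few models are listed here for brevity.
UNIT_PRICES = {
    "GpuGeek/Cogview4-6B": Decimal("0.15"),
    "Vidu/text2video-1.5-HD": Decimal("3.13"),
    "Vidu/image2video-2.0-HD": Decimal("3.125"),
}


def estimate_generation_cost(jobs: dict[str, int]) -> Decimal:
    """Sum unit price x count over a mapping of model name -> number of outputs.

    Assumption: each output is billed exactly once at the listed price.
    """
    return sum(
        (UNIT_PRICES[model] * count for model, count in jobs.items()),
        Decimal(0),
    )


if __name__ == "__main__":
    # 20 images plus 4 image-to-video clips.
    print(estimate_generation_cost({
        "GpuGeek/Cogview4-6B": 20,
        "Vidu/image2video-2.0-HD": 4,
    }))
```

For this batch, 20 images at 0.15 RMB (3 RMB) plus 4 videos at 3.125 RMB (12.5 RMB) totals 15.5 RMB.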

Personal Public Models

Public models published by individual users are billed by runtime duration per call. The cost per unit of time depends on the hardware chosen by the publisher. Currently supported hardware types:

| Hardware Type | Billing Unit | Price (RMB) |
|---|---|---|
| RTX-4090-24G | /Card | 2.18/hour |

Private Models

Private models are billed based on the online duration of the hardware configured for the model. Online duration includes startup time, request processing time, and idle time after startup. When the model receives no requests for about 10 minutes, it switches to a "cold start" state, in which it no longer occupies hardware or incurs costs.

| Hardware Type | Billing Unit | Price (RMB) |
|---|---|---|
| RTX-4090-24G | /Card | 2.18/hour |
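
To make the online-duration rule concrete, the sketch below adds up startup, processing, and idle time for a sequence of requests and converts the total to a cost. Several things here are assumptions rather than documented behavior: the idle timeout is treated as exactly 600 seconds (the text says "about 10 minutes"), billing is prorated per second, a single card is used, requests are handled one at a time, and the startup duration is a made-up input. The function names are hypothetical.

```python
from decimal import Decimal

HOURLY_PRICE = Decimal("2.18")  # RMB per card-hour, from the table above
IDLE_TIMEOUT_S = 600            # assumption: "about 10 minutes" treated as exact


def billable_seconds(requests, startup_s, idle_timeout_s=IDLE_TIMEOUT_S):
    """Total online seconds for requests given as (arrival_s, processing_s),
    sorted by arrival time, starting from a cold model.

    Assumption: requests are processed sequentially on one card.
    """
    total = 0
    busy_end = None  # time at which the last request finished
    for arrival, processing_s in requests:
        if busy_end is not None and arrival <= busy_end + idle_timeout_s:
            # Model is still online: bill the idle gap, then the processing time.
            begin = max(arrival, busy_end)
            total += (begin - busy_end) + processing_s
            busy_end = begin + processing_s
        else:
            if busy_end is not None:
                # The previous session idled for the full timeout before going cold.
                total += idle_timeout_s
            # Cold start: bill startup plus processing.
            total += startup_s + processing_s
            busy_end = arrival + startup_s + processing_s
    if busy_end is not None:
        # Trailing idle period after the final request.
        total += idle_timeout_s
    return total


def online_cost(seconds, hourly_price=HOURLY_PRICE):
    """Convert online seconds to RMB, assuming per-second proration."""
    return hourly_price * Decimal(seconds) / Decimal(3600)


if __name__ == "__main__":
    # Three 30-second requests; the third arrives long after the model went cold.
    secs = billable_seconds([(0, 30), (300, 30), (3600, 30)], startup_s=60)
    print(secs, online_cost(secs))
```

In this example the model is online for 1,620 seconds in total, of which only 90 seconds are request processing. At 2.18 RMB per hour that is 0.981 RMB under the stated assumptions, which shows how idle time can dominate the bill for sparse traffic.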

Model Deployment

| Hardware Type | Billing Unit | Price (RMB) |
|---|---|---|
| RTX-4090-24G | /Card | 2.18/hour |