Tencent Cloud plans to launch the official version of the DeepSeek-V4 model on its TokenHub platform in mid-July. The model will be offered under a 'factory-direct supply' approach and will introduce peak/off-peak billing. During off-peak hours, DeepSeek-V4-Pro will cost 0.025 yuan for cache hits, 3 yuan for inference input, and 6 yuan for inference output, each per million tokens. Peak-hour rates double to 0.05 yuan, 6 yuan, and 12 yuan respectively. DeepSeek-V4-Flash will cost 0.02 yuan for cache hits, 1 yuan for inference input, and 2 yuan for inference output per million tokens during off-peak hours, rising to 0.04 yuan, 2 yuan, and 4 yuan at peak. Peak hours are defined as 9:00–12:00 and 14:00–18:00 (UTC+8). The credit deduction rules for the Token Plan Enterprise Edition will also be adjusted to differ by period.
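
To make the peak/off-peak arithmetic concrete, here is a minimal Python sketch that estimates the cost of a single request from the rates quoted above. It is an illustration only, not TokenHub's billing logic: the names `is_peak` and `estimate_cost` are hypothetical, the peak windows are assumed to be half-open (9:00 counts as peak, 12:00 does not), the schedule is assumed to apply every day of the week, and a request is assumed to be billed entirely at the rate in force at one timestamp. None of those details are stated in the announcement.

```python
from datetime import datetime, time, timedelta, timezone
from decimal import Decimal

UTC8 = timezone(timedelta(hours=8))  # peak hours are defined in UTC+8

# Off-peak prices in yuan per million tokens, as quoted in the article.
OFF_PEAK = {
    "DeepSeek-V4-Pro": {"cache_hit": Decimal("0.025"), "input": Decimal("3"), "output": Decimal("6")},
    "DeepSeek-V4-Flash": {"cache_hit": Decimal("0.02"), "input": Decimal("1"), "output": Decimal("2")},
}
PEAK_MULTIPLIER = Decimal("2")  # every quoted peak rate is exactly double the off-peak rate

# Assumption: windows are half-open, [start, end).
PEAK_WINDOWS = [(time(9, 0), time(12, 0)), (time(14, 0), time(18, 0))]


def is_peak(ts: datetime) -> bool:
    """Return True if a timezone-aware timestamp falls in a peak window (UTC+8)."""
    if ts.tzinfo is None:
        raise ValueError("ts must be timezone-aware")
    local = ts.astimezone(UTC8).time()
    return any(start <= local < end for start, end in PEAK_WINDOWS)


def estimate_cost(model: str, ts: datetime, cache_hit_tokens: int = 0,
                  input_tokens: int = 0, output_tokens: int = 0) -> Decimal:
    """Estimate the cost in yuan of one request billed at the rate in force at ts."""
    rates = OFF_PEAK[model]
    multiplier = PEAK_MULTIPLIER if is_peak(ts) else Decimal("1")
    weighted = (rates["cache_hit"] * cache_hit_tokens
                + rates["input"] * input_tokens
                + rates["output"] * output_tokens)
    return weighted * multiplier / Decimal(1_000_000)
```

Under these assumptions, one million input tokens plus one million output tokens on DeepSeek-V4-Pro would come to 9 yuan off-peak and 18 yuan at peak. `Decimal` is used rather than floats so that small per-token amounts such as 0.025 yuan per million tokens do not accumulate rounding error.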