Tencent Cloud is set to launch the official version of its DeepSeek-V4 model on its TokenHub platform in mid-July. The new model will feature a 'factory-direct supply' approach and introduce a peak/off-peak billing mechanism. During off-peak hours, DeepSeek-V4-Pro will charge 0.025 yuan per cache hit, 3 yuan for inference input, and 6 yuan for inference output per million tokens. Peak hour rates will double to 0.05 yuan, 6 yuan, and 12 yuan respectively. DeepSeek-V4-Flash will have off-peak charges of 0.02 yuan per cache hit, 1 yuan for inference input, and 2 yuan for inference output, with peak rates set at 0.04 yuan, 2 yuan, and 4 yuan. Peak hours are defined as 9:00–12:00 and 14:00–18:00 (UTC+8). Adjustments to the Token Plan Enterprise Edition's credit deduction rules will also be implemented for different periods.
Tencent Cloud to Launch DeepSeek-V4 Model with New Pricing in July
Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.
