NVIDIA has released cost figures for its Blackwell GPU that point to a large gain in cost efficiency for AI inference. A Blackwell GPU costs nearly twice as much per hour as its predecessor, Hopper ($2.65 versus $1.41), but delivers about 65x the token throughput. Because throughput rises far faster than the hourly price, the cost per million tokens falls from $4.20 to $0.12, a roughly 35x reduction, assuming all software optimizations are enabled.
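The relationship between hourly price, throughput, and cost per token can be checked with simple arithmetic. The sketch below is illustrative only: the function and variable names are hypothetical, and it assumes the quoted prices are per GPU-hour and that cost per token is simply hourly price divided by tokens produced per hour. It uses only the four figures quoted above.

```python
# Illustrative arithmetic only; names are hypothetical and the model
# (cost per token = hourly price / tokens per hour) is an assumption.

def implied_tokens_per_gpu_hour(price_per_hour: float,
                                cost_per_million_tokens: float) -> float:
    """Tokens per GPU-hour implied by an hourly price and a cost per million tokens."""
    return price_per_hour / cost_per_million_tokens * 1_000_000


# Figures quoted in the article (USD).
hopper_price, blackwell_price = 1.41, 2.65   # price per GPU-hour
hopper_cost, blackwell_cost = 4.20, 0.12     # cost per million tokens

# How much more expensive Blackwell is per hour.
price_ratio = blackwell_price / hopper_price

# How much cheaper each token becomes.
cost_ratio = hopper_cost / blackwell_cost

# Throughput gain implied by the two ratios above.
throughput_ratio = (
    implied_tokens_per_gpu_hour(blackwell_price, blackwell_cost)
    / implied_tokens_per_gpu_hour(hopper_price, hopper_cost)
)

print(f"price ratio:      {price_ratio:.2f}x")
print(f"cost reduction:   {cost_ratio:.1f}x")
print(f"throughput ratio: {throughput_ratio:.1f}x")
```

Under these assumptions the three ratios are tied together: the throughput gain equals the price ratio multiplied by the cost reduction. With the article's figures that works out to about 1.88x, 35x, and just under 66x, which is in line with the quoted 65x throughput increase.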
The analysis is based on the DeepSeek-R1 model. With optimizations such as FP4 low-precision inference and Multi-Token Prediction (MTP), the cost per million tokens can drop to as low as $0.11; without MTP, it is approximately $2.35. The figures point to substantial potential savings for AI inference on Blackwell, although results may vary with other models and configurations.
NVIDIA's Blackwell GPU Doubles Price, Slashes Token Cost by 35x
