NVIDIA has unveiled cost details for its Blackwell GPU, highlighting a significant shift in cost efficiency for AI inference tasks. While the Blackwell GPU's unit price is nearly double that of its predecessor, the Hopper, at $2.65 per hour compared to $1.41, it offers a dramatic reduction in cost per token. The Blackwell GPU achieves a 65x increase in token throughput, reducing the cost per million tokens from $4.20 to $0.12, assuming all software optimizations are enabled. The analysis, based on the DeepSeek-R1 model, shows that with optimizations like FP4 low-precision inference and Multi-Token Prediction (MTP), the cost per million tokens can drop to as low as $0.11. Without MTP, the cost is approximately $2.35 per million tokens. These figures underscore the potential for significant cost savings in AI inference tasks using the Blackwell GPU, although results may vary with different models and configurations.