StepAudio 2.5 ASR Debuts with MTP Tech for Enhanced Transcription

Jiepao Xingchen has launched its advanced automatic speech recognition model, StepAudio 2.5 ASR, featuring Multi-Token Prediction (MTP) technology. This innovation accelerates inference speed and utilizes a 32K context window, allowing seamless transcription of 30-minute audio without slicing. The model's ASR+MTP-5 architecture boosts inference throughput by 400%, reduces latency by 60%, and cuts costs by 80%, achieving a peak rate of 500 tokens per second. Tests show improved accuracy and lower word error rates compared to competitors.

Source: Show Original

Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.

You may also like