Tongyi Lab launched its latest speech recognition model, Fun-ASR 1.5, on April 20. Now available through Alibaba Cloud's Bailian and the ModelScope community, the single model supports 30 languages, seven major Chinese dialect groups, and more than 20 regional accents, eliminating the need for a separate model per dialect. In internal tests, the character error rate in dialect scenarios fell 56.2% compared with the previous version, and five dialects exceeded 90% accuracy.
The model also improves recognition of classical poetry, reaching 97% character-level accuracy. By handling dialects in one unified system, it addresses the long-tail challenge of Chinese dialect speech recognition and removes the need for multiple recognition pipelines. That simplifies deployment and makes dialect recognition commercially viable for applications such as educational live streaming, local government hotlines, and interview transcription.
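For readers unfamiliar with the metric, character error rate (CER) is commonly computed as the character-level edit distance between a reference transcript and the recognizer's output, divided by the reference length. The sketch below shows that standard calculation and how a relative reduction between two versions would be derived. It is not Tongyi Lab's evaluation code. The report does not say whether the 56.2% figure is relative or absolute, so the relative reading is an assumption here, and the CER values in the example are made up for illustration.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Character-level Levenshtein distance between reference and hypothesis."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning i reference characters into an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # reference character missing from hypothesis
                curr[j - 1] + 1,     # extra character in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference transcript must not be empty")
    return edit_distance(ref, hyp) / len(ref)


def relative_reduction(old_cer: float, new_cer: float) -> float:
    """Fraction by which the error rate dropped relative to the old value."""
    if old_cer <= 0:
        raise ValueError("old_cer must be positive")
    return (old_cer - new_cer) / old_cer


if __name__ == "__main__":
    # One substituted character out of four in the reference.
    print(f"CER: {cer('abcd', 'abxd'):.2f}")
    # Hypothetical CER values, NOT figures from the Fun-ASR report.
    print(f"Relative reduction: {relative_reduction(0.20, 0.0876):.1%}")
```

Under this reading, a model whose dialect CER went from a hypothetical 20% to about 8.8% would show the kind of relative drop the announcement describes. Accuracy figures such as "over 90%" are often reported as one minus CER, though the announcement does not specify its definition.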
Tongyi Lab Unveils Fun-ASR 1.5 with Advanced Dialect Recognition
