Baidu has released version 3.5 of its open-source OCR toolkit, PaddleOCR. The update introduces PaddleOCR.js, a browser inference SDK that runs PP-OCRv5 directly in the browser with WebGPU and Wasm acceleration. Because inference happens client-side, the data being processed stays within the browser. PaddleOCR can now also convert Word, Excel, and PPT documents to Markdown in one click.
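To illustrate what "WebGPU and Wasm acceleration" typically means in practice, a browser app usually prefers WebGPU where the browser exposes it and falls back to WebAssembly otherwise. The sketch below shows only that generic selection step. It is not PaddleOCR.js's API: the names `pickBackend`, `Backend`, and `BrowserLike` are hypothetical, and the article does not describe how the SDK actually chooses a backend.

```typescript
// Hypothetical helper for illustration only; not part of PaddleOCR.js.
type Backend = "webgpu" | "wasm" | "none";

// Minimal view of the browser globals this check needs.
// Passing them in explicitly keeps the function testable outside a browser.
interface BrowserLike {
  navigator?: { gpu?: unknown };
  WebAssembly?: unknown;
}

function pickBackend(env: BrowserLike): Backend {
  // `navigator.gpu` is the WebGPU entry point. Its presence only means the
  // API is exposed; requesting an adapter can still fail at runtime, so a
  // real application should be ready to fall back after this check.
  if (env.navigator?.gpu != null) {
    return "webgpu";
  }
  // The global `WebAssembly` namespace is an object in supporting runtimes.
  if (env.WebAssembly != null && typeof env.WebAssembly === "object") {
    return "wasm";
  }
  return "none";
}
```

In a page, the function would be called with the real globals (for example, an object built from `navigator` and `WebAssembly`). Either way, inference runs on the user's device, which is what keeps the data inside the browser.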
The update also adds a Transformers backend, which provides access to 20 primary models via Hugging Face and lets users switch among PaddlePaddle static graph, PaddlePaddle dynamic graph, and Transformers modes. In addition, results from the PaddleOCR-VL series, PP-StructureV3, and PP-DocTranslation can now be exported in DOCX format.
Baidu's PaddleOCR 3.5 Debuts with Enhanced Browser and Document Features
