Datalab has released Surya OCR 2, a new multilingual OCR model achieving 83.3% accuracy on the olmOCR-bench, setting a new standard for models under 3 billion parameters. Despite having only 650 million parameters, Surya OCR 2 outperforms its predecessor, which had 9 billion parameters, by achieving a Pareto optimal balance between parameter count and accuracy. The model integrates layout analysis, text recognition, and table detection into a single vision-language model, while maintaining separate lightweight models for text line detection and OCR error detection.
Surya OCR 2 supports 91 languages with an overall pass rate of 87.2% and features optimizations for damaged documents and handwritten text. It offers high deployment efficiency, achieving 5.35 pages per second on NVIDIA GPUs and supporting local inference on Apple M1 devices. The model is open-sourced under the Apache 2.0 license, with weights available under the OpenRAIL-M license. Datalab also introduced a paid API for the enhanced 4-billion-parameter Chandra 2 model.
Surya OCR 2 Sets New Benchmark with 83.3% Accuracy and 650 Million Parameters
Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.
