French AI company Mistral AI has unveiled Mistral OCR, a new Optical Character Recognition (OCR) API that enhances document analysis. The tool processes images and PDFs, accurately extracting structured text, media, tables, and equations.
“Roughly 90% of the world’s organisational data is stored as documents, and to harness this potential, we’re introducing Mistral OCR,” said Mistral AI. The API integrates with Retrieval-Augmented Generation (RAG) systems, making it suitable for processing multimodal documents such as slides and complex PDFs.
Mistral OCR is now the default model for document understanding on Le Chat and is available via the API as ‘mistral-ocr-latest’ at 1,000 pages per dollar, with batch inference doubling that efficiency.
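To put the pricing in concrete terms, here is a minimal back-of-the-envelope sketch in Python. It uses the 1,000 pages per dollar figure quoted above and assumes that "doubling efficiency" means twice as many pages per dollar in batch mode. The function and constant names are illustrative and are not part of Mistral's API.

```python
# Rough cost estimate based on the quoted list price.
# Assumption: batch inference yields 2x pages per dollar.
PAGES_PER_DOLLAR = 1000   # figure quoted in the article
BATCH_MULTIPLIER = 2      # assumed reading of "doubling efficiency"


def estimate_cost_usd(pages: int, batch: bool = False) -> float:
    """Return the approximate cost in US dollars for OCR-ing `pages` pages."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    rate = PAGES_PER_DOLLAR * (BATCH_MULTIPLIER if batch else 1)
    return pages / rate


if __name__ == "__main__":
    print(estimate_cost_usd(50_000))              # standard pricing
    print(estimate_cost_usd(50_000, batch=True))  # batch pricing
```

Under these assumptions, a 50,000-page archive would cost about $50 at the standard rate and about $25 in batch mode.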
The API is available on Mistral’s developer suite, La Plateforme, and will soon be available through cloud and inference partners, as well as for on-premises deployment.
Mistral OCR supports multilingual and multimodal content, outperforming leading OCR models in benchmarks. It has been tested against Google Document AI, Azure OCR, Gemini models, and GPT-4o, scoring 94.89 overall, with high performance on mathematical expressions, scanned documents, and tables.
Mistral OCR can handle a diverse range of scripts, fonts, and languages. “This versatility is crucial for both global organisations that handle documents from diverse linguistic backgrounds, as well as hyperlocal businesses serving niche markets,” the company said.
The API processes up to 2,000 pages per minute on a single node. It also supports “doc-as-prompt” functionality, allowing structured output extraction in formats like JSON. This feature enables integration with downstream workflows.
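The throughput figure can be turned into a rough lower bound on processing time. The Python sketch below treats 2,000 pages per minute per node as a best-case rate and assumes, purely for illustration, that throughput scales linearly with the number of nodes, which the announcement does not claim. The names are hypothetical and not part of any Mistral SDK.

```python
# Best-case processing time from the quoted peak throughput.
# Assumption: linear scaling across nodes (not stated by Mistral AI).
MAX_PAGES_PER_MINUTE_PER_NODE = 2000  # "up to" figure quoted in the article


def min_processing_minutes(pages: int, nodes: int = 1) -> float:
    """Return the minimum minutes needed to process `pages` pages."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    if nodes < 1:
        raise ValueError("nodes must be at least 1")
    return pages / (MAX_PAGES_PER_MINUTE_PER_NODE * nodes)


if __name__ == "__main__":
    print(min_processing_minutes(50_000))           # single node
    print(min_processing_minutes(50_000, nodes=5))  # five nodes, assumed linear
```

Because the quoted rate is an upper limit, real workloads should be expected to take longer than these estimates.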
Beta customers are using Mistral OCR for scientific research, historical preservation, customer service, and technical literature indexing. Research institutions have used it to convert academic papers into AI-ready formats, while heritage organisations are digitising historical records. Customer service teams are transforming manuals into searchable knowledge bases.
For enterprises handling sensitive data, Mistral AI offers a self-hosted deployment option. “Organisations with strict data privacy requirements can maintain full control over their infrastructure,” Mistral AI said.
Mistral AI plans to improve the model further and expand on-premises deployment in the coming weeks.
The post Mistral AI Launches OCR API, Beats Azure OCR, Google Gemini, and OpenAI GPT-4o appeared first on Analytics India Magazine.