MMOCR


Open-source toolbox for text detection and text recognition

What is MMOCR?

MMOCR is an open-source toolbox based on PyTorch and MMDetection for text detection, text recognition, and the corresponding downstream tasks, including key information extraction.
MMOCR is a tool in the Image Analysis API category of a tech stack.
MMOCR is an open-source tool, and its source repository is hosted on GitHub.


MMOCR's Features

  • Comprehensive pipeline
  • Multiple models
  • Modular design
  • Numerous utilities
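
The "comprehensive pipeline" and "modular design" points can be illustrated with a small, self-contained sketch. This is not MMOCR's API: every name below (`run_pipeline`, `toy_detect`, `toy_recognize`, `toy_extract`, `TextInstance`) is hypothetical, and the "image" is a toy dictionary standing in for real pixel data. It only shows the general idea of chaining text detection, text recognition, and key information extraction as independently swappable stages.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical types for illustration only; not taken from MMOCR.
Box = Tuple[int, int, int, int]   # (x_min, y_min, x_max, y_max)
ToyImage = Dict[Box, str]         # toy stand-in: region -> text "printed" there


@dataclass
class TextInstance:
    box: Box
    text: str = ""


def run_pipeline(
    image: ToyImage,
    detect: Callable[[ToyImage], List[Box]],
    recognize: Callable[[ToyImage, Box], str],
    extract: Callable[[List[TextInstance]], Dict[str, str]],
) -> Dict[str, str]:
    """Chain detection -> recognition -> key information extraction.

    Each stage is a plain callable, so one stage can be replaced
    without touching the others.
    """
    instances = [TextInstance(box=b, text=recognize(image, b)) for b in detect(image)]
    return extract(instances)


def toy_detect(image: ToyImage) -> List[Box]:
    # Stand-in detector: return regions in top-to-bottom, left-to-right order.
    return sorted(image.keys(), key=lambda b: (b[1], b[0]))


def toy_recognize(image: ToyImage, box: Box) -> str:
    # Stand-in recognizer: look up the text stored for the region.
    return image[box]


def toy_extract(instances: List[TextInstance]) -> Dict[str, str]:
    # Stand-in extractor: keep lines shaped like "Key: value".
    fields: Dict[str, str] = {}
    for inst in instances:
        key, sep, value = inst.text.partition(":")
        if sep:
            fields[key.strip().lower()] = value.strip()
    return fields


toy_image: ToyImage = {
    (10, 40, 200, 60): "Total: 12.50",
    (10, 10, 200, 30): "Date: 2021-04-07",
    (10, 70, 200, 90): "Thank you",
}

result = run_pipeline(toy_image, toy_detect, toy_recognize, toy_extract)
print(result)
```

In a real setup, the three callables would wrap trained models; the point of the sketch is only that a different detector or recognizer can be passed to `run_pipeline` without changing the rest of the code.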

MMOCR Alternatives & Comparisons

What are some alternatives to MMOCR?
Google Cloud Vision API
Google Cloud Vision API enables developers to understand the content of an image by encapsulating powerful machine learning models in an easy-to-use REST API.
Tesseract OCR
Tesseract was originally developed at Hewlett-Packard Laboratories Bristol and at Hewlett-Packard Co., Greeley, Colorado, between 1985 and 1994, with some more changes made in 1996 to port it to Windows, and some C++izing in 1998. In 2005 Tesseract was open sourced by HP. Since 2006 it has been developed by Google.
Amazon Rekognition
Amazon Rekognition is a service that makes it easy to add image analysis to your applications. With Rekognition, you can detect objects, scenes, and faces in images. You can also search and compare faces. Rekognition’s API enables you to quickly add sophisticated deep learning-based visual search and image classification to your applications.
Tesseract.js
This library supports over 60 languages, automatic text orientation and script detection, and a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser or on a server with Node.js.
It is a ready-to-use OCR tool that supports 40+ languages, including Chinese, Japanese, Korean, and Thai.
