Tesseract was originally developed at Hewlett-Packard Laboratories Bristol and at Hewlett-Packard Co., Greeley, Colorado, between 1985 and 1994, with further changes in 1996 to port it to Windows and a partial conversion to C++ in 1998. HP open-sourced Tesseract in 2005, and Google took over development in 2006. | MMOCR is an open-source toolbox based on PyTorch and mmdetection for text detection, text recognition, and the corresponding downstream tasks, including key information extraction. |
| - | Comprehensive pipeline; Multiple models; Modular design; Numerous utilities |
| Statistics | Tesseract | MMOCR |
| GitHub Stars | 70.7K | 4.7K |
| GitHub Forks | 10.4K | 776 |
| Stacks | 97 | 0 |
| Followers | 287 | 5 |
| Votes | 8 | 0 |
Pros & Cons | |
Pros
Cons
| No community feedback yet |
Integrations | |
| No integrations available | |

Google Cloud Vision API enables developers to understand the content of an image by encapsulating powerful machine learning models in an easy-to-use REST API.
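
As a rough illustration of what "an easy-to-use REST API" means for OCR, the sketch below builds the JSON body for a `TEXT_DETECTION` request to the `images:annotate` endpoint and pulls the recognized text out of a response. It makes no network call and omits authentication; the helper names `build_text_detection_request` and `extract_text` are made up for this example, and the sample bytes are placeholders rather than a real image.

```python
import base64
import json

# Public REST endpoint for image annotation (Cloud Vision API v1).
ANNOTATE_URL = "https://vision.googleapis.com/v1/images:annotate"


def build_text_detection_request(image_bytes: bytes) -> dict:
    """Build the JSON body for a single TEXT_DETECTION request.

    Hypothetical helper: the API expects the image as base64 text
    inside requests[].image.content.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "requests": [
            {
                "image": {"content": encoded},
                "features": [{"type": "TEXT_DETECTION"}],
            }
        ]
    }


def extract_text(response: dict) -> str:
    """Return the full detected text, or "" if nothing was found.

    The first entry of textAnnotations holds the whole text block;
    later entries describe individual words.
    """
    responses = response.get("responses", [])
    if not responses:
        return ""
    annotations = responses[0].get("textAnnotations", [])
    return annotations[0]["description"] if annotations else ""


if __name__ == "__main__":
    # Placeholder bytes stand in for the contents of a real image file.
    body = build_text_detection_request(b"placeholder image bytes")
    print("POST", ANNOTATE_URL)
    print(json.dumps(body, indent=2))
```

Sending the body is then an ordinary authenticated HTTP POST; keeping request construction separate from transport makes the payload easy to inspect and test.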

Amazon Rekognition is a service that makes it easy to add image analysis to your applications. With Rekognition, you can detect objects, scenes, and faces in images. You can also search and compare faces. Rekognition’s API enables you to quickly add sophisticated deep learning-based visual search and image classification to your applications.

Tesseract.js supports over 60 languages, automatic text orientation and script detection, and a simple interface for reading paragraph, word, and character bounding boxes. It can run either in a browser or on a server with Node.js.
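
To make the idea of word-level bounding boxes concrete, here is a minimal sketch that parses the TSV output of the Tesseract command-line tool (for example, `tesseract input.png output tsv`) rather than the Tesseract.js interface itself. The `WordBox` type, the `parse_word_boxes` helper, and the sample TSV rows are illustrative assumptions, not real OCR output.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class WordBox:
    text: str
    left: int
    top: int
    width: int
    height: int
    conf: float


# Tesseract TSV levels: 1=page, 2=block, 3=paragraph, 4=line, 5=word.
WORD_LEVEL = "5"

# Hand-written sample in the Tesseract TSV column layout (not real output).
SAMPLE_TSV = (
    "level\tpage_num\tblock_num\tpar_num\tline_num\tword_num\t"
    "left\ttop\twidth\theight\tconf\ttext\n"
    "1\t1\t0\t0\t0\t0\t0\t0\t640\t480\t-1\t\n"
    "5\t1\t1\t1\t1\t1\t36\t92\t60\t30\t96.5\tHello\n"
    "5\t1\t1\t1\t1\t2\t110\t92\t70\t30\t91.2\tworld\n"
)


def parse_word_boxes(tsv_text: str) -> list[WordBox]:
    """Keep only word-level rows that carry non-empty text."""
    reader = csv.DictReader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    boxes = []
    for row in reader:
        if row["level"] != WORD_LEVEL:
            continue
        text = (row["text"] or "").strip()
        if not text:
            continue
        boxes.append(
            WordBox(
                text=text,
                left=int(row["left"]),
                top=int(row["top"]),
                width=int(row["width"]),
                height=int(row["height"]),
                conf=float(row["conf"]),
            )
        )
    return boxes


if __name__ == "__main__":
    for box in parse_word_boxes(SAMPLE_TSV):
        print(box)
```

Filtering on the `level` column is what separates words from the enclosing page, block, paragraph, and line rows; choosing a different level would yield boxes at that granularity instead.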

Editaimg helps you edit images with AI: remove backgrounds, edit text on images, upscale resolution, retouch faces, and export in popular formats.

An AI-powered OCR and document extraction API that converts documents to structured JSON in seconds. The vendor claims 98%+ accuracy for invoices, Aadhaar, PAN, salary slips, and 20+ other document types. Pricing is per page.

Automates invoice processing with an invoice OCR API to save time, reduce errors, and streamline financial workflows in ERP systems.

An AI watermark remover that removes logos, text, and stamps from photos in seconds.

Lets you upload any image, describe the edit you want, and download the result in seconds. Features include image understanding, style transformations, color processing, and batch exports. The Gemini AI photo editor pipeline handles portraits, products, and marketing visuals securely.

Extracts text from images and PDFs, with a claimed 99.9% accuracy. Supports handwriting recognition and 100+ languages. No registration is required. Billed as a free AI-powered OCR solution.

An image-to-table extraction utility that lets developers parse JPG/PNG images containing tabular data and convert them into machine-readable formats (Excel, CSV, JSON) for data processing pipelines.
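
The conversion step described above can be sketched in a few lines, assuming an OCR pass has already produced cell texts with pixel positions. This is not the utility's actual method: the `(text, x, y)` cell format, the `row_tolerance` parameter, and all function names are hypothetical, and the sample cells are invented.

```python
import csv
import io
import json


def cells_to_rows(cells, row_tolerance=10):
    """Group (text, x, y) cells into rows by similar y, then order by x.

    A cell joins the current row when its y is within row_tolerance
    pixels of the first cell placed in that row.
    """
    rows = []
    for text, x, y in sorted(cells, key=lambda c: (c[2], c[1])):
        if rows and abs(y - rows[-1]["y"]) <= row_tolerance:
            rows[-1]["cells"].append((x, text))
        else:
            rows.append({"y": y, "cells": [(x, text)]})
    return [[text for _, text in sorted(row["cells"])] for row in rows]


def rows_to_csv(rows):
    """Serialize rows as CSV text."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


def rows_to_json(rows):
    """Treat the first row as the header and emit one object per data row."""
    header, *body = rows
    return json.dumps([dict(zip(header, row)) for row in body])


if __name__ == "__main__":
    # Invented OCR cells: (text, x, y) in pixels, deliberately out of order.
    cells = [
        ("Qty", 200, 12),
        ("Item", 20, 10),
        ("Apple", 20, 52),
        ("3", 200, 50),
    ]
    rows = cells_to_rows(cells)
    print(rows_to_csv(rows))
    print(rows_to_json(rows))
```

A fixed pixel tolerance is the simplest way to absorb small vertical jitter between cells of the same row; skewed scans or tables with merged cells would need a more careful layout analysis.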