Tesseract OCR

Tesseract Open Source OCR Engine

What is Tesseract OCR?

Tesseract was originally developed at Hewlett-Packard Laboratories Bristol and at Hewlett-Packard Co., Greeley, Colorado, between 1985 and 1994, with further changes in 1996 to port it to Windows and some conversion to C++ in 1998. HP open-sourced Tesseract in 2005. Since 2006 it has been developed by Google.
Tesseract OCR is a tool in the Image Analysis API category of a tech stack.

Who is using it?

13 companies use Tesseract OCR in their tech stacks, including Shelf, Foretag, and backend.

Shelf

Foretag

backend

The Paperless Project

X-Ray

Data Engineering

Rubyroid-Labs-Tech-Stack

Services

ESCHR

DLabs.AI

OptoSweden AB

Irene

Why developers like Tesseract OCR

Very lightweight library
Building a training set is easy