Tesseract OCR

A Story by
Tesseract Open Source OCR Engine

What is Tesseract OCR?

Tesseract was originally developed at Hewlett-Packard Laboratories Bristol and at Hewlett-Packard Co, Greeley Colorado between 1985 and 1994, with some more changes made in 1996 to port to Windows, and some C++izing in 1998. In 2005 Tesseract was open sourced by HP. Since 2006 it is developed by Google.
Tesseract OCR is a tool in the Image Analysis API category of a tech stack.

Who is using it?

11 companies use Tesseract OCR in their tech stacks, including Shelf, all, and The Paperless Project.

Shelf

all

The Paperless Project

X-Ray

Rubyroid-Labs-Tech-Stack

Data Engineering

Services

DLabs.AI

ESCHR

OptoSweden AB

Irene

Why developers like Tesseract OCR

Very lightweight library
Building training set is easy