Need advice about which tool to choose?Ask the StackShare community!
Amazon Elastic Transcoder vs Tesseract OCR: What are the differences?
What is Amazon Elastic Transcoder? Media transcoding in the cloud using Amazon EC2. Convert or transcode media files from their source format into versions that will playback on devices like smartphones, tablets and PCs. Create a transcoding “job” specifying the location of your source media file and how you want it transcoded. Amazon Elastic Transcoder also provides transcoding presets for popular output formats. All these features are available via service API, AWS SDKs and the AWS Management Console.
What is Tesseract OCR? Tesseract Open Source OCR Engine. Tesseract was originally developed at Hewlett-Packard Laboratories Bristol and at Hewlett-Packard Co, Greeley Colorado between 1985 and 1994, with some more changes made in 1996 to port to Windows, and some C++izing in 1998. In 2005 Tesseract was open sourced by HP. Since 2006 it is developed by Google.
Amazon Elastic Transcoder belongs to "Media Transcoding" category of the tech stack, while Tesseract OCR can be primarily classified under "Image Analysis API".
Tesseract OCR is an open source tool with 27.8K GitHub stars and 5.31K GitHub forks. Here's a link to Tesseract OCR's open source repository on GitHub.
Crowdsourced Testing, N49, and MyBhutan are some of the popular companies that use Amazon Elastic Transcoder, whereas Tesseract OCR is used by Shelf, ESCHR, and DLabs. Amazon Elastic Transcoder has a broader approval, being mentioned in 28 company stacks & 5 developers stacks; compared to Tesseract OCR, which is listed in 6 company stacks and 6 developer stacks.
AWS Rekognition has an OCR feature but can recognize only up to 50 words per image, which is a deal-breaker for us. (see my tweet).
Also, we discovered fantastic speed and quality improvements in the 4.x versions of Tesseract. Meanwhile, the quality of AWS Rekognition's OCR remains to be mediocre in comparison.
We run Tesseract serverlessly in AWS Lambda via aws-lambda-tesseract library that we made open-source.
Pros of Amazon Elastic Transcoder
Pros of Tesseract OCR
- Building training set is easy5
- Very lightweight library2
Sign up to add or upvote prosMake informed product decisions
Cons of Amazon Elastic Transcoder
Cons of Tesseract OCR
- Works best with white background and black text1