It is a unified, developer-friendly API to the best available Speech-To-Text and Text-To-Speech services.
It can be used to complement any regular touch user interface with a real time voice user interface.
It offers real time feedback for faster and more intuitive experience that enables end user to recover from possible errors quickly and with no interruptions.
wav2letter++ is a fast open source speech processing toolkit from the Speech Team at Facebook AI Research. It is written entirely in C++ and uses the ArrayFire tensor library and the flashlight machine learning library for maximum efficiency. Our approach is detailed in this arXiv paper.
It is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
It is an On-Premises, Streaming Speech Recognition System built with PyTorch and fastai.