Alon Yair
alony
Chief Architect | Onvego
3 points

Tools alony is Following

Google Cloud Spe...
Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network mod...
Google Cloud Tex...
Google Cloud Text-to-Speech enables developers to synthesize natural-sounding speech with 30 voices, availa...
wav2letter++
wav2letter++ is a fast open source speech processing toolkit from the Speech Team at Facebook AI Research. ...
Botium Speech Pr...
It is a unified, developer-friendly API to the best available Speech-To-Text and Text-To-Speech services.
Kaldi
It is a state-of-the-art automatic speech recognition toolkit. It is intended for use by speech recognition...
Speechly
It can be used to complement any regular touch user interface with a real time voice user interface. It o...
Deepspeech
It is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devi...
Whisper
It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is ...
Picovoice Leopar...
It is an on-device speech-to-text engine. By processing voice data locally on the device, it offers private...