Alon Yair
alony
Chief Architect | Onvego
3 points
Tools alony is Following
Google Cloud Spe...
cloud.google.com/speech
Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network mod...
Google Cloud Tex...
cloud.google.com/text-to-sp...
Google Cloud Text-to-Speech enables developers to synthesize natural-sounding speech with 30 voices, availa...
wav2letter++
github.com/facebookresearch...
wav2letter++ is a fast open source speech processing toolkit from the Speech Team at Facebook AI Research. ...
Botium Speech Pr...
github.com/codeforequity-at...
It is a unified, developer-friendly API to the best available Speech-To-Text and Text-To-Speech services.
Kaldi
github.com/kaldi-asr/kaldi
It is a state-of-the-art automatic speech recognition toolkit. It is intended for use by speech recognition...
Speechly
speechly.com
It can be used to complement any regular touch user interface with a real-time voice user interface. It o...
DeepSpeech
github.com/mozilla/DeepSpeech
It is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devi...
Whisper
github.com/openai/whisper
It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is ...
Picovoice Leopar...
picovoice.ai/platform/cat
It is an on-device speech-to-text engine. By processing voice data locally on the device, it offers private...