SpeechPy logo

SpeechPy

💬A Library for Speech Processing and Recognition
1
11
+ 1
0

What is SpeechPy?

The purpose of this project is to provide a package for speech processing and feature extraction. This library provides most frequent used speech features including MFCCs and filterbank energies alongside with the log-energy of filterbanks.
SpeechPy is a tool in the Speech Recognition Tools category of a tech stack.
SpeechPy is an open source tool with 880 GitHub stars and 105 GitHub forks. Here’s a link to SpeechPy's open source repository on GitHub

Who uses SpeechPy?

Developers

SpeechPy Integrations

SpeechPy's Features

  • Mel Frequency Cepstral Coefficients(MFCCs)
  • Filterbank Energies
  • Log Filterbank Energies

SpeechPy Alternatives & Comparisons

What are some alternatives to SpeechPy?
Kaldi
It is a state-of-the-art automatic speech recognition toolkit. It is intended for use by speech recognition researchers and professionals.
Deepspeech
It is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Botium Speech Processing
It is a unified, developer-friendly API to the best available Speech-To-Text and Text-To-Speech services.
Speechly
It can be used to complement any regular touch user interface with a real time voice user interface. It offers real time feedback for faster and more intuitive experience that enables end user to recover from possible errors quickly and with no interruptions.
wav2letter++
wav2letter++ is a fast open source speech processing toolkit from the Speech Team at Facebook AI Research. It is written entirely in C++ and uses the ArrayFire tensor library and the flashlight machine learning library for maximum efficiency. Our approach is detailed in this arXiv paper.
See all alternatives

SpeechPy's Followers
11 developers follow SpeechPy to keep up with related blogs and decisions.