Need advice about which tool to choose?Ask the StackShare community!

Kaldi

23
25
+ 1
0
SpeechPy

1
11
+ 1
0
Add tool

Kaldi vs SpeechPy: What are the differences?

What is Kaldi? Toolkit for speech recognition. It is a state-of-the-art automatic speech recognition toolkit. It is intended for use by speech recognition researchers and professionals.

What is SpeechPy? 💬A Library for Speech Processing and Recognition. The purpose of this project is to provide a package for speech processing and feature extraction. This library provides most frequent used speech features including MFCCs and filterbank energies alongside with the log-energy of filterbanks.

Kaldi and SpeechPy can be primarily classified as "Speech Recognition" tools.

Kaldi and SpeechPy are both open source tools. Kaldi with 9.38K GitHub stars and 4.17K forks on GitHub appears to be more popular than SpeechPy with 819 GitHub stars and 109 GitHub forks.

Get Advice from developers at your company using StackShare Enterprise. Sign up for StackShare Enterprise.
Learn More

What is Kaldi?

It is a state-of-the-art automatic speech recognition toolkit. It is intended for use by speech recognition researchers and professionals.

What is SpeechPy?

The purpose of this project is to provide a package for speech processing and feature extraction. This library provides most frequent used speech features including MFCCs and filterbank energies alongside with the log-energy of filterbanks.

Need advice about which tool to choose?Ask the StackShare community!

What companies use Kaldi?
What companies use SpeechPy?
    No companies found
    See which teams inside your own company are using Kaldi or SpeechPy.
    Sign up for StackShare EnterpriseLearn More

    Sign up to get full access to all the companiesMake informed product decisions

    What tools integrate with Kaldi?
    What tools integrate with SpeechPy?
      No integrations found
      What are some alternatives to Kaldi and SpeechPy?
      Deepspeech
      It is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
      Botium Speech Processing
      It is a unified, developer-friendly API to the best available Speech-To-Text and Text-To-Speech services.
      Speechly
      It can be used to complement any regular touch user interface with a real time voice user interface. It offers real time feedback for faster and more intuitive experience that enables end user to recover from possible errors quickly and with no interruptions.
      wav2letter++
      wav2letter++ is a fast open source speech processing toolkit from the Speech Team at Facebook AI Research. It is written entirely in C++ and uses the ArrayFire tensor library and the flashlight machine learning library for maximum efficiency. Our approach is detailed in this arXiv paper.
      LibreASR
      It is an On-Premises, Streaming Speech Recognition System built with PyTorch and fastai.
      See all alternatives