StackShareStackShare
Follow on
StackShare

Discover and share technology stacks from companies around the world.

Follow on

© 2025 StackShare. All rights reserved.

Product

  • Stacks
  • Tools
  • Feed

Company

  • About
  • Contact

Legal

  • Privacy Policy
  • Terms of Service
  1. Stackups
  2. AI
  3. Voice & Audio Models
  4. Speech Recognition Tools
  5. Deepspeech vs SpeechPy

Deepspeech vs SpeechPy

OverviewComparisonAlternatives

Overview

SpeechPy
SpeechPy
Stacks1
Followers11
Votes0
GitHub Stars884
Forks105
Deepspeech
Deepspeech
Stacks9
Followers5
Votes0
GitHub Stars26.6K
Forks4.1K

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs
CLI (Node.js)
or
Manual

Detailed Comparison

SpeechPy
SpeechPy
Deepspeech
Deepspeech

The purpose of this project is to provide a package for speech processing and feature extraction. This library provides most frequent used speech features including MFCCs and filterbank energies alongside with the log-energy of filterbanks.

It is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Mel Frequency Cepstral Coefficients(MFCCs);Filterbank Energies;Log Filterbank Energies
Open source; Tensorflow based; Mozilla project
Statistics
GitHub Stars
884
GitHub Stars
26.6K
GitHub Forks
105
GitHub Forks
4.1K
Stacks
1
Stacks
9
Followers
11
Followers
5
Votes
0
Votes
0
Integrations
Python
Python
Linux
Linux
Windows
Windows
macOS
macOS
Android OS
Android OS

What are some alternatives to SpeechPy, Deepspeech?

Speechly

Speechly

It can be used to complement any regular touch user interface with a real time voice user interface. It offers real time feedback for faster and more intuitive experience that enables end user to recover from possible errors quickly and with no interruptions.

Kaldi

Kaldi

It is a state-of-the-art automatic speech recognition toolkit. It is intended for use by speech recognition researchers and professionals.

Botium Speech Processing

Botium Speech Processing

It is a unified, developer-friendly API to the best available Speech-To-Text and Text-To-Speech services.

wav2letter++

wav2letter++

wav2letter++ is a fast open source speech processing toolkit from the Speech Team at Facebook AI Research. It is written entirely in C++ and uses the ArrayFire tensor library and the flashlight machine learning library for maximum efficiency. Our approach is detailed in this arXiv paper.

LibreASR

LibreASR

It is an On-Premises, Streaming Speech Recognition System built with PyTorch and fastai.

WhisperFusion

WhisperFusion

It builds upon the capabilities of the WhisperLive and WhisperSpeech by integrating Mistral, a Large Language Model (LLM), on top of the real-time speech-to-text pipeline. Both LLM and Whisper are optimized to run efficiently as TensorRT engines, maximizing performance and real-time processing capabilities.

Writeout.ai

Writeout.ai

Transcribe and translate audio files using OpenAI's Whisper API. You can upload any audio file, and the application will send it through the OpenAI Whisper API using Laravel's queued jobs. Translation makes use of the new OpenAI Chat API and chunks the generated VTT file into smaller parts to fit them into the prompt context limit.

Related Comparisons

Postman
Swagger UI

Postman vs Swagger UI

Mapbox
Google Maps

Google Maps vs Mapbox

Mapbox
Leaflet

Leaflet vs Mapbox vs OpenLayers

Twilio SendGrid
Mailgun

Mailgun vs Mandrill vs SendGrid

Runscope
Postman

Paw vs Postman vs Runscope