StackShareStackShare
Follow on
StackShare

Discover and share technology stacks from companies around the world.

Follow on

© 2025 StackShare. All rights reserved.

Product

  • Stacks
  • Tools
  • Feed

Company

  • About
  • Contact

Legal

  • Privacy Policy
  • Terms of Service
  1. Stackups
  2. AI
  3. Voice & Audio Models
  4. Speech Recognition Tools
  5. Whisper vs SpeechPy

Whisper vs SpeechPy

OverviewComparisonAlternatives

Overview

SpeechPy
SpeechPy
Stacks1
Followers11
Votes0
GitHub Stars884
Forks105
Whisper
Whisper
Stacks24
Followers28
Votes1
GitHub Stars90.3K
Forks11.3K

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs
CLI (Node.js)
or
Manual

Detailed Comparison

SpeechPy
SpeechPy
Whisper
Whisper

The purpose of this project is to provide a package for speech processing and feature extraction. This library provides most frequent used speech features including MFCCs and filterbank energies alongside with the log-energy of filterbanks.

It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

Mel Frequency Cepstral Coefficients(MFCCs);Filterbank Energies;Log Filterbank Energies
Automatic speech recognition; Trained on a large dataset of diverse audio; Multi-task model; Can perform multilingual speech recognition; Can perform speech translation and language identification
Statistics
GitHub Stars
884
GitHub Stars
90.3K
GitHub Forks
105
GitHub Forks
11.3K
Stacks
1
Stacks
24
Followers
11
Followers
28
Votes
0
Votes
1
Integrations
Python
Python
PyTorch
PyTorch
Python
Python

What are some alternatives to SpeechPy, Whisper?

Speechly

Speechly

It can be used to complement any regular touch user interface with a real time voice user interface. It offers real time feedback for faster and more intuitive experience that enables end user to recover from possible errors quickly and with no interruptions.

Grok-1

Grok-1

It is the base model weights and network architecture of Grok-1, the large language model. Grok-1 is a 314 billion parameter Mixture-of-Experts model trained from scratch by xAI.

Google Gemini

Google Gemini

It is Google’s largest and most capable AI model. It is built to be multimodal, it can generalize, understand, operate across, and combine different types of info — like text, images, audio, video, and code.

LLaMA

LLaMA

It is a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI.

Grok 4

Grok 4

Try Grok 4 on GPT Proto. Access xAI’s most advanced 1.7T LLM with 130K context, multimodal support, and real-time data integration for dynamic analysis.

FlowEngine

FlowEngine

Build n8n workflows with AI and deploy in 30 seconds. Free hosting, workflow analyzer, and 100+ LLM models included.

OpenAI

OpenAI

Creating safe artificial general intelligence that benefits all of humanity. Our work to create safe and beneficial AI requires a deep understanding of the potential risks and benefits, as well as careful consideration of the impact.

Claude

Claude

It is a next-generation AI assistant. It is accessible through chat interface and API. It is capable of a wide variety of conversational and text-processing tasks while maintaining a high degree of reliability and predictability.

GPT-4 by OpenAI

GPT-4 by OpenAI

It is a large multimodal model (accepting text inputs and emitting text outputs today, with image inputs coming in the future) that can solve difficult problems with greater accuracy than any of our previous models, thanks to its broader general knowledge and advanced reasoning capabilities.

Cohere.com

Cohere.com

It offers an API to add cutting-edge language processing to any system. Through training, users can create massive models customized to their use case and trained on their data.

Related Comparisons

Postman
Swagger UI

Postman vs Swagger UI

Mapbox
Google Maps

Google Maps vs Mapbox

Mapbox
Leaflet

Leaflet vs Mapbox vs OpenLayers

Twilio SendGrid
Mailgun

Mailgun vs Mandrill vs SendGrid

Runscope
Postman

Paw vs Postman vs Runscope