This project provides a package for speech processing and feature extraction. The library provides the most frequently used speech features, including MFCCs and filterbank energies, along with the log-energy of filterbanks. | It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification. |
Mel Frequency Cepstral Coefficients (MFCCs); Filterbank Energies; Log Filterbank Energies | Automatic speech recognition; Trained on a large dataset of diverse audio; Multi-task model; Can perform multilingual speech recognition; Can perform speech translation and language identification |
Statistics | |
GitHub Stars 884 | GitHub Stars 90.3K |
GitHub Forks 105 | GitHub Forks 11.3K |
Stacks 1 | Stacks 24 |
Followers 11 | Followers 28 |
Votes 0 | Votes 1 |
Integrations | |
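
For readers unfamiliar with the filterbank energies and log filterbank energies listed in the feature row above, here is a minimal standard-library sketch of how such features are commonly computed for a single frame. It is not the API of either tool being compared. The function names (`hz_to_mel`, `mel_filterbank`, `log_filterbank_energies`), the 8 kHz sample rate, the 256-sample frame, and the choice of 10 filters are all assumptions made for illustration, and the mel formula used is one of several conventions in circulation.

```python
import cmath
import math


def hz_to_mel(hz):
    # One widely used mel-scale formula (an assumption; other variants exist).
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    # Naive DFT over the non-negative frequency bins; fine for a short
    # illustrative frame, but real code would use an FFT.
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(abs(acc) ** 2 / n)
    return spectrum


def mel_filterbank(num_filters, frame_len, sample_rate):
    # Triangular filters whose centres are evenly spaced on the mel scale.
    top = hz_to_mel(sample_rate / 2.0)
    hz_points = [mel_to_hz(top * i / (num_filters + 1)) for i in range(num_filters + 2)]
    bin_hz = [k * sample_rate / frame_len for k in range(frame_len // 2 + 1)]
    filters = []
    for j in range(1, num_filters + 1):
        lo, mid, hi = hz_points[j - 1], hz_points[j], hz_points[j + 1]
        row = []
        for f in bin_hz:
            if lo < f <= mid:
                row.append((f - lo) / (mid - lo))   # rising edge
            elif mid < f < hi:
                row.append((hi - f) / (hi - mid))   # falling edge
            else:
                row.append(0.0)
        filters.append(row)
    return filters


def log_filterbank_energies(frame, sample_rate, num_filters=10):
    spec = power_spectrum(frame)
    bank = mel_filterbank(num_filters, len(frame), sample_rate)
    energies = [sum(w * p for w, p in zip(row, spec)) for row in bank]
    # Floor the energies so that empty bands do not produce log(0).
    return [math.log(max(e, 1e-12)) for e in energies]


# Illustrative input: a 1 kHz sine sampled at 8 kHz (assumed values).
sample_rate = 8000
frame = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(256)]
features = log_filterbank_energies(frame, sample_rate)
```

With these assumed settings, the largest of the ten log energies falls in the band whose triangle covers 1 kHz. MFCCs are typically obtained by applying a discrete cosine transform to log filterbank energies like these; that step is omitted here.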

It can be used to complement any regular touch user interface with a real-time voice user interface. It offers real-time feedback for a faster and more intuitive experience that lets end users recover from possible errors quickly and without interruption.

It is the base model weights and network architecture of Grok-1, the large language model. Grok-1 is a 314 billion parameter Mixture-of-Experts model trained from scratch by xAI.

It is Google’s largest and most capable AI model. Built to be multimodal, it can generalize, understand, operate across, and combine different types of information, such as text, images, audio, video, and code.

It is a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI.

Voibe is an offline voice dictation app for macOS that lets you write at the speed of thought. It works everywhere (Mail, Notes, Browsers, Slack, VS Code, ChatGPT, etc.), making it easy to draft messages, capture ideas, and produce long content without breaking concentration.

Creating safe artificial general intelligence that benefits all of humanity. Our work to create safe and beneficial AI requires a deep understanding of the potential risks and benefits, as well as careful consideration of the impact.

It is a next-generation AI assistant, accessible through a chat interface and an API. It is capable of a wide variety of conversational and text-processing tasks while maintaining a high degree of reliability and predictability.