Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
We made AudioKit open-source because we believe that clear, powerful audio development is best developed and maintained through a large, active base of developers and users. Our core code, tests, examples, and website are all available for contributions. | Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API. The API recognizes over 80 languages and variants, to support your global user base. |
Well-Named Classes and Parameters;Sensible Defaults;Tight Xcode Integration;Easy Installation;Clear Documentation and Common File Templates;Powerful Sequences and Phrases | Over 80 Languages;Return Text Results In Real-Time;Accurate In Noisy Environments;Powered by Machine Learning |
Statistics | |
GitHub Stars 11.2K | GitHub Stars - |
GitHub Forks 1.6K | GitHub Forks - |
Stacks 19 | Stacks 39 |
Followers 32 | Followers 74 |
Votes 0 | Votes 1 |
Pros & Cons | |
No community feedback yet | Pros
|

TalkAny—Free AI Speaking Practice Platform. Practice English/Chinese speaking with AI 24/7; no partner needed. Get real-time grammar correction, pronunciation feedback, and natural expression tips. Perfect for IELTS, TOEFL, DET exam prep, daily conversation, and job interviews. Zero pressure, unlimited practice. Start speaking now!

Transcribe and translate speech in over 60 languages, in real-time, with high accuracy.

All-in-one content studio — easily create any photo, video or audio clip with AI. Affordable, easy to use and featuring the latest AI models.

VibeMusicing is an AI music tool that creates original songs, lyrics, and beats instantly—fast, customizable, and royalty-free for all types of creators.

Build AI video, image, and audio pipelines with a simple composable API

AI note taking app that transforms voice recordings, text, images, audio files and videos into clear, summarized notes for meetings, lectures, journals, and more.

Transcribe phone calls or build voice powered apps. Recognize unlimited industry specific words and phrases without any training required. All at simple, affordable pricing.

Deepgram helps you harness the potential of your voice data with intelligent speech models built to scale and continuously improve over time. The API is the gateway to Deepgram's Brain AI models, and gives you customizable access to fast, high accuracy transcription and phonetic search. Deepgram Brain can understand nearly every audio format available.

It is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Produce high quality recordings without having to shell out thousands of dollars for equipment. The only thing you need is your guitar, your computer, and a digital audio workstation.