speech-to-text
Whispering is an open-source speech-to-text application. Press a keyboard shortcut, speak, and your words will transcribe, transform, then copy and paste at the cursor.
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.
Privacy-first and free Speech-to-Text.
Murmure is an AI-powered, offline speech-to-text tool designed with Privacy first in mind and powered by NVIDIA Parakeet 🦜. Your voice always stays yours.
A desktop application that extracts YouTube playlist transcripts and enhances them using Google's Gemini AI models., the output is a book in any language you want.
Good Tape is a transcription service for your interview tape. (available in french)
Hold-to-talk speech-to-text for macOS. 100% local, powered by WhisperKit and local LLM cleanup. Hold Control to record, release to transcribe and paste.
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift.
Coqui STT (frogSTT) is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. frogSTT is battle tested in both production and research rocket
Offline audio transcription and translation.
Transcribe and translate audio offline on your personal computer. Powered by OpenAI's Whisper.
speak into any text field.
A free, open source, and extensible speech-to-text application that works completely offline.
Handy is a cross-platform desktop application built with Tauri (Rust + React/TypeScript) that provides simple, privacy-focused speech transcription. Press a shortcut, speak, and have your words appear in any text field—all without sending your voice to the cloud.
Related contents:
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
handy cli tool to convert your speech to clipboard text.
This tool is designed to recognize speech in real-time, convert it to text, and automatically copy the text to the system clipboard. The tool leverages API services for speech recognition and uses Python libraries for audio capture and clipboard management.
AI Call Analytics. Clean, annotate, and summarize call transcripts with GPT-4.5.
Open Source AI Calling Transcriptions, Summaries, and Analytics built on OpenAI Whisper.
Audio & Video Transcription | Speech-to-text. Smarter subtitling and transcription. We combine artificial and human intelligence to bring you accurate and fast transcripts, captions, and translated subtitles with ease.
Al powered voice to text.
Write 3x faster, without lifting a finger.
Related contents:
Petal is a native macOS menu bar app for fast, local-first audio transcription.
AI-Native macOS Launcher.
Command your Mac at the speed of thought.
Voice typing, text-to-speech, persistent memory, AI prompt, clipboard history, and thousands of extensions — all in one launcher built natively for macOS.
Local Speech to Text in PHP made easy thanks to Whisper.cpp and OpenAI
YouTube, Apple Podcast (and more) to readable Markdown.
yt2doc transcribes videos & audios online into readable Markdown documents.
Type so fast, your boss will think there's 3 of you!
BetterDictation is your personal scribe. You speak, and it will quickly and flawless transcribe into any app.
Self-hosted AI audio transcription.
This is Scriberr, a self-hostable AI audio transcription app. Scriberr uses the open-source Whisper models from OpenAI, to transcribe audio files locally on your hardware. It uses the Whisper.cpp high-performance inference engine for OpenAI's Whisper. Scriberr also allows you to summarize transcripts using OpenAI's ChatGPT API, with your own custom prompts. Summarization using ollama is also supported.
Related contents:
A free & open tool for transcribing audio interviews.
oTranscribe is a free web app designed to take the pain out of transcribing recorded interviews.
Transcribe and translate any audio file.
Free, fast and accurate transcription of audio files. 100% free to use.
sub-500ms latency phone agent orchestration. A voice agent framework in ~600 lines of Python.
Related contents:
say is always on, recording and transcribing your voice 24/7. Whenever inspiration strikes, just say it.
Transcribe Audio and Video to Text.
Unlimited audio & video transcription. Convert audio and video to accurate text in seconds.
A meeting note-taker that talks back.
OpenOats sits next to your call, transcribes both sides of the conversation in real time, and searches your own notes to surface things worth saying — right when you need them.
Fast and accurate automatic speech recognition (ASR) for edge devices.
Moonshine Voice is an open source AI toolkit for developers building real-time voice applications.
Accurate AI Transcriptions in Minutes.
Web service proposing to transcribe video and/or audio content using AI