speech-to-text
speak into any text field.
A free, open source, and extensible speech-to-text application that works completely offline.
Handy is a cross-platform desktop application built with Tauri (Rust + React/TypeScript) that provides simple, privacy-focused speech transcription. Press a shortcut, speak, and have your words appear in any text field—all without sending your voice to the cloud.
Whispering is an open-source speech-to-text application. Press a keyboard shortcut, speak, and your words will transcribe, transform, then copy and paste at the cursor.
AI Call Analytics. Clean, annotate, and summarize call transcripts with GPT-4.5.
Open Source AI Calling Transcriptions, Summaries, and Analytics built on OpenAI Whisper.
Al powered voice to text.
Write 3x faster, without lifting a finger.
Related contents:
A desktop application that extracts YouTube playlist transcripts and enhances them using Google's Gemini AI models., the output is a book in any language you want.
handy cli tool to convert your speech to clipboard text.
This tool is designed to recognize speech in real-time, convert it to text, and automatically copy the text to the system clipboard. The tool leverages API services for speech recognition and uses Python libraries for audio capture and clipboard management.
Local Speech to Text in PHP made easy thanks to Whisper.cpp and OpenAI
Transcribe Audio and Video to Text.
Unlimited audio & video transcription. Convert audio and video to accurate text in seconds.
YouTube, Apple Podcast (and more) to readable Markdown.
yt2doc transcribes videos & audios online into readable Markdown documents.
Self-hosted AI audio transcription.
This is Scriberr, a self-hostable AI audio transcription app. Scriberr uses the open-source Whisper models from OpenAI, to transcribe audio files locally on your hardware. It uses the Whisper.cpp high-performance inference engine for OpenAI's Whisper. Scriberr also allows you to summarize transcripts using OpenAI's ChatGPT API, with your own custom prompts. Summarization using ollama is also supported.
A free & open tool for transcribing audio interviews.
oTranscribe is a free web app designed to take the pain out of transcribing recorded interviews.
say is always on, recording and transcribing your voice 24/7. Whenever inspiration strikes, just say it.
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift.
Type so fast, your boss will think there's 3 of you!
BetterDictation is your personal scribe. You speak, and it will quickly and flawless transcribe into any app.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Accurate AI Transcriptions in Minutes.
Web service proposing to transcribe video and/or audio content using AI
Offline audio transcription and translation.
Transcribe and translate audio offline on your personal computer. Powered by OpenAI's Whisper.
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.
Transcribe and translate any audio file.
Free, fast and accurate transcription of audio files. 100% free to use.
Good Tape is a transcription service for your interview tape. (available in french)
Coqui STT (frogSTT) is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. frogSTT is battle tested in both production and research rocket
Audio & Video Transcription | Speech-to-text. Smarter subtitling and transcription. We combine artificial and human intelligence to bring you accurate and fast transcripts, captions, and translated subtitles with ease.