handy cli tool to convert your speech to clipboard text.
This tool is designed to recognize speech in real-time, convert it to text, and automatically copy the text to the system clipboard. The tool leverages API services for speech recognition and uses Python libraries for audio capture and clipboard management.
Transcribe Audio and Video to Text.
Unlimited audio & video transcription. Convert audio and video to accurate text in seconds.
YouTube, Apple Podcast (and more) to readable Markdown.
yt2doc transcribes videos & audios online into readable Markdown documents.
Self-hosted AI audio transcription.
This is Scriberr, a self-hostable AI audio transcription app. Scriberr uses the open-source Whisper models from OpenAI, to transcribe audio files locally on your hardware. It uses the Whisper.cpp high-performance inference engine for OpenAI's Whisper. Scriberr also allows you to summarize transcripts using OpenAI's ChatGPT API, with your own custom prompts. Summarization using ollama is also supported.
A free & open tool for transcribing audio interviews.
oTranscribe is a free web app designed to take the pain out of transcribing recorded interviews.
say is always on, recording and transcribing your voice 24/7. Whenever inspiration strikes, just say it.
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift.
Type so fast, your boss will think there's 3 of you!
BetterDictation is your personal scribe. You speak, and it will quickly and flawless transcribe into any app.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Offline audio transcription and translation.
Transcribe and translate audio offline on your personal computer. Powered by OpenAI's Whisper.
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.
Transcribe and translate any audio file.
Free, fast and accurate transcription of audio files. 100% free to use.
Coqui STT (frogSTT) is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. frogSTT is battle tested in both production and research rocket
Audio & Video Transcription | Speech-to-text.
Smarter subtitling and transcription.
We combine artificial and human intelligence to bring you accurate and fast transcripts, captions, and translated subtitles with ease.