Biapy's Bookmarks

OpenSuperWhisper

https://github.com/starmel/OpenSuperWhisper

OpenSuperWhisper is a macOS application that provides real-time audio transcription using the Whisper model. It offers a seamless way to record and transcribe audio with customizable settings and keyboard shortcuts.

Related contents:

L8 Principal's Agentic Engineering Workflow @ Kun Chen's YouTube.

audio-transcription foss macos mit-licensed open-source software speech-to-text whisper

Added 3 weeks ago

Violin

https://www.violin-ai.com/

Video Narrator. Open-source Video Translation Skill.

Upload a video. Violin transcribes the speech, translates it, synthesizes a native-sounding voice-over in the target language, and remuxes it back into the video — fully aligned, with optional SRT subtitles.

Violin @ GitHub.

audio-transcription foss mit-licensed open-source speech-to-text video

Added 1 month ago

Ghost Pepper

https://github.com/matthartman/ghost-pepper

Hold-to-talk speech-to-text for macOS. 100% local, powered by WhisperKit and local LLM cleanup. Hold Control to record, release to transcribe and paste.

apple-silicon foss macos mit-licensed open-source software speech-to-text whisper

Added 3 months ago

SuperCmd

https://supercmd.sh/

AI-Native macOS Launcher.

Command your Mac at the speed of thought.

Voice typing, text-to-speech, persistent memory, AI prompt, clipboard history, and thousands of extensions — all in one launcher built natively for macOS.

SuperCmd @ GitHub.

ai launcher macos raycast software source-available speech-to-text text-to-speech

Added 3 months ago

Petal

https://github.com/Aayush9029/petal

Petal is a native macOS menu bar app for fast, local-first audio transcription.

audio-transcription foss macos mit-licensed open-source software speech-to-text

Added 4 months ago

OpenOats

https://github.com/yazinsai/OpenOats

A meeting note-taker that talks back.

OpenOats sits next to your call, transcribes both sides of the conversation in real time, and searches your own notes to surface things worth saying — right when you need them.

audio-transcription foss macos meeting mit-licensed open-source software speech-to-text

Added 4 months ago

shuo 说

https://github.com/NickTikhonov/shuo

sub-500ms latency phone agent orchestration. A voice agent framework in ~600 lines of Python.

Related contents:

How I built a sub-500ms latency voice agent from scratch @ Nick Tikhonov.

ai foss mit-licensed open-source speech-recognition speech-to-text

Added 4 months ago

Moonshine Voice

https://github.com/moonshine-ai/moonshine

Fast and accurate automatic speech recognition (ASR) for edge devices.

Moonshine Voice is an open source AI toolkit for developers building real-time voice applications.

foss mit-licensed open-source speech-recognition speech-to-text toolkit voice

Added 4 months ago

Murmure

https://www.murmure.app/

Privacy-first and free Speech-to-Text.

Murmure is an AI-powered, offline speech-to-text tool designed with Privacy first in mind and powered by NVIDIA Parakeet 🦜. Your voice always stays yours.

Murmure @ GitHub.

ai foss gpl3-licensed linux macos nvidia open-source software speech-to-text windows

Added 8 months ago

Handy

https://handy.computer/

speak into any text field.

A free, open source, and extensible speech-to-text application that works completely offline.

Handy is a cross-platform desktop application built with Tauri (Rust + React/TypeScript) that provides simple, privacy-focused speech transcription. Press a shortcut, speak, and have your words appear in any text field—all without sending your voice to the cloud.

Handy @ GitHub.

Related contents:

Handy - Un outil de reconnaissance vocale incroyable (et open source) @ Korben :fr:.

foss linux macos mit-licensed open-source software speech-recognition speech-to-text windows

Added 9 months ago

Monologue

https://www.monologue.to/

Speech to text. talk to the computer.

Related contents:

I Started Talking to My Computer Instead of Typing. It Changed How I Think. @ Working Overtime's Every.

ai macos software speech-recognition speech-to-text

Added 10 months ago

Whispering

https://github.com/epicenter-so/epicenter/tree/main/apps/whispering

Whispering is an open-source speech-to-text application. Press a keyboard shortcut, speak, and your words will transcribe, transform, then copy and paste at the cursor.

foss linux macos mit-licensed open-source software speech-recognition speech-to-text windows

Added 11 months ago

Shinar

https://github.com/Chivo-Systems/Shinar/

AI Call Analytics. Clean, annotate, and summarize call transcripts with GPT-4.5.

Open Source AI Calling Transcriptions, Summaries, and Analytics built on OpenAI Whisper.

audio-transcription data-analytics foss gpl3-licensed open-source self-hosted speech-to-text web-app

Added 1 year ago

superwhisper

https://superwhisper.com/

Al powered voice to text.

Write 3x faster, without lifting a finger.

Related contents:

Vibe Coding and the Future of Software Engineering @ Alex P.

ai commercial ios macos software speech-recognition speech-to-text whisper windows

Added 1 year ago

YouTube Playlist Processor using Gemini API

https://github.com/Ebrizzzz/Youtube-playlist-to-formatted-text

A desktop application that extracts YouTube playlist transcripts and enhances them using Google's Gemini AI models., the output is a book in any language you want.

foss open-source software speech-to-text transcription youtube

Added 1 year ago

asr2clip

https://github.com/Oaklight/asr2clip

handy cli tool to convert your speech to clipboard text.

This tool is designed to recognize speech in real-time, convert it to text, and automatically copy the text to the system clipboard. The tool leverages API services for speech recognition and uses Python libraries for audio capture and clipboard management.

clipboard command-line foss open-source python speech-to-text

Added 1 year ago

whisper.php

https://github.com/CodeWithKyrian/whisper.php

Local Speech to Text in PHP made easy thanks to Whisper.cpp and OpenAI

foss library open-source php speech-to-text whisper

Added 1 year ago

TurboScribe

https://turboscribe.ai/

Transcribe Audio and Video to Text.

Unlimited audio & video transcription. Convert audio and video to accurate text in seconds.

Episode 590: Self-Host Before You're Toast @ Linux Unplugged.

audio commercial speech-to-text transcription video web-service

Added 1 year ago

yt2doc

https://github.com/shun-liang/yt2doc

YouTube, Apple Podcast (and more) to readable Markdown.

yt2doc transcribes videos & audios online into readable Markdown documents.

yt2doc - Pour transcrire vos vidéos en documents Markdown @ Korben :fr:.

audio-transcription command-line foss markdown open-source podcast python speech-to-text transcription youtube

Added 1 year ago

Scriberr

https://scriberr.app/

Self-hosted AI audio transcription.

This is Scriberr, a self-hostable AI audio transcription app. Scriberr uses the open-source Whisper models from OpenAI, to transcribe audio files locally on your hardware. It uses the Whisper.cpp high-performance inference engine for OpenAI's Whisper. Scriberr also allows you to summarize transcripts using OpenAI's ChatGPT API, with your own custom prompts. Summarization using ollama is also supported.

Scriberr @ GitHub.

Related contents:

Scriberr - La transcription IA qui reste chez vous @ Korben :fr:.

ai foss machine-learning mit-licensed open-source self-hosted speech-to-text transcription web-app whisper

Added 1 year ago

oTranscribe

https://otranscribe.com/

A free & open tool for transcribing audio interviews.

oTranscribe is a free web app designed to take the pain out of transcribing recorded interviews.

oTranscribe @ GitHub.

audio-transcription machine-learning open-source python speech-to-text web-service

Added 1 year ago

say

https://github.com/8ta4/say

say is always on, recording and transcribing your voice 24/7. Whenever inspiration strikes, just say it.

llm machine-learning macos recording speech-recognition speech-to-text

Added 1 year ago

sherpa-onnx

https://k2-fsa.github.io/sherpa/onnx/index.html

Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift.

sherpa-onnx @ GitHub.

kaldi machine-learning open-source speech-to-text text-to-speech

Added 2 years ago

Whisper.c++

https://github.com/ggerganov/whisper.cpp

Port of OpenAI's Whisper model in C/C++

ai machine-learning openai open-source optimization speech-to-text whisper

Added 2 years ago

BetterDictation.com

https://betterdictation.com/

Type so fast, your boss will think there's 3 of you!

BetterDictation is your personal scribe. You speak, and it will quickly and flawless transcribe into any app.

S4E10 - Quel destin pour l’Apple Vision Pro ? @ Underscore_'s Acast :fr:.

ai commercial desktop macos speech-to-text

Added 2 years ago

Distil-Whisper

https://github.com/huggingface/distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

ai machine-learning open-source speech-recognition speech-to-text whisper

Added 2 years ago

Otter.ai

https://otter.ai/

Voice Meeting Notes & Real-time Transcription

ai audio-transcription commercial machine-learning real-time speech-to-text web-service

Added 2 years ago

AI Transcriptions by Riverside

https://riverside.fm/transcription

Accurate AI Transcriptions in Minutes.

Web service proposing to transcribe video and/or audio content using AI

ai machine-learning speech-recognition speech-to-text web-service

Added 3 years ago

Buzz Captions

https://buzzcaptions.com/

Offline audio transcription and translation.

Transcribe and translate audio offline on your personal computer. Powered by OpenAI's Whisper.

Buzz Captions @ GitHub.

audio-transcription machine-learning openai speech-to-text whisper

Added 3 years ago

Whisper

https://openai.com/index/whisper/

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

Whisper @ GitHub.

machine-learning openai open-source python speech-recognition speech-to-text

Added 3 years ago

writeout.ai – Transcribe and translate any audio file

https://writeout.ai/

Transcribe and translate any audio file.

Free, fast and accurate transcription of audio files. 100% free to use.

writeout.ai @ GitHub

ai machine-learning open-source self-hosted speech-to-text voice web-app web-service

Added 3 years ago

Good Tape

https://www.mygoodtape.com/

Good Tape is a transcription service for your interview tape. (available in french)

audio audio-transcription speech-to-text web-service

Added 3 years ago

Coqui STT

https://coqui.ai/

Coqui STT (frogSTT) is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. frogSTT is battle tested in both production and research rocket

Coqui STT @ GitHub.

ai deep-learning development machine-learning speech-to-text

Added 3 years ago

Amberscript

https://www.amberscript.com/en/

Audio & Video Transcription | Speech-to-text. Smarter subtitling and transcription. We combine artificial and human intelligence to bring you accurate and fast transcripts, captions, and translated subtitles with ease.

accessibility ai audio speech-recognition speech-to-text transcription video web-service

Added 3 years ago