audio
Hertz-dev is an open-source, first-of-its-kind base model for full-duplex conversational audio.
Make stems, instrumental, or acapella versions of any song! Isolate vocals, drums, bass, and other instrumental stems from any song.
StemRoller is the first free app which enables you to separate vocal and instrumental stems from any song with a single click! StemRoller uses Facebook's state-of-the-art Demucs algorithm for demixing songs and integrates search results from YouTube.
Festival, a music player.
Festival is a music player for local album collections.
Enjoy your music in a beautiful and modern UI. Beautiful and fast music streaming server.
Swing Music is a beautiful, self-hosted music player for your local audio files. Like a cooler Spotify ... but bring your own music.
A modern, self-hostable web spectrogram analyzer for your music library.
AudioDeck is a self-hosted web application that lets you visually analyze the audio quality of your music files. Think of it as a web-based, modern version of the classic Spek spectrogram analyzer.
Perfect for checking if your "high-quality" FLAC files are genuine or just upconverted low-quality MP3s.
Self-Hosted, Personal Music Server, designed for collectors and music maniacs.
Meelo is a self-hosted music server and web app. It works similarly to Plex, Jellyfin, Koel and Black Candy, but focuses on flexibility and on the browsing and listening experience. Meelo is designed for music collectors.
Room EQ Wizard Room Acoustics Software.
REW is free software for room acoustic measurement, loudspeaker measurement and audio device measurement. The audio measurement and analysis features of REW help you optimise the acoustics of your listening room, studio or home theater and find the best locations for your speakers, subwoofers and listening position. It includes tools for generating audio test signals; measuring SPL and impedance; measuring frequency and impulse responses; measuring distortion; generating phase, group delay and spectral decay plots, waterfalls, spectrograms and energy-time curves; generating real time analyser (RTA) plots; calculating reverberation times; calculating Thiele-Small parameters; determining the frequencies and decay times of modal resonances; displaying equaliser responses and automatically adjusting the settings of parametric equalisers to counter the effects of room modes and adjust responses to match a target curve.
A Material 3 YouTube Music client & local music player for Android. Forked from InnerTune.
Audio Editor.
AudioMass is a free, open source, web-based Audio and Waveform Editor. It runs entirely in the browser with no backend and no plugins required!
Automatic audio mastering plugin for live-streaming, podcasting and internet radio stations.
A lightweight Subsonic TUI music player built in Go with scrobbling support.
SubTUI is a lightweight TUI music player for Subsonic-compatible servers (Navidrome, Gonic, Airsonic, etc.) built with Go and the Bubble Tea framework. It uses mpv as the underlying audio engine, which supports multiple audio formats. It supports scrobbling, so your play counts are updated on your server and on any configured external services such as Last.fm or ListenBrainz.
High Fidelity Music Streaming. Listen. Discover. Repeat.
Hear your music in best-in-class sound.
Convert & compress everything in 2 clicks!
File Converter is a very simple tool which allows you to convert and compress one or several files using the context menu in Windows Explorer.
Analyze your microphone setup for free. Get advice on how to improve your microphone setup. We’ll make sure you sound podcast-ready.
Cast All The Things allows you to send videos from many, many online sources (YouTube, Vimeo, and a few hundred others) to your Chromecast. It also allows you to cast local files or render websites.
GUI for a Vocal Remover that uses Deep Neural Networks. This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 4-stem models).
A complete, cross-platform solution to record, convert and stream audio and video.
Related contents:
- FFmpeg By Example.
- FFmpeg devient fou et intègre l'IA Whisper d'OpenAI pour transcrire vos vidéos @ Korben :fr:.
- Chaining ffmpeg with a Browser Agent @ 100X Bot.
- FFmpeg - Comment normaliser le volume audio proprement avec loudnorm @ Korben :fr:.
- FFMPEG pour les nuls @ Korben :fr:.
- FFmpeg at Meta: Media Processing at Scale @ Engineering at Meta.
Buzz is a small but powerful JavaScript library that allows you to easily take advantage of the new HTML5 audio element. It degrades properly on non-modern browsers.
Say goodbye to proprietary music players filled with ads, tracking, and profiling. Nuclear empowers you to listen to what you want, where you want, and how you want, for free.
Nuclear is a free music streaming program that pulls content from various free sources.
This means that you can search for your favorite artists, albums, and songs, and the player will find information about them, as well as song streams, lyrics, music recommendations, and more, aggregating data from multiple sources.
Nuclear has no ads and no tracking.
These 16,016 BBC Sound Effects are made available by the BBC in WAV format to download for use under the terms of the RemArc Licence. The Sound Effects are BBC copyright, but they may be used for personal, educational or research purposes, as detailed in the license.
Transform your Raspberry Pi, or PC with Volumio's innovative software.
Providing the perfect backdrop for your musical journey. Our audio solutions, which include high-fidelity streaming software and music streamers, are crafted to deliver exceptional sound quality without compromising on user-friendly style.
Audio5js is a JavaScript library that provides a seamless compatibility layer to the HTML5 Audio playback API, with multiple codec support and a Flash-based MP3 playback fallback for older or unsupported browsers. The motivation for creating Audio5js is to provide a lightweight, library-agnostic, JavaScript-only interface for audio playback in the browser.
Lightweight volume notification for Linux.
Volnoti is a lightweight volume notification daemon for GNU/Linux and other POSIX operating systems. It is based on GTK+ and D-Bus and should work with any sensible window manager. The original aim was to create a volume notification daemon for lightweight window managers like LXDE or XMonad. It is known to work with a wide range of WMs, including GNOME, KDE, Xfce, LXDE, XMonad, i3 and many others. The source code is heavily based on the GNOME notification-daemon.
Mopidy is an extensible music server written in Python. It plays music from local disk, Spotify, SoundCloud, TuneIn, … Users can edit the playlist from any phone, tablet, or computer using a variety of MPD and web clients.
- Mopidy @ GitHub.
- Mopidy-MPD (Mopidy-MPD @ GitHub) is a Mopidy extension for controlling playback from MPD clients. It's a frontend that provides a full MPD server implementation to make Mopidy available from MPD clients.
Rust powered waveform source separation.
A native Rust implementation of HTDemucs v4 — state-of-the-art music source separation. Splits any song into individual stems (drums, bass, vocals, etc.) using GPU-accelerated inference via Burn.
Runs as a native CLI (Metal on macOS, Vulkan on Linux/Windows), entirely in the browser via WebAssembly + WebGPU, or as a DAW plugin (VST3/CLAP, macOS) — no server, no uploads, fully local.
Music Assistant is a music library manager for your offline and online music sources which can easily stream your favourite music to a wide range of supported players and be combined with the power of Home Assistant!
Music Assistant is a free, open-source media library manager that connects to your streaming services and a wide range of connected speakers. The server is the beating heart, the core of Music Assistant, and must run on an always-on device such as a Raspberry Pi, a NAS, an Intel NUC or similar.
CloudConvert is an online file converter. We support nearly all audio, video, document, ebook, archive, image, spreadsheet, and presentation formats. To get started, use the button below and select files to convert from your computer.
Automated Music Discovery and Collection Manager.
Bridge the gap between streaming services and your local music library. Automatically sync Spotify/Tidal/YouTube playlists to Plex/Jellyfin/Navidrome via Soulseek.
spotifyd is an open source Spotify client running as a UNIX daemon. It streams music just like the official client, but is more lightweight and supports more platforms. It also supports the Spotify Connect protocol, which allows the official clients to control it remotely.
Fast Music Remover is a lightweight tool designed to remove music, sound effects and noise from internet media. Processing takes about 8% of the original source length, which is under 5 seconds for a minute-long video!
🎚️ Open Source Audio Matching and Mastering. Matchering 2.0 is a novel Containerized Web Application and Python Library for audio matching and mastering.
It follows a simple idea: you take TWO audio files and feed them into Matchering. The algorithm matches the two tracks and gives you a mastered TARGET track with the same RMS, frequency response (FR), peak amplitude and stereo width as the REFERENCE track.
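As a rough illustration of the RMS part of this idea only, the Python sketch below scales a target signal so that its RMS level equals that of a reference. The function names `rms` and `match_rms` are hypothetical and are not Matchering's API; the real tool also matches frequency response, peak amplitude and stereo width, none of which this sketch attempts.

```python
import math


def rms(samples):
    """Root-mean-square level of a sequence of float samples."""
    if not samples:
        raise ValueError("samples must not be empty")
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def match_rms(target, reference):
    """Scale `target` so that its RMS equals the RMS of `reference`."""
    target_rms = rms(target)
    if target_rms == 0.0:
        raise ValueError("target is silent; no gain can match the reference")
    gain = rms(reference) / target_rms
    return [x * gain for x in target]


# Toy signals (assumed data): a quiet target and a louder reference.
target = [0.1, -0.1, 0.1, -0.1]
reference = [0.5, -0.5, 0.5, -0.5]
matched = match_rms(target, reference)
```

A single broadband gain like this changes loudness but not tonal balance, which is why a full matching tool needs the additional measurements listed above.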
ClearerVoice-Studio is an open-source, AI-powered speech processing toolkit designed for researchers, developers, and end-users. It provides capabilities of speech enhancement, speech separation, target speaker extraction, and more. The toolkit provides state-of-the-art pre-trained models, along with training and inference scripts, all accessible from this repository.
A GUI frontend for @werman's PulseAudio real-time noise suppression plugin. Cadmus is a graphical application which allows you to remove background noise from audio in real-time in any communication app. Cadmus adds a notification icon to your shell which allows you to easily select a microphone as a source, and subsequently creates a PulseAudio output which removes all recorded background noise (typing, ambient noise, etc). If you find the application useful, leave a star; it helps!
The Free Software Media System.
Jellyfin is the volunteer-built media solution that puts you in control of your media. Stream to any device from your own server, with no strings attached. Your media, your server, your way.
Audio waveform player JavaScript library.
Wavesurfer.js is an open-source audio visualization library for creating interactive, customizable waveforms.
Wavesurfer.js is an interactive waveform rendering and audio playback library, perfect for web applications. It leverages modern web technologies to provide a robust and visually engaging audio experience.
A platform for all your audio. Your music. Your podcasts. Enjoy anywhere, share with anyone.
Visualize audio with CAVA.
- 11 drawing modes!
- Set any single color, a gradient or an image for background and foreground.
- Configure smoothing, noise reduction and a few other CAVA settings.
Opus is a totally open, royalty-free, highly versatile audio codec. Opus is unmatched for interactive speech and music transmission over the Internet, but is also intended for storage and streaming applications. It is standardized by the Internet Engineering Task Force (IETF) as RFC 6716 which incorporated technology from Skype's SILK codec and Xiph.Org's CELT codec.
A cross-platform audio recording/playback CLI tool with TUI, written in Rust. The goal is to be an audio Swiss Army Knife (asak), like SoX but more interactive and fun.
Audio & Video Transcription | Speech-to-text. Smarter subtitling and transcription. We combine artificial and human intelligence to bring you accurate and fast transcripts, captions, and translated subtitles with ease.
Software synthesizer based on the SoundFont 2 specifications.
FluidSynth is a real-time software synthesizer based on the SoundFont 2 specifications and has reached widespread distribution. FluidSynth itself doesn't have a graphical user interface, but due to its powerful API several applications utilize it.
Transcribe Audio and Video to Text.
Unlimited audio & video transcription. Convert audio and video to accurate text in seconds.
Open Source Internet Radio Web Player App.
Discover PawTunes, The Ultimate HTML5 Internet Radio Player with Purrfect Visuals, Customizable Templates, and Clean Code. Built for Pros, Loved by Cats!
Here is a small museum of sounds that we no longer hear in our daily lives. Some were still part of our lives only a few years ago; others disappeared long ago, as if forgotten. Like a physical museum, this page aims to gather this collective sound heritage and make it accessible to everyone through a selection.
MMAudio generates synchronized audio given video and/or text inputs.
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis.
Your online audio toolkit. A collection of easy-to-use web tools for all your audio files.
Lyrion Music Server (formerly Logitech Media Server) is open-source server software which controls a wide range of Squeezebox audio players. Lyrion can stream your local music collection, internet radio stations, and content from many streaming services (with and without subscriptions).
Real-time microphone noise suppression on Linux. NoiseTorch-ng is an easy-to-use open source application for Linux with PulseAudio or PipeWire. It creates a virtual microphone that suppresses noise in any application. Use whichever conferencing or VoIP application you like and simply select the filtered Virtual Microphone as input to torch the sound of your mechanical keyboard, computer fans, trains and the like.
Audio Production Without Limits.
REAPER is a complete digital audio production application for computers, offering a full multitrack audio and MIDI recording, editing, processing, mixing and mastering toolset.
The Internet Archive is now offering over a million items from select collections and all new community uploads via torrents.
Multi-track audio editor for the web. Built with React & Tone.js.
Multitrack Web Audio editor and player with canvas waveform preview. Set cues, fades and shift multiple tracks in time. Record audio tracks or provide audio annotations. Export your mix to AudioBuffer or WAV! Add effects from Tone.js. Project inspired by Audacity.
Raw microphone recordings into broadcast-ready audio in one command. No configuration, and no surprises🕺
Your files emerge at -18 LUFS, the podcast/broadcast standard, with room rumble, background hiss, clicks, and harsh sibilance sorted automatically. Everything needed is embedded in the binary. This is not how audio tools usually work, and that is rather the point.
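To make the -18 LUFS figure concrete, here is a minimal Python sketch of the arithmetic behind a static loudness correction. It assumes the integrated loudness of a file has already been measured by some ITU-R BS.1770 meter (the measurement itself is not shown), and the helper names are hypothetical. It is not a description of how this tool works internally, which the text above does not specify.

```python
TARGET_LUFS = -18.0  # target loudness quoted in the description above


def gain_db_to_target(measured_lufs, target_lufs=TARGET_LUFS):
    """Gain in dB that moves a measured loudness to the target loudness."""
    return target_lufs - measured_lufs


def db_to_linear(gain_db):
    """Convert a gain in dB to a linear amplitude multiplier."""
    return 10.0 ** (gain_db / 20.0)


# Assumed example: a recording measured at -24 LUFS needs +6 dB of gain.
gain_db = gain_db_to_target(-24.0)
scale = db_to_linear(gain_db)  # multiply every sample by this factor
```

A static gain only shifts overall loudness; cleaning up rumble, hiss, clicks and sibilance requires separate processing.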
Good Tape is a transcription service for your interview tape. (available in French)
AirPods liberated from Apple's ecosystem.
LibrePods unlocks Apple's exclusive AirPods features on non-Apple devices. Get access to noise control modes, adaptive transparency, ear detection, hearing aid, customized transparency mode, battery status, and more - all the premium features you paid for but Apple locked to their ecosystem.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
🎧 MPRIS media player command-line controller for VLC, mpv, Rhythmbox, web browsers, cmus, mpd, Spotify and others.
Playerctl is a command-line utility and library for controlling media players that implement the MPRIS D-Bus Interface Specification. Playerctl makes it easy to bind player actions, such as play and pause, to media keys. You can also get metadata about the playing track such as the artist and title for integration into statusline generators or other command-line tools.
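For statusline integration, the Python sketch below shows one way to call playerctl from a script. It relies on the `playerctl metadata --format` subcommand with an `{{ artist }} - {{ title }}` template; the wrapper functions `playerctl_command` and `now_playing` are hypothetical helpers written for this example, and the sketch assumes playerctl is installed and an MPRIS-capable player is running.

```python
import shutil
import subprocess


def playerctl_command(*args):
    """Build the argument list for a playerctl invocation."""
    return ["playerctl", *args]


def now_playing():
    """Return 'artist - title' for the current track, or None if unavailable."""
    if shutil.which("playerctl") is None:
        return None  # playerctl is not installed
    try:
        result = subprocess.run(
            playerctl_command("metadata", "--format", "{{ artist }} - {{ title }}"),
            capture_output=True,
            text=True,
            timeout=5,
            check=False,
        )
    except subprocess.TimeoutExpired:
        return None
    if result.returncode != 0:
        return None  # for example, no player is running
    return result.stdout.strip()


if __name__ == "__main__":
    print(now_playing() or "nothing playing")
```

Binding a media key works the same way: the key handler would run the list returned by `playerctl_command("play-pause")`.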
AUDIO DATA TRANSMISSION. Send data with sound.
This application allows you to transmit and receive data through sound. It uses a simple encoding scheme to convert text into audio frequencies, which can be played through your speakers and picked up by a microphone.
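The application's actual encoding scheme is not described here, so the following Python sketch is only one plausible way to turn text into audio frequencies and back. It assumes a simple frequency-shift scheme in which each 4-bit half of a byte is sent as one 20 ms sine tone, and it detects tones with the Goertzel algorithm. All constants and function names are assumptions made for this illustration.

```python
import math

SAMPLE_RATE = 8000     # samples per second (assumed)
SYMBOL_SAMPLES = 160   # 20 ms per tone (assumed)
BASE_HZ = 1000.0       # frequency for nibble value 0 (assumed)
STEP_HZ = 100.0        # spacing between adjacent nibble frequencies (assumed)


def nibble_to_freq(nibble):
    """Map a 4-bit value (0..15) to its tone frequency in Hz."""
    return BASE_HZ + STEP_HZ * nibble


def encode(text):
    """Turn text into a list of float samples, two tones per UTF-8 byte."""
    samples = []
    for byte in text.encode("utf-8"):
        for nibble in (byte >> 4, byte & 0x0F):
            freq = nibble_to_freq(nibble)
            samples.extend(
                math.sin(2 * math.pi * freq * i / SAMPLE_RATE)
                for i in range(SYMBOL_SAMPLES)
            )
    return samples


def tone_power(chunk, freq):
    """Goertzel algorithm: signal power of `chunk` at a single frequency."""
    coeff = 2 * math.cos(2 * math.pi * freq / SAMPLE_RATE)
    s1 = s2 = 0.0
    for x in chunk:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


def decode(samples):
    """Recover text by picking the strongest candidate tone in each symbol."""
    nibbles = []
    for start in range(0, len(samples) - SYMBOL_SAMPLES + 1, SYMBOL_SAMPLES):
        chunk = samples[start:start + SYMBOL_SAMPLES]
        nibbles.append(
            max(range(16), key=lambda n: tone_power(chunk, nibble_to_freq(n)))
        )
    data = bytes((hi << 4) | lo for hi, lo in zip(nibbles[0::2], nibbles[1::2]))
    return data.decode("utf-8")


message = decode(encode("hello"))
```

The sketch works on a clean in-memory signal. A real speaker-to-microphone link would also need symbol synchronization and some tolerance for noise and echo, which are left out here.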