Top Python Frameworks & Libraries for audio

Audio fingerprinting and recognition in Python

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Manipulate audio with a simple and easy high level interface

Automagically synchronize subtitles with video.

Python library for audio and music analysis

Code for the paper "Jukebox: A Generative Model for Music"

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Defeating Google's audio reCaptcha with 85% accuracy.

MusicBrainz Picard audio file tagger

:musical_note: :rainbow: Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi...

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)...

Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices

:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

Specifications and tools for 360º video and spatial audio.

Data manipulation and transformation for audio signal processing, powered by PyTorch

Finding the genre of a song with Deep Learning

WaveGAN: Learn to synthesize raw audio with generative adversarial networks

A GUI frontend for @werman's Pulse Audio real-time noise suppression plugin

A video, audio and podcast publication platform written in Python.

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding...

A modern audio book player for Linux using GTK+ 3

An asynchronized Python library to automate solving ReCAPTCHA v2 using audio

SincNet is a neural architecture for efficiently processing raw audio samples.

Python module for handling audio metadata

A captcha library that generates audio and image CAPTCHAs.

kapre: Keras Audio Preprocessors

Python audio and music signal processing library

Awesome pre-trained models toolkit based on PaddlePaddle.(300+ models including Image, Text, Audio and Video with Easy Inference & Serving deployment)...

The PyTorch-based audio source separation toolkit for researchers

We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mi...

Audio MODEM Communication Library in Python

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in ind...

A data augmentations library for audio, image, text, and video.