Audio fingerprinting and recognition in Python
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Manipulate audio with a simple and easy high-level interface
Automagically synchronize subtitles with video.
Python library for audio and music analysis
Code for the paper "Jukebox: A Generative Model for Music"
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
Defeating Google's audio reCaptcha with 85% accuracy.
MusicBrainz Picard audio file tagger
Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi...
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)...
Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices
Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Specifications and tools for 360° video and spatial audio.
Data manipulation and transformation for audio signal processing, powered by PyTorch
Finding the genre of a song with Deep Learning
WaveGAN: Learn to synthesize raw audio with generative adversarial networks
Creates audio supercuts.
A GUI frontend for @werman's PulseAudio real-time noise suppression plugin
Python DSP module
A video, audio and podcast publication platform written in Python.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding...
A modern audiobook player for Linux using GTK+ 3
An asynchronous Python library to automate solving ReCAPTCHA v2 using audio
SincNet is a neural architecture for efficiently processing raw audio samples.
Python module for handling audio metadata
A captcha library that generates audio and image CAPTCHAs.
kapre: Keras Audio Preprocessors
Python audio and music signal processing library
Awesome pre-trained models toolkit based on PaddlePaddle (300+ models including Image, Text, Audio and Video with Easy Inference & Serving deployment)...
The PyTorch-based audio source separation toolkit for researchers
A PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers", in which we present a new method for separating a mi...
Audio MODEM Communication Library in Python
A PyTorch-based Speech Toolkit
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in ind...
A data augmentations library for audio, image, text, and video.