Awesome pre-trained models toolkit based on PaddlePaddle.(300+ models including Image, Text, Audio and Video with Easy Inference & Serving deployment)...
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
A data augmentations library for audio, image, text, and video.
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
:musical_note: :rainbow: Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi...
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)...
Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices
A GUI frontend for @werman's Pulse Audio real-time noise suppression plugin
:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Data manipulation and transformation for audio signal processing, powered by PyTorch
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding...
The PyTorch-based audio source separation toolkit for researchers
WaveGAN: Learn to synthesize raw audio with generative adversarial networks
SincNet is a neural architecture for efficiently processing raw audio samples.
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in ind...
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning....