-
Audioshake
- Frankfurt, Germany
-
13:09
(UTC +01:00) - @faro@sigmoid.social
audio
Efficient Training of Audio Transformers with Patchout
Music segmentation using convolutional neural networks.
Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.
TensorFlow implementation of adversarial drum synth (ADS) from the paper Adversarial Synthesis of Drum Sounds @ The 2020 DAFx Conference.
Stream and file based music metadata parser for node. Supporting a wide range of audio and tag formats.
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
Utilities for working with chord progressions
Creative Machine Learning course and notebook tutorials in JAX, PyTorch and Numpy
Phase-aware speech enchancement with Deep Complex U-Net
Conformer-based Metric GAN for speech enhancement
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
AcademiCodec: An Open Source Audio Codec Model for Academic Research
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting…
Clone a voice in 5 seconds to generate arbitrary speech in real-time
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
PAM is a no-reference audio quality metric for audio generation tasks
An Open Source text-to-speech system built by inverting Whisper.
A fast python library for aligning similar audio snippets passed in as NumPy arrays