-
Audioshake
- Frankfurt, Germany
-
06:51
- 1h ahead - @faro@sigmoid.social
Lists (6)
Sort Name ascending (A-Z)
Stars
- All languages
- Arduino
- Assembly
- C
- C#
- C++
- CMake
- CSS
- Clojure
- CoffeeScript
- Crystal
- Cuda
- Cython
- Dart
- Dockerfile
- Gherkin
- Go
- HTML
- Haskell
- Java
- JavaScript
- Jinja
- Julia
- Jupyter Notebook
- LLVM
- LiveScript
- Lua
- MATLAB
- MDX
- Makefile
- Max
- Nim
- Objective-C
- Objective-C++
- PHP
- Perl
- PostScript
- Python
- R
- Ruby
- Rust
- SCSS
- Sass
- Shell
- Standard ML
- Svelte
- Swift
- TeX
- TypeScript
- Vue
- XSLT
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denoising using an ONNX model. This repository contains everythi…
ICASSP 2025 paper Perceptual Noise-Masking with Music through Deep Spectral Envelope Shaping
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec
On-going VA modeling research. Modeling dynamic range compressor using S4.
Framework for differentiable black-box and gray-box audio effects modeling
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
Landing Page for All Things Source Separation
Llambada: Simple Text Controllable for accompaniment generation
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
A low-bitrate single-codebook 16 kHz speech codec based on focal modulation
Rafaelmdcarneiro / Bend
Forked from HigherOrderCO/BendA massively parallel, high-level programming language
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…
Neural network emulator for guitar amplifiers.
Unified automatic quality assessment for speech, music, and sound.
Flycast is a multiplatform Sega Dreamcast, Naomi, Naomi 2 and Atomiswave emulator
The official implementation of HierSpeech++