Skip to content
View faroit's full-sized avatar
🚀
Rocket Science
🚀
Rocket Science

Organizations

@RocketScienceAbteilung @sigsep

Block or report faroit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

a text-conditional diffusion probabilistic model capable of generating high fidelity audio.

Python 153 19 Updated May 29, 2024

SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denoising using an ONNX model. This repository contains everythi…

Python 65 12 Updated Aug 16, 2024

ICASSP 2025 paper Perceptual Noise-Masking with Music through Deep Spectral Envelope Shaping

Python 4 1 Updated Feb 24, 2025

Human Voice Wave Samples

83 8 Updated Jan 5, 2015

Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"

Python 9 1 Updated Oct 31, 2024
Python 19 3 Updated Feb 24, 2025

The official Soundwave repository

Python 126 16 Updated Mar 5, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,002 619 Updated Mar 5, 2025

High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec

Jupyter Notebook 96 9 Updated Jan 20, 2025

xet client tech, used in huggingface_hub

Rust 46 4 Updated Mar 12, 2025

On-going VA modeling research. Modeling dynamic range compressor using S4.

Jupyter Notebook 11 Updated Sep 9, 2023

Framework for differentiable black-box and gray-box audio effects modeling

Python 53 1 Updated Feb 24, 2025

Demo page of TCSinger

SCSS 4 Updated Feb 4, 2025
Python 3,915 313 Updated Mar 6, 2025

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…

Python 440 17 Updated Jan 9, 2025

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023

Python 212 13 Updated Mar 13, 2023

Landing Page for All Things Source Separation

22 1 Updated Nov 7, 2024

Llambada: Simple Text Controllable for accompaniment generation

Python 28 3 Updated Feb 12, 2025

(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec

Python 27 2 Updated Dec 20, 2024

Code will be available upon paper acceptance.

7 Updated Oct 13, 2024

Audio Annotation Tool for ML development

TypeScript 55 11 Updated Feb 5, 2025

A low-bitrate single-codebook 16 kHz speech codec based on focal modulation

Python 78 10 Updated Feb 12, 2025

A massively parallel, high-level programming language

Rust 3 Updated Feb 3, 2025

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 858 66 Updated Feb 9, 2025

Neural network emulator for guitar amplifiers.

Python 1 Updated Jan 20, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 412 25 Updated Mar 7, 2025

Flycast is a multiplatform Sega Dreamcast, Naomi, Naomi 2 and Atomiswave emulator

C++ 1,662 196 Updated Mar 7, 2025

Get RSS feeds in notion.so

Go 54 62 Updated Oct 24, 2024

The official implementation of HierSpeech++

Python 1,213 149 Updated Feb 20, 2024

Free, open source crypto trading bot

Python 37,165 7,311 Updated Mar 11, 2025
Next
Showing results