A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
Updated Oct 22, 2024 - Python
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
List of direct speech-to-speech translation papers.
Real-time audio-to-audio translation over sockets. With virtual microphones, you can use it in any video conferencing software you like!
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
Applying deep learning to translate animation and re-generate audio.
HF Space app for End-to-End Speech-to-Speech Translation from Spanish to English using ESPnet
Official repository for our NeurIPS 2024 paper: DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation
Tool to generate English AI Dubbing for a YouTube video
Cascaded speech-to-speech translation (STST) that maps source speech in any language to target speech in English.
A comparison of E2E and cascaded S2ST systems on the CVSS-C Spanish-to-English dataset (CommonVoice 4.0)
Speech-to-speech translation in Python