Skip to content

eigenpunk/ComfyUI-audio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

1417b4c · Mar 3, 2024

History

44 Commits
Mar 3, 2024
Dec 6, 2023
Mar 3, 2024
Mar 2, 2024
Mar 3, 2024
Mar 3, 2024
Mar 2, 2024
Dec 13, 2023
Mar 3, 2024
Mar 3, 2024
Mar 3, 2024
Mar 3, 2024
Dec 30, 2023
Mar 3, 2024
Mar 2, 2024

Repository files navigation

ComfyUI-audio

generative audio tools for ComfyUI. highly experimental—expect things to break and/or change frequently or not at all.

NOTE: for the foreseeable future, i will be unable to continue working on this extension. please consider forking this repository!

features

installation

# TORCH_CUDA_INDEX_URL=https://download.pytorch.org/whl/cu118  # for cuda 11.8
TORCH_CUDA_INDEX_URL=https://download.pytorch.org/whl/cu121  # for cuda 12.1

cd ComfyUI/custom_nodes
git clone https://github.com/eigenpunk/ComfyUI-audio
cd ComfyUI-audio

# for linux
pip install -r requirements.txt --extra-index-url $TORCH_CUDA_INDEX_URL

# for windows
pip install -r requirements_windows.txt --extra-index-url $TORCH_CUDA_INDEX_URL

this extension is developed and tested on a Linux-based OS. i've not yet been able to get the extension fully working on Windows, so expect some difficulty if that is your platform. i've not tested the extension on macOS at all.

would be nice to have maybe

  • audio uploads
  • audio previews
  • prompt weights for text-to-music/audio
  • stereo musicgen
  • multi-band diffusion
  • more/faster tts model support
  • split generator nodes by model stages
    • e.g. tortoise:
      • autoregressor
      • clvp/cvvp
      • spectrogram diffusion
    • e.g. musicgen:
      • t5 text encode
      • encodec audio encode
      • generate with decoder
  • more audio generation models
    • magnet, etc
  • demucs
  • audiogen

NOTE: this work is solely a personal project; its development is not supported/sponsored by any past/present employer or any other external organization.