This script is designed to split an audiobook into chapters based on detected chapter headings in the audio. It uses the extremely fast whisper.cpp
model for transcription and ffmpeg
for audio processing on Apple silion.
- Transcribes audio files to text using the
whisper.cpp
model (defaultlarge-v3-turbo
) - Detects chapter headings in the transcribed text.
- Splits the audio file into separate chapters.
- Generates output files including SRT, CUE, Markdown, and raw text with timestamps.
- Tested only on Mac os
- Procesor Apple silicon (M1/M2/M3)
- Python 3.x
ffmpeg
mutagen
pywhispercpp
- Clone the repository:
- Install the required Python packages:
pip install -r requirements.txt
- Place your MP3 files in the
Input/
directory. - Run the script:
python Splitter.py
- Skip Phrases: You can customize the phrases to skip when detecting chapters by editing the
skip_phrases.json
file.
This project is licensed under the MIT License. See the LICENSE file for details.
- whisper.cpp for the transcription model.
- FFmpeg for audio processing.