This project was originally based on MeetingSummarizer but has since evolved into an independent project with its own features and goals. It is tailored for Windows 11 and provides a CLI utility that records audio from multiple sources and generates a summary of the recorded content. It uses FFmpeg for audio recording, Whisper for transcription, and an OpenAI GPT model for summarization, giving Windows 11 users a single tool for capturing and distilling meeting information.
- Python: Download and install the latest version of Python from the official website or use your package manager.
- FFmpeg: Install FFmpeg using a package manager. For Windows, you can use Chocolatey:
choco install ffmpeg-full
- Virtual Audio Device: Download and install the Virtual Audio Device for Windows.
- OpenAI API Key: Sign up for an account on the OpenAI platform and obtain your API key.
- Create a .env file with the following:
OPENAI_API_KEY=<your-openai-api-key>
- Run the following command in a terminal to list your microphones and audio devices:
ffmpeg -list_devices true -f dshow -i dummy
- Copy the names of your microphone and audio device from the output and set them in the cli.py file.
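The device names can also be pulled out of the listing programmatically. The following is a minimal sketch, not part of cli.py: the helper names `parse_dshow_devices` and `list_dshow_devices` are hypothetical, and the parser assumes each device appears as a quoted name on its own line, with optional "Alternative name" lines in between. The exact layout varies between FFmpeg versions, so treat the sample text as illustrative only.

```python
import re
import subprocess

# Matches the first double-quoted string on a line, e.g. "Microphone (Example Audio)".
_QUOTED = re.compile(r'"([^"]+)"')


def parse_dshow_devices(ffmpeg_output: str) -> list[str]:
    """Return the quoted device names found in a dshow device listing."""
    names: list[str] = []
    for line in ffmpeg_output.splitlines():
        # "Alternative name" lines repeat a device as a long moniker; skip them.
        if "Alternative name" in line:
            continue
        match = _QUOTED.search(line)
        if match and match.group(1) not in names:
            names.append(match.group(1))
    return names


def list_dshow_devices() -> list[str]:
    """Run the listing command from the README and parse its output."""
    result = subprocess.run(
        ["ffmpeg", "-list_devices", "true", "-f", "dshow", "-i", "dummy"],
        capture_output=True,
        text=True,
        errors="replace",
    )
    # FFmpeg writes its log, including the device list, to stderr.
    # The exit code is non-zero because "dummy" is not a real input.
    return parse_dshow_devices(result.stderr)


# Hypothetical sample; real output differs by machine and FFmpeg version.
SAMPLE = """[dshow @ 000001] "Microphone (Example Audio)" (audio)
[dshow @ 000001]   Alternative name "@device_cm_{EXAMPLE}"
[dshow @ 000001] "Virtual Audio Device (Example)" (audio)
"""
```

Calling `parse_dshow_devices(SAMPLE)` yields the two quoted names, which are the strings to copy into cli.py.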
Whisper is an Automatic Speech Recognition (ASR) system developed by OpenAI. It converts spoken language into written text and is trained on a large amount of multilingual and multitask supervised data collected from the web.
OpenAI Whisper: whisper
The command-line interface (CLI) utility allows you to record meetings and transcribe and summarize the recordings using the following commands:
- Record a meeting:
python cli.py record output.mp3
- Stop recording:
Ctrl + C
- Transcribe and summarize a recorded meeting:
python cli.py summarize output.mp3
Replace output.mp3 with your desired output file name.
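For readers curious how the two commands might be wired up, here is a rough sketch using `argparse` subcommands. It is an assumption about the structure, not the actual contents of cli.py: the function names `build_parser` and `main`, the argument names, and the placeholder return strings are all hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with the two subcommands shown above."""
    parser = argparse.ArgumentParser(
        prog="cli.py", description="Record and summarize meetings."
    )
    sub = parser.add_subparsers(dest="command", required=True)

    record = sub.add_parser("record", help="Record audio until Ctrl + C is pressed.")
    record.add_argument("output", help="Audio file to write, e.g. output.mp3")

    summarize = sub.add_parser("summarize", help="Transcribe and summarize a recording.")
    summarize.add_argument("audio", help="Recorded audio file to process")
    return parser


def main(argv=None) -> str:
    """Dispatch to the chosen subcommand and describe what would happen."""
    args = build_parser().parse_args(argv)
    if args.command == "record":
        # Placeholder: a real implementation would start FFmpeg here.
        return f"would record to {args.output}"
    # Placeholder: a real implementation would call Whisper and the OpenAI API here.
    return f"would transcribe and summarize {args.audio}"
```

With this layout, `main(["record", "output.mp3"])` selects the recording branch and `main(["summarize", "output.mp3"])` selects the summarization branch. A script would typically call `main()` with no arguments so that `argparse` reads them from the command line.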
Install the required Python packages using the following command:
pip install -r requirements.txt
- Change the input and output devices to the Virtual Audio Device in the Windows sound settings.
- If you are using Teams or a similar application, you also need to forward your speakers to the Virtual Audio Device.
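Once the microphone and the Virtual Audio Device are both available as inputs, recording from the two sources amounts to giving FFmpeg two DirectShow inputs and mixing them. The sketch below shows one way to do that with the `amix` filter. It is an illustration under assumptions, not the command cli.py necessarily runs: the function names and the device names in the usage note are hypothetical.

```python
import subprocess


def build_record_command(mic: str, loopback: str, output: str) -> list[str]:
    """Build an FFmpeg command that mixes two DirectShow audio inputs into one file."""
    return [
        "ffmpeg",
        "-f", "dshow", "-i", f"audio={mic}",        # first input: the microphone
        "-f", "dshow", "-i", f"audio={loopback}",   # second input: the virtual device
        "-filter_complex", "amix=inputs=2:duration=longest",
        output,
    ]


def record(mic: str, loopback: str, output: str) -> int:
    """Run FFmpeg until it exits or the user presses Ctrl + C."""
    process = subprocess.Popen(build_record_command(mic, loopback, output))
    try:
        return process.wait()
    except KeyboardInterrupt:
        # Assumption: FFmpeg shares the console, so it also receives Ctrl + C
        # and finalizes the file on its own; wait for it to finish.
        return process.wait()
```

For example, `build_record_command("Microphone (Example Audio)", "Virtual Audio Device (Example)", "output.mp3")` returns the argument list to pass to `subprocess`. Passing the arguments as a list avoids shell quoting problems with device names that contain spaces or parentheses.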
This project is licensed under the Apache License.