Are there any training recipes or recommended configs for fine-tuning the parakeet-ctc-0.6b model? Specifically, I'm curious how long it took to train on 1 hour of input audio.
I've been fine-tuning on an AWS p3.16xlarge instance, which has 8 V100 GPUs with 16 GB of memory each. I run into persistent memory errors (torch.cuda.OutOfMemoryError) when training on more than 4 GPUs or with a batch size larger than 2. The quickest parameter set I've found takes 5 minutes to train on 10 hours of data.
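For reference, here is a minimal sketch of the NeMo-style fine-tuning setup I'm working from. The manifest paths and hyperparameters below are placeholders rather than my exact config, and the calls are NeMo's standard fine-tuning entry points, which may differ slightly between NeMo versions:

```python
import pytorch_lightning as pl
import nemo.collections.asr as nemo_asr

# Load the pretrained checkpoint (downloaded on first use).
model = nemo_asr.models.EncDecCTCModelBPE.from_pretrained("nvidia/parakeet-ctc-0.6b")

# Placeholder manifests -- substitute NeMo-format JSON manifests for your data.
model.setup_training_data(train_data_config={
    "manifest_filepath": "train_manifest.json",
    "sample_rate": 16000,
    "batch_size": 2,          # largest per-GPU batch that fits on a 16 GB V100 for me
    "shuffle": True,
})
model.setup_validation_data(val_data_config={
    "manifest_filepath": "val_manifest.json",
    "sample_rate": 16000,
    "batch_size": 2,
    "shuffle": False,
})

trainer = pl.Trainer(
    devices=4,                   # more than 4 GPUs triggers the OOM for me
    accelerator="gpu",
    strategy="ddp",
    precision=16,                # fp16 mixed precision (V100s don't support bf16)
    max_epochs=50,
    accumulate_grad_batches=8,   # simulate a larger effective batch size
)
model.set_trainer(trainer)
trainer.fit(model)
```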