Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Seeking Advice on Audio Segmentation & Fine-Tuning for Improved Korean Pronunciation #83

Open
jikerbug opened this issue Feb 27, 2025 · 0 comments

Comments

@jikerbug
Copy link

jikerbug commented Feb 27, 2025

Hello! Thank you for all your amazing work!

I tried using the JP-KR model, but possibly due to the mix of both languages, I haven’t noticed a clear improvement in Korean pronunciation compared to the original model. Therefore, I’m preparing a LoRA-based fine-tuning process specifically to enhance Korean pronunciation. Right now, I’m slicing audio into fixed 30-second segments, but this often ends up cutting lyrics in the middle. Would you recommend aligning segments precisely with the lyrics instead, or does a static-length approach still work well in your experience? Any insights you can share would be greatly appreciated.

Once again, thank you for your incredible contributions!

@jikerbug jikerbug changed the title Subject: Advice on Audio Segmentation & LoRA Fine-Tuning for Better Korean Pronunciation Subject: Advice on Audio Segmentation & LoRA Fine-Tuning for Better Pronunciation Feb 27, 2025
@jikerbug jikerbug changed the title Subject: Advice on Audio Segmentation & LoRA Fine-Tuning for Better Pronunciation Seeking Advice on Audio Segmentation & Fine-Tuning for Improved Korean Pronunciation Feb 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant