Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to get the TextGrid file from a wav? #13

Open
sjq19960802 opened this issue Nov 28, 2024 · 5 comments
Open

how to get the TextGrid file from a wav? #13

sjq19960802 opened this issue Nov 28, 2024 · 5 comments

Comments

@sjq19960802
Copy link

sjq19960802 commented Nov 28, 2024

I only have a wav file, how can i generate the TextGrid file from it?

@AaronZ345
Copy link
Owner

Hi. You can refer to the annotation process descried in our paper to conduct the corresponding annotations.

@HuZhetao
Copy link

Hi. You can refer to the annotation process descried in our paper to conduct the corresponding annotations.

could you share the MFA model.zip

@AaronZ345
Copy link
Owner

Hi. You can refer to the annotation process descried in our paper to conduct the corresponding annotations.

could you share the MFA model.zip

You can find model for each language in https://mfa-models.readthedocs.io/en/latest/dictionary/index.html#dictionary

@HuZhetao
Copy link

Hi. You can refer to the annotation process descried in our paper to conduct the corresponding annotations.

could you share the MFA model.zip

You can find model for each language in https://mfa-models.readthedocs.io/en/latest/dictionary/index.html#dictionary

Did you train the model on you own dataset? I find the official model permformance poor on my sing dataset

@AaronZ345
Copy link
Owner

Hi. You can refer to the annotation process descried in our paper to conduct the corresponding annotations.

could you share the MFA model.zip

You can find model for each language in https://mfa-models.readthedocs.io/en/latest/dictionary/index.html#dictionary

Did you train the model on you own dataset? I find the official model permformance poor on my sing dataset

Sorry, I misunderstood earlier. We have an internal model, but it may not be available for public release. You can try using mfa adapt or use Whisper X or SOFA. To be honest, even our own results aren't great, as we’ve relied heavily on manual adjustments. We are currently working on an open-source SOTA model for aligning singing voices, so please look forward to it in the next few months.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants