Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minor issues #3

Open
danyaljj opened this issue Aug 16, 2018 · 1 comment
Open

Minor issues #3

danyaljj opened this issue Aug 16, 2018 · 1 comment

Comments

@danyaljj
Copy link

Hello there,

Nitpicky issues:

@julianmichael
Copy link
Collaborator

Thanks for the feedback—you're right, the .tsv extension is not technically right. But I did it that way so you can read them directly in a text editor or browser, e.g., in the link you gave above.

To read them in automatically, just chop off the first 50 characters for the question and trim whitespace. (50 characters was the length limit for the questions during annotation.)

Yes, some questions are repeated. In dev and test, we had multiple annotators write QA pairs for each set of target words, so duplicates will be especially common in those partitions. The example you give is one of those.

Hope that resolves your questions. I'll leave this issue open until I get around to updating the readme with clarifications.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants