I'm linking GBIF data, and nothing in the 1.7 TB file seems to get through a regex for the id.
I see "filtered" in the names of some files, but I don't know how they were filtered.
This file gets filtered down from 1.7 TB to 2.5 GB, which means we only use about 0.15% of the GBIF data by size.
2.5G /fs/project/PAS1604/gbif/0147211-200613084148143.filtered.txt
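As a rough check of how much of the data survives the filter, here is a minimal sketch using the two sizes quoted above. It assumes decimal units (1 TB = 1000 GB), and the variable names are only illustrative:

```python
# Sizes reported in this thread; decimal units (1 TB = 1000 GB) are an assumption.
original_gb = 1.7 * 1000  # unfiltered GBIF download, 1.7 TB
filtered_gb = 2.5         # size of the .filtered.txt file

# Share of the original file, by size, that remains after filtering.
kept_percent = filtered_gb / original_gb * 100
print(f"kept: {kept_percent:.2f}% of the original size")
```

This compares file sizes only, not record counts, so the fraction of occurrences kept may differ.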
The filtering is handled in this file (or some variation of it, as I'm about to rename the file):
https://github.com/OSC/phylogatr-web/blob/40504e83491626da3a2279304c7464f6ce21df58/gbif_filter_occurrences.pbs
This ticket will now be used to document this filtering.