UNITE v2.0-ref
These are the files used to train RDP Classifier v2.13. Sequences and taxonomy are largely based on the UNITE + INSD v8.3 full dataset for eukaryotes. Taxonomic adjustments were made to resolve unknown and non-unique taxa into a strictly hierarchical taxonomy. To reduce dataset size, sequences were dereplicated so that only unique sequences were retained. Compressed file size: 167.5 MB; decompressed file size: 1.2 GB.
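
The dereplication step can be illustrated with a minimal Python sketch. The release notes do not say which tool or matching rule was used, so everything here is an assumption: sequences are treated as duplicates only when they are identical over their full length (ignoring case), the first header seen for each sequence is kept, and the function names `parse_fasta` and `dereplicate` are hypothetical helpers, not part of RDP Classifier or UNITE.

```python
from typing import Dict, Iterable, Iterator, List, Tuple


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def dereplicate(records: Iterable[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Keep one record per unique sequence.

    Assumption: exact, case-insensitive, full-length matching, and the
    first header encountered for a sequence is the one retained.
    """
    first_header: Dict[str, str] = {}
    for header, seq in records:
        key = seq.upper()
        if key not in first_header:
            first_header[key] = header
    # dicts preserve insertion order, so output follows first appearance
    return [(header, seq) for seq, header in first_header.items()]
```

For a real file, `dereplicate(parse_fasta(open(path)))` would return the reduced record list, where `path` stands in for a local FASTA file. Note that this sketch does not check whether identical sequences carry conflicting taxonomy; how the actual training set handled that case is not stated here.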