Taxonomy File Format for training my classifiers

I am trying to import a taxonomy file specific to the oral microbiota, published by Ohio State.

The command I am running is below:

qiime tools import \
  --type 'FeatureData[Taxonomy]' \
  --input-path CORE.taxonomy.tab.noheader.txt \
  --output-path ref-taxonomy-OhioState.qza

The error I am getting is below:
There was a problem importing CORE.taxonomy.tab.noheader.txt:

CORE.taxonomy.tab.noheader.txt is not a(n) TSVTaxonomyFormat file:

['Feature ID', 'Taxon'] must be the first two header values. The first two header values provided are: ['X80413', 'Actinobacteria'] (on line 1).

The header and first row of my taxonomy txt file look like this:
accession phylum class order family genus species
X80413 Actinobacteria Actinobacteridae Actinomycetales Actinomycetaceae Actinomyces Actinomyces georgiae

Is there a special format I should be using or do I need to change the taxonomy file?

I am running QIIME 2 2019.10.

Thank you

Welcome to the forum @Chgilber!

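The taxonomy file should be a two-column, tab-separated file with the header "Feature ID"<TAB>"Taxon" (as the error message says), with the full lineage combined into the second column, so you will need to reformat your file.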

The best way to make sure you have the correct format is to look at an example file, e.g., see the QIIME-compatible releases of the HOMD database:
http://www.homd.org/index.php?name=seqDownload&file&type=R
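If it helps, here is a minimal sketch of the reformatting in Python. It assumes your file is tab-separated with the columns you showed (accession, then phylum through species), and it joins the ranks with "; ", which is a common convention rather than something I have checked against your reference sequences. The function names and the output filename are made up for illustration.

```python
def to_qiime_taxonomy(lines):
    """Yield (feature_id, taxon) pairs from tab-separated taxonomy lines.

    Assumes column 1 is the accession and the remaining columns are ranks
    (phylum ... species). A header row starting with "accession" and blank
    lines are skipped.
    """
    for line in lines:
        fields = line.rstrip("\r\n").split("\t")
        if not fields[0] or fields[0].lower() == "accession":
            continue
        # Drop empty ranks and collapse the rest into one lineage string.
        ranks = [f.strip() for f in fields[1:] if f.strip()]
        yield fields[0].strip(), "; ".join(ranks)


def convert(src_path, dst_path):
    """Write a two-column file with the 'Feature ID' / 'Taxon' header."""
    with open(src_path) as src, open(dst_path, "w") as dst:
        dst.write("Feature ID\tTaxon\n")
        for feature_id, taxon in to_qiime_taxonomy(src):
            dst.write(f"{feature_id}\t{taxon}\n")
```

You could then call something like convert("CORE.taxonomy.tab.noheader.txt", "CORE.taxonomy.qiime.tsv") and point --input-path at the new file. Make sure the IDs in the first column match the sequence IDs in your reference FASTA.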

Good luck!
