For importing FeatureData[Taxonomy]

Hi, I plan to use the publicly accessible NCBI taxonomy data as a reference. I am about to convert the original format into one that can be imported into QIIME 2. I just noticed there are four formats related to taxonomy:

HeaderlessTSVTaxonomyDirectoryFormat
HeaderlessTSVTaxonomyFormat
TSVTaxonomyDirectoryFormat
TSVTaxonomyFormat

I found a description of 'HeaderlessTSVTaxonomyFormat' but not of the other three. I wonder if someone can give me a hint?

Thanks!!

Hi @b87401116,

You can ignore the *DirectoryFormat versions: they are mostly used behind the scenes and are not useful anyway if you are simply importing taxonomy from a single file. So you'd stick with TSVTaxonomyFormat (the file has a header line) or HeaderlessTSVTaxonomyFormat (no header line), most likely the latter.
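As a rough sketch of what the headerless layout looks like, assuming a two-column tab-separated file with the feature ID first, the taxon string second, and no header row, something like the following could write it out. The function name `write_headerless_taxonomy`, the example IDs, and the semicolon-joined rank strings are illustrative assumptions, not part of QIIME 2 or of the NCBI dump format.

```python
import csv
import io


def write_headerless_taxonomy(records, handle):
    """Write (feature_id, ranks) pairs as a two-column TSV with no header row."""
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    for feature_id, ranks in records:
        # Join the ranks into one semicolon-delimited taxon string
        # (an assumed convention for this sketch).
        writer.writerow([feature_id, "; ".join(ranks)])


# Hypothetical records for illustration only, not real NCBI output.
records = [
    ("seq1", ["k__Bacteria", "p__Proteobacteria"]),
    ("seq2", ["k__Bacteria", "p__Firmicutes"]),
]

buffer = io.StringIO()
write_headerless_taxonomy(records, buffer)
print(buffer.getvalue())
```

A file written this way could then be passed to `qiime tools import` with `--type 'FeatureData[Taxonomy]'` and `--input-format HeaderlessTSVTaxonomyFormat`.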

Depending on your needs, you can also use RESCRIPt to fetch reference sequence and taxonomy data from GenBank.

-Mike

Mike,
Very helpful, and such a fast reply :slight_smile: