Importing .csv to .qza

I've downloaded my taxa barplot from Qiita and I can only download it as a .csv file. I'm trying to find how I can convert that to a .qza file but I cannot find it in the current documentation. Can someone send me to the correct import page or send me the correct command? Thank you!

Hi @Stephanieorch ,

This thread should be of help to you. Could you try qiime tools import using --type 'SampleData[AlphaDiversity]'?

My .csv file is from running summarize taxa on Qiita. So the using alpha diversity as its semantic type gives me an error sayiing "is not a(n) AlphaDiversityFormat file".

Ok, thanks for trying that out. Actually, it seems the .csv file is just the metadata file with taxa abundances added. So depending on what you want to do, it may not need importing, but rather just use like a metadata file (with conversion to a .tsv and renaming the first column from 'index' to 'sampleid' or equivalent). What are you wanting to do with the file in Qiime2?

I was hoping to use it as the input for --i-taxonomy.

Sorry, I missed the part in your original post that you downloaded the file from QIITA. And you're now wanting to use it in QIIME2.

First to clarify, the barplot output is not a taxonomy file. The --i-taxonomy file for qiime taxa barplot contains the taxonomies that have been assigned using some sort of taxonomic classifier on the representative sequences in your feature table (rep-seqs.qza).

To get back to your original problem, it may be avoided by just downloading the full taxa-bar-plots.qzv file. This can be done in qiime2 view by clicking on the download icon on the far right corner of the page (next to the link symbol).

Ok, I want to group the samples together so I'll have to do it all in Qiime2. A quick follow up question, how can I go from a biom table on Qiita to a FeatureTable[Sequences] file to be used as the rep-seqs? When I download the biom and import it, it's FeatureTable[Frequency]. How can I convert that to FeatureTable[Sequences]? I'm not sure if this matters, but I am doing this using closed reference data.

Hi @Stephanieorch,

As your file is a close reference table there are a few things to consider:

  • The names of the OTUs are not sequences but OTU ids based on your reference
  • The representative sequences are not sequences from your study or studies but the centroids of the OTUs given by your reference
  • The tree and taxonomies are "inherited" from the reference
  • The close reference approach will normally yield a log/txt file with which sequences were grouped within which OTU id

Thus, depending on what you want to do with those sequences is what the next step will be.

Hope this helps.

Since I am unable to use my taxa plot from Qiita in Qiime, I am trying to create one on Qiime to analyze. I wanted to convert my biom table to a FeatureData[Sequences] file so I will have the required rep-seqs.qza to input into the command on Qiime to create a taxa.qza file that I can further analyze. Thank you!

@Stephanieorch, right, the csv that is available to download from a qzv taxa summary like this will not allow you to get qza's for FeatureTable[Frequency] and a FeatureData[Taxonomy]. However, a biom table with taxonomy (basically the input for that visualization in Qiita), could be converted to these two qza's via:

$ qiime tools import \
  --input-path hdf5.biom \
  --output-path feature-table.qza \
  --type "FeatureTable[Frequency]"
$ qiime tools import \
  --input-path hdf5.biom \
  --output-path taxonomy.qza \
  --source-format BIOMV210Format \
  --type "FeatureData[Taxonomy]"

Credit to @thermokarst who sent me these commands.

Hope this helps.

Those should fix it! Thank you all for your help.

1 Like

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.