Is it possible to import Nanopore ITS data (after BLAST)?

henganl2 · May 10, 2019, 9:28pm

Hello,

I have ITS sequencing data from Nanopore GridION.
I already blast the data with a custom database (Unite).
Is it possible to import blast results to QIIME2 for further analysis?
If so, what format should I use?

I am really new to this area, thanks for any advice in advance!

Best,
Heng-An

Nicholas_Bokulich · May 13, 2019, 12:33pm

Yes. If the BLAST results are in a tab-delimited format like the example below, you can import with this command:

qiime tools import \
  --type 'FeatureData[Taxonomy]' \
  --input-format HeaderlessTSVTaxonomyFormat \
  --input-path taxonomy.txt \
  --output-path taxonomy.qza

The expected format:

seq1<tab>semicolon;delimited;taxonomy
seq2<tab>semicolon;delimited;taxonomy
...

If that taxonomy file has a header line, use the same command as above but use TSVTaxonomyFormat for the input format, rather than HeaderlessTSVTaxonomyFormat.

Good luck!

henganl2 · May 17, 2019, 10:32pm

Thanks so much for reply!
I have a question about the format since I got some error message when I import the data.

What does the "seq1" "seq2" mean? is it the ID for each sequence? or the count of the same taxomomy group?

I have tried two types of format for the first column.
In the first format, I set the "seq1" as the count of the taxonomy, and got the error:

Taxonomy format feature IDs must be unique. The following IDs are duplicated: 3, 1, 2, 5, 4, 7, 9, 6, 8, 12

In the second format, I set "seq1" as the sequence ID, and the code works.

Am I doing this correctly?

Thanks!!!

Nicholas_Bokulich · May 18, 2019, 1:45am

You are correct with the second format.

henganl2 · May 20, 2019, 8:48pm

Hi @Nicholas_Bokulich,
Thanks for your reply!! That really helps a lot.

I have another question about the data format.

I am trying to run qiime taxa barplot, and find out that I need to import another file with Feature Table [Frequency] format. Since I don’t have a .biom file to import through the code below:
qiime tools import
–input-path feature-table-v210.biom
–type ‘FeatureTable[Frequency]’
–input-format BIOMV210Format
–output-path feature-table-2.qza
Is there another way to get this format? Or if I need to create the file by myself, what the format will look like?
In total, I have 96 samples and demultiplexed by other tools. For now, I only choose one of the files to test the code. If I want to analyze all the samples (eg. for alpha and beta-diversity analysis), it looks like all the 96 samples need to be in one .qza file? Is this correct?

Nicholas_Bokulich · May 21, 2019, 12:43pm

To create a barplot (or any other representation of species abundance) you need to know the abundance of each species in each sample. This is usually represented as an observation matrix, which can be converted to biom format and then imported to QIIME 2. See biom-format conversion instructions (not a part of QIIME 2) for details on how to perform this conversion, and expected formats.

If you are importing as a feature table, yes, they should all be in the feature table. Note: it is easy to merge feature tables inside QIIME 2 if it is easier for you to import each sample separately.

henganl2 · May 30, 2019, 11:18pm

Thanks @Nicholas_Bokulich!!

But when I convert the file to .biom format, I got some error in QIIME.

Here is part of my frequency table (.txt file) for biom converter

OUT ID BC01
otu1000 1
otu1001 1
otu1002 1
otu1003 2
otu1004 1
otu1005 1
otu1006 1
otu1007 1
otu1008 1
otu1009 1
otu100 1
otu1010 2
otu1011 1

Here is the code and the error message:

biom convert -i BC01.frequencyTable.txt -o BC01.frequencytable.biom --table-type="Table" --to-tsv

qiime tools import --input-path BC01.frequencytable.biom --type 'FeatureTable[Frequency]' --input-format BIOMV210Format --output-path BC01.featureTable.qza
There was a problem importing BC01.frequencytable.biom:

BC01.frequencytable.biom is not a(n) BIOMV210Format file

So, what should I specify in biom convert to get BIOMV210Format file?

Thanks again!

Nicholas_Bokulich · May 31, 2019, 12:00pm

Please read the biom-format documentation once more. Do you really want to convert to TSV? Don't you want a biom table?

henganl2 · May 31, 2019, 9:51pm

Thanks so much!! I use the code follow and works.

biom convert -i BC01.frequencyTable.txt -o BC01.frequencytable.biom --table-type="OTU table" --to-hdf5’

However, I got another error

Plugin error from taxa:
  'float' object has no attribute 'split'
Debug info has been saved to /tmp/qiime2-q2cli-err-m3ldv5xf.log

I know there is a similar post about this, but I think I don’t have missing data in my file.
Is it because I only use one sample in here?

Nicholas_Bokulich · May 31, 2019, 10:56pm

@henganl2,
Open a new topic since this is an entirely unrelated error. We have solved the original question in this topic!

In that new topic, make sure you list the information that is requested in the new topic template, especially:

the command you are running
the full error message

system · July 2, 2019, 4:56am

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.