hello people,
i am stuck with my ITS sequences. i know i can use naive bayes, but i also wanted to perform the taxonomy assignment using blast.
so, i am trying to parse the references files that are available in the UNITE database, to use them within the classify-consensus-blast
function.
the problem seems to be quite straightforward for the SILVA database (see the RESCRIPt pipeline) but it looks more tricky for the UNITE database. or i am missing something trivial.
the thing is, i am unable to find a proper taxonomy
file (as the one they provide in SILVA). i need it to run the following, i.e. importing references, taxonomy file, then run blast:
conda activate qiime2-2023.2
# import the just created fasta file
qiime tools import \
--input-path /mnt/tables/sequences.fasta \
--output-path /mnt/tables/sequences.qza \
--type 'FeatureData[Sequence]'
# create the qza reference file for the database
qiime tools import \
--input-path /mnt/database_references/unite/developer/sh_refs_qiime_ver9_dynamic_29.11.2022_dev.fasta \
--output-path /mnt/database_references/unite/developer/sh_refs_qiime_ver9_dynamic_29.11.2022_dev.qza \
--type 'FeatureData[Sequence]'
# create the qza taxonomy file for the database
qiime tools import \
--input-path /mnt/database_references/unite/developer/sh_taxonomy_qiime_ver9_dynamic_29.11.2022_dev.txt \
--output-path /mnt/database_references/unite/developer/sh_taxonomy_qiime_ver9_dynamic_29.11.2022_dev.qza \
--type 'FeatureData[Sequence]'
# run blast
qiime feature-classifier classify-consensus-blast \
--i-query /mnt/tables/sequences.qza \
--i-reference-reads /mnt/database_references/unite/developer/sh_refs_qiime_ver9_dynamic_29.11.2022_dev.qza \
--i-reference-taxonomy /mnt/database_references/unite/developer/sh_taxonomy_qiime_ver9_dynamic_29.11.2022_dev.qza \
--p-evalue 0.0001 \
--o-classification /mnt/tables/taxonomy_qiime.qza \
--o-search-results /mnt/tables/top_hits.qza
# export taxonomy
qiime tools export \
--input-path /mnt/tables/taxonomy_qiime.qza \
--output-path /mnt/tables/
the problem occurs when i try to import the file sh_taxonomy_qiime_ver9_dynamic_29.11.2022_dev.txt
since it's not a fasta file:
qiime tools import \
--input-path /mnt/database_references/unite/developer/sh_taxonomy_qiime_ver9_dynamic_29.11.2022_dev.txt \
--output-path /mnt/database_references/unite/developer/sh_taxonomy_qiime_ver9_dynamic_29.11.2022_dev.qza \
--type 'FeatureData[Sequence]'
so...where or how do i get the information i need?