Hi everyone, and thanks in advance for your help.
I am currently attempting to compare representative sequences from in-house experiments with an outside, publicly available, reference database for coal seam microbiomes. The goal is to identify whether or not bacteria in our samples are potentially common to other coal seam environments. In the example paper I followed, samples were clustered at 97% similarity using UPARSE, and the clustered OTU sequences were aligned to the reference dataset using BlastN. I mimic’d this as closely as possible on QIIME2 2021.4, but am now not even sure that it’s possible.
- Step one - import .fa file containing rep seqs from ref database
- qiime tools import --input-path CSMB_V1-0_sequences.fa \\ --output-path CSMBV1_Seqs.qza \\ --type 'FeatureData[Sequence]'
- Successful importing to CSMBV1_Seqs.qza
- qiime tools import --input-path CSMB_V1-0_sequences.fa \\ --output-path CSMBV1_Seqs.qza \\ --type 'FeatureData[Sequence]'
- Step two - cluster OTU sequences against reference database at 97% similarity
- qiime vsearch cluster-features-closed-reference \\ --i-table table_dada2.qza \\ --i-sequences rep_seqs_Dada2.qza \\ --i-reference-sequences CSMBV1_Seqs.qza \\ --p-perc-identity 0.97 \\ --i-clustered-table table-cr-97.qza \\ --o-clustered-sequences rep-seqs-cr-97.qza \\ --o-unmatched-sequences unmatched-cr-97.qza
- This gave me what I assume to be my clustered OTUs aligned to the reference database, condensed into three folders listed as the –output above
- Here’s where I get stuck. From this, I’ve been trying to follow different BlastN tutorials, namely from More on Taxonomy | Getting Started with Qiime2
- qiime feature-classifier classify-consensus-blast \
--i-query
--i-reference-reads
--i-reference-taxonomy
--p-perc-identity
--o-classification
--verbose-
I understand everything up until what I’m supposed to be putting in for –i-reference-taxonomy. Originally, I tested the following: qiime feature-classifier classify-consensus-blast \\ --i-query rep_seqs_Dada2.qza \\ --i-reference-reads rep-seqs-cr-97.qza \\ --i-reference-taxonomy table-cr-97.qza (obviously wrong) \\ --p-perc-identity 0.97 \\ --o-classification blast-taxonomy.qza \\ --verbose and received the following error: Invalid value for '--i-reference-taxonomy': Expected an artifact of at
least type FeatureData[Taxonomy]. An artifact of type TaxonomicClassifier (or FeatureData[Sequence]
was provided.
-
Because the database I’m pulling from does not have a file with the reference taxonomy, it makes me feel like I am going down the wrong path.
-
- qiime feature-classifier classify-consensus-blast \
I am inexperienced with bioinformatics and am hanging on by a thread. Thanks again.