Hi there Q2 White Wizards:
I have ran the following commands:
qiime feature-classifier classify-consensus-vsearch --i-query rep-seqs.qza --i-reference-taxonomy taxonomy_7_levels.99.qza --i-reference-reads silva_132_99.qza --o-classification Baha_taxa.qza --p-maxaccepts 1 --p-threads 25
qiime taxa barplot --i-table table.qza --i-taxonomy Baha_taxa.qza --m-metadata-file Bahamas_Metadata.txt --o-visualization Baha_taxa-bar-plots.qzv
And used the Qiime2 View to see these Baha_taxa-bar-plots.qzv. Unfortunately, my sequences are 70% unclassified. Boo!! So, looking at past forums such as this one: Classify-consensus-vsearch - high number of unassigned features I can see I am not alone in this. In his answer, Nicholas stated: “_Pull out some of the sequences that are not classifying, and run an NCBI blast search on them to see what these seqs could be.” So, I’m on a dark, Alice in Wonderland quest to do just this.
I am trying to pull out all of my fastq sequences associated with the Unassigned ASV’s to either 1) run blast on them or; 2) re-run another way of annotation w/in Qiime2. I must note that I have done this with Qiime 1 and I ended up with ~30% Unclassified’s, but I know the answer bc I read it in another post - “maybe this is the correct annotation and not Qiime 1”. So, onward on my quest:
After following this tutorial: Exporting and modifying BIOM tables (e.g. adding taxonomy annotations) on how to put your ASV, Taxa, and sample counts together (by the way at the end of that tutorial you get 2 files, one with ASV, the other with sample counts with ASV. I had to merge them in excel, but I digress …).
Armed now with the ASV’s, I found this tutorial Filtering fasta sequences file based on a list of OTUs that tries to filter fasta sequences from a list of OTUs. So, my data is is .fastq, but I don’t think this matters. I noticed the --m-metadata-file was the list of OTUs, so
I tried to do just this:
- Used grep to remove all ASV that were labeled Unassigned and saved it to ASV_Unassigned.txt
$head ASV_Unassigned.txt #These are all ASV’s that are Unassigned
#Sample ID
63fe5b9af4b8589c284b7197a5b0abe2cb207a06
6dd61a4f4eb1c52de616c5ba367a9eb18aa9c21e
fc92df8284c115b32eb8b9aa193c27559186a18c
d82a416ecea8d18af1ed0b5d48f2c43ddbce73c6
Next, I ran this, which promptly gave me the following error:
qiime tools import –input-path seqs.fna –output-path seqs.qza –type FeatureData[Sequence]
qiime feature-table filter-seqs --i-data seqs.qza --m-metadata-file ASV_Unassigned.txt --o-filtered-data ASV_filtered.qza
Plugin error from feature-table:
All features were filtered out of the data.
Debug info has been saved to /var/folders/z4/jfbc76zs7_l8q_4xb8rgtmf00000gn/T/qiime2-q2cli-err-uk08gy61.log
(qiime2-2018.8) ssrb-vpn1-5-82:Testing $ cat Debug info has been saved to /var/folders/z4/jfbc76zs7_l8q_4xb8rgtmf00000gn/T/qiime2-q2cli-err-uk08gy61.log
cat: Debug: No such file or directory
cat: info: No such file or directory
cat: has: No such file or directory
cat: been: No such file or directory
cat: saved: No such file or directory
cat: to: No such file or directory
Traceback (most recent call last):
File “/Users/miniconda2/envs/qiime2-2018.8/lib/python3.5/site-packages/q2cli/commands.py”, line 274, in call
results = action(**arguments)
File “”, line 2, in filter_seqs
File “/Users/miniconda2/envs/qiime2-2018.8/lib/python3.5/site-packages/qiime2/sdk/action.py”, line 231, in bound_callable
output_types, provenance)
File “/Users/miniconda2/envs/qiime2-2018.8/lib/python3.5/site-packages/qiime2/sdk/action.py”, line 362, in callable_executor
output_views = self._callable(**view_args)
File “/Users/miniconda2/envs/qiime2-2018.8/lib/python3.5/site-packages/q2_feature_table/_filter.py”, line 112, in filter_seqs
raise ValueError(‘All features were filtered out of the data.’)
ValueError: All features were filtered out of the data.
Ok … I don’t speak code hieroglyphic, so I’m wondering 1) Is this way of trying to ID the .fastq sequences of the ones annotated as Unassigned is correct and; 2) What does that error mean?
Many Thanks!!
Purrsia