Hi there, I'm trying to use the vsearch dereplication plugin: dereplicate-sequences: Dereplicate sequences. — QIIME 2 2024.2.0 documentation
My lab has decided not to use the DADA2 filter/deblur steps in Qiime2 since it seems to throw up most of our sequencing reads from Illumina MiSeq paired-end sequences.
I used Trim Galore for pre-processing, then AMPtk for quality filtering and to merge my ITS1 Illumina Miseq R1 and R2 files. I am now trying to use the vsearch plugin to dereplicate my sequences and create a .qza table that I could then use for taxonomy assignment.
This is what I used to import my data into qiime (please keep in mind these are already merged R1 & R2 sequences from the MiSeq):
qiime tools import
--type 'SampleData[SequencesWithQuality]'
--input-path manifest.csv
--output-path single-end-demux.qza
--input-format SingleEndFastqManifestPhred33V2
And then for the vsearch dereplication plugin:
qiime vsearch dereplicate-sequences
--i-sequences single-end-demux.qza
--p-min-unique-size 10
--o-dereplicated-table table.qza
--o-dereplicated-sequences rep-seqs.qza
This is the error I am encountering:
Plugin error from vsearch:
Mapping not provided for observation identifier: sample-58_100035. If this identifier should not be updated, pass strict=False.
Debug info has been saved to /var/folders/m4/sjv1hpgd4hnft76_h_q1bgtr0000gn/T/qiime2-q2cli-err-tgjtn573.log
Here is the log file:
File "/Users/amathewson/miniconda3/envs/qiime2-amplicon-2024.2/lib/python3.8/site-packages/q2cli/commands.py", line 520, in call
results = self._execute_action(
File "/Users/amathewson/miniconda3/envs/qiime2-amplicon-2024.2/lib/python3.8/site-packages/q2cli/commands.py", line 581, in _execute_action
results = action(**arguments)
File "", line 2, in dereplicate_sequences
File "/Users/amathewson/miniconda3/envs/qiime2-amplicon-2024.2/lib/python3.8/site-packages/qiime2/sdk/action.py", line 342, in bound_callable
outputs = self.callable_executor(
File "/Users/amathewson/miniconda3/envs/qiime2-amplicon-2024.2/lib/python3.8/site-packages/qiime2/sdk/action.py", line 566, in callable_executor
output_views = self._callable(**view_args)
File "/Users/amathewson/miniconda3/envs/qiime2-amplicon-2024.2/lib/python3.8/site-packages/q2_vsearch/_cluster_sequences.py", line 45, in dereplicate_sequences
table.update_ids(id_map=id_map, axis='observation')
File "/Users/amathewson/miniconda3/envs/qiime2-amplicon-2024.2/lib/python3.8/site-packages/biom/table.py", line 1410, in update_ids
raise TableException(
biom.exception.TableException: Mapping not provided for observation identifier: sample-58_100035. If this identifier should not be updated, pass strict=False.
(END)
Is there any way to get around this or fix this issue? Could it be my manifest file or the way I imported my files into qiime?
Thanks for any input!