Hi everyone!
I have performed a single-end analysis using only the R1 reads from my sequencing due to a quality issue I experienced related with the reverse reads, which issued the merging of my sequences.
I used the following dataset for the taxonomic classification (rep-seqs file and table)
table-single.qzv (454.6 KB)
rep-seqs-single.qzv (318.0 KB)
Initially used the pre-trained classifier provided by QIIME2 (silva 138.99).
Then, I attempted to train a classifier with my own data following the tutorial "Training feature classifiers with q2-feature-classifier" using the SILVA 138.99 files for Feature[sequence] and Feature[taxonomy] provided on the resource page. However, the result was that it only identified one species, and the rest remained unassigned.
I also made the same attempt with version 132 of SILVA at 99% identity, and almost all fragments were classified as unassigned.
Then, I made an attempt with the new plugin of Qiime2, Greengene2, following the instructions provided in the section for regions other than V4. I'm not sure if this section is optimized, but I was able to generate the .qzv file for taxonomy. However, when I tried to generate the code for taxa_bar_plots, I encountered the following error.
taxonomy.gg2.1.qzv (1.2 MB)
> Traceback (most recent call last):
File "/home/julia/anaconda3/envs/qiime2-2023.2/lib/python3.8/site-packages/q2cli/commands.py", line 352, in __call__
results = action(**arguments)
File "<decorator-gen-485>", line 2, in barplot
File "/home/julia/anaconda3/envs/qiime2-2023.2/lib/python3.8/site-packages/qiime2/sdk/action.py", line 234, in bound_callable
![Screenshot from 2023-05-26 12-38-58|580x500](upload://mvoGtEkG3jjN8L346qKIlv1KzM7.png)
outputs = self._callable_executor_(scope, callable_args,
File "/home/julia/anaconda3/envs/qiime2-2023.2/lib/python3.8/site-packages/qiime2/sdk/action.py", line 443, in _callable_executor_
ret_val = self._callable(output_dir=temp_dir, **view_args)
File "/home/julia/anaconda3/envs/qiime2-2023.2/lib/python3.8/site-packages/q2_taxa/_visualizer.py", line 41, in barplot
collapsed_tables = _extract_to_level(taxonomy, table)
File "/home/julia/anaconda3/envs/qiime2-2023.2/lib/python3.8/site-packages/q2_taxa/_util.py", line 42, in _extract_to_level
collapsed_table = _collapse_table(table, taxonomy, level, max_obs_lvl)
File "/home/julia/anaconda3/envs/qiime2-2023.2/lib/python3.8/site-packages/q2_taxa/_util.py", line 19, in _collapse_table
raise ValueError('Feature IDs found in the table are missing from the '
ValueError: Feature IDs found in the table are missing from the taxonomy: {'0ea89af03489cd0042e244da95717306', 'd31d618b00510a936ad75bf753bd8111', 'b3b318ac62506795e8a41ee289431779', '60fc710df6345a9b2b7f31ac1500231c', '0640579fca999d04069119566d6bea23', '5cfc2e63284826be749078d590de2e8a', '7006270b2117277ace9e3eb44fed625d', '567c2a18b33de5a199fcc707626b04c7', '8b49a8e2010e6d41a7a18c1ce2e9b0ba', '487405dc03bc3775979159b5c5a54af4',
I understand that I have just provided you with a lot of information at once. I'm not sure if it would be preferable for me to generate an additional post explaining the issues I am encountering in classification using my own classifier or using GreenGen2.
I also tried to use the training-classifier tutorial using the old version of greengene 13_5 and this were the results
In those tries were I used the training-classifier i used this code for the extraction of the reads:
qiime feature-classifier extract-reads \
--i-sequences ******
--p-f-primer ATTAGAWACCCBNGTAGTCC \
--p-r-primer CTGSTGCVNCCCGTAGG \
--p-max-length 500 \
--o-reads ****
Thanks a lot
Warmest regards,
Julia