Hi QIIME 2 Community,
I’m currently working on a microbiome analysis project and have encountered a situation where I'm using different versions of the SILVA database for different steps in my workflow. Specifically, I am using SEPP with the SILVA 128 reference database to build my phylogenetic tree, but I am assigning taxonomy with the SILVA 138 classifier.
Here’s a summary of my workflow:
- Phylogenetic Tree Construction:
- I used SEPP with the SILVA 128 reference database (
sepp-refs-silva-128.qza
) to insert my representative sequences into the phylogenetic tree. - After running SEPP, I filtered my feature table based on the sequences that were successfully placed in the tree.
- Taxonomy Assignment:
- For taxonomy classification, I used the SILVA 138 database (
silva138_rhizosphere.qza
) to assign taxonomy to my sequences.
My Question:
Is there any potential issue with using SEPP Silva128 for the phylogenetic tree but Silva138 for taxonomy assignment? Specifically:
- Could the mismatch between the SILVA versions cause any significant loss of data or misclassification during the SEPP filtering step?
- Are there potential taxonomic inconsistencies I should be concerned about when comparing the results from these two different database versions?
I’m particularly interested in understanding how this might impact analyses focused on different taxonomic levels (e.g., genus vs. species).
Thank you in advance for your insights!