Hello everyone,
I am currently working on AMF (Arbuscular Mycorrhizal Fungi) community analysis and I’m facing difficulties in selecting or customizing a suitable taxonomy reference database that works well with QIIME2 or external tools like VSEARCH, particularly for AMF taxa.
What I’ve Tried:
I used the SILVA 138.1 and 138.2 databases (SILVA_138.1_SSURef_NR99_tax_silva_fixed.fasta and SILVA_138.2_SSURef_tax_silva_fixed.fasta) to assign taxonomy with VSEARCH (--usearch_global) after clustering OTUs.
The taxonomy string mapping was done using SILVA_138.1_taxonomy_map.tsv and later SILVA_138.2_taxonomy_fixed.txt.
Issue:
The taxonomy strings from these SILVA versions contain a large number of intermediate taxonomic levels (e.g., Supergroup, Subgroup, Subphylum, Strain tags), often exceeding 15 ranks. This makes it challenging to extract only the standard 7-level hierarchy (Kingdom, Phylum, Class, Order, Family, Genus, Species).
Here is an example of a taxonomy string from SILVA:
Eukaryota;Amorphea;Obazoa;Opisthokonta;Nucletmycea;Fungi;Dikarya;Basidiomycota;Pucciniomycotina;Pucciniomycetes;Pucciniales;Pucciniaceae;Gymnosporangium;Gymnosporangium;ellisii
In this case, Fungi is expected as Kingdom, but it appears as the sixth of 15 fields, after five supra-kingdom levels.
My Question:
Is there any QIIME2-compatible database specifically curated for AMF (18S) that:
- Uses the standard 7 taxonomic ranks?
- Works directly with VSEARCH or the QIIME2 feature-classifier plugin?
- Has been used effectively in recent AMF metabarcoding studies?
Alternatively, is there a recommended way to simplify or map long SILVA taxonomies to the standard 7 levels in a reproducible manner?
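To make the second option concrete, here is a minimal Python sketch of the kind of mapping I have in mind. It is only a heuristic, not an established method: it assumes the lineage contains the name "Fungi" and that the ranks below it follow the standard fungal name endings (-mycota for phylum, -mycetes for class, -ales for order, -aceae for family). The function name collapse_lineage and the "Unassigned" placeholder are my own illustrative choices, and anything that does not follow those endings would simply be left unassigned.

```python
# Heuristic sketch: collapse a long SILVA-style lineage to 7 ranks.
# Assumption: fungal names follow the standard nomenclatural suffixes.
# The function name and the "Unassigned" placeholder are hypothetical.

RANKS = ["Kingdom", "Phylum", "Class", "Order", "Family", "Genus", "Species"]

# Rank-specific endings used for fungal names.
SUFFIXES = [
    ("Phylum", "mycota"),
    ("Class", "mycetes"),
    ("Order", "ales"),
    ("Family", "aceae"),
]


def collapse_lineage(lineage, kingdom="Fungi"):
    """Return a dict with the 7 standard ranks for one lineage string."""
    names = [n.strip() for n in lineage.split(";") if n.strip()]
    result = {rank: "Unassigned" for rank in RANKS}
    if kingdom not in names:
        return result  # not a fungal lineage: leave everything unassigned

    result["Kingdom"] = kingdom
    below = names[names.index(kingdom) + 1:]

    last_ranked = -1  # index of the deepest name matched by a suffix
    for i, name in enumerate(below):
        for rank, suffix in SUFFIXES:
            if name.endswith(suffix) and result[rank] == "Unassigned":
                result[rank] = name
                last_ranked = i
                break

    # Names after the deepest suffix-matched rank are treated as genus/species.
    # If no suffix matched at all, genus and species stay unassigned.
    tail = below[last_ranked + 1:] if last_ranked >= 0 else []
    if tail:
        result["Genus"] = tail[0]
        last = tail[-1]
        if len(tail) > 1:
            if " " in last:            # already a full binomial
                result["Species"] = last
            elif last[0].islower():    # bare epithet: prepend the genus
                result["Species"] = f"{tail[0]} {last}"
    return result


example = (
    "Eukaryota;Amorphea;Obazoa;Opisthokonta;Nucletmycea;Fungi;Dikarya;"
    "Basidiomycota;Pucciniomycotina;Pucciniomycetes;Pucciniales;"
    "Pucciniaceae;Gymnosporangium;Gymnosporangium;ellisii"
)
collapsed = collapse_lineage(example)
print(";".join(collapsed[rank] for rank in RANKS))
# Fungi;Basidiomycota;Pucciniomycetes;Pucciniales;Pucciniaceae;Gymnosporangium;Gymnosporangium ellisii
```

I am unsure whether suffix matching like this is robust enough for AMF lineages, which is why I would prefer a curated database or an established tool if one exists.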
Any help, examples, or references would be greatly appreciated.
Thank you!
—
Salma
PhD Candidate, ANU
