I am not sure which table to use for generating the beta-diversity plot. I went through the qiime2-dada2 pipeline and generated two asv tables.
The first one is the table.qza file right after the dada2 step.
The second one is one of the tables I generated after I ran the " qiime taxa collapse" command. This the for loop code I used:
for i in {1..7}
do
qiime taxa collapse \
--i-table table.qza \
--i-taxonomy taxonomy.qza \
--p-level $i \
--o-collapsed-table table-L$i.qza
From my understanding, the first table doesn't have the OTU ID assigned yet.
I would recommend using the full table made by DADA2, as that includes all ASVs and no database bias.
Once you collapse by taxa level, you group ASVs together based on what the taxonomy the database can resolve, and add database bias: 100s of ASVs may get grouped into a single taxon like p__Bacillota, c__Bacillio,__Lactobacillales; f__? and all this variation at the ASV level is lost.
A summarized taxonomy table that only goes to order or family can be helpful for making bar plots, but I would use the original, maximum resolution ASVs for my beta-diversity calculations and PCoA plots.
P.S.
That table does have a feature ID (in the first column called "#OTU ID") and taxonomy if you have run a classifier, but the full ASV IDs are shown.
Outside the field of amplicons, we might more generally call ASVs / OTUs 'features'. This is the same as a feature abundance table or an abundance matrix in numerical ecology.
Hi Colin, yes familiar with the paper and why ASVs have replaced OTUs/ feature usage. I was curious as to why qiime2 output still says OTU ID when fed in ASV table input.