We will get table-dn-99.qza, and rep-seqs-dn-99.qza.
And my question is what is the difference between the table.qza and table-dn-99.qza, rep-seqs.qza and rep-seqs-dn-99.qza? What if I stop analysis after I finish the vsearch dereplicate-sequences, which means I use table.qza and rep-seqs.qza to do the taxonomy analysis?
And now I know, vsearch just does the classic OTU clustering, which means, I need to Clustering and followed by chimera filtering and aggressive OTU filtering. Just like the picture shows:
Looks like you figured out this one out by yourself! This was a good question about vsearch, so I thought I would summarize your finding for new users.
They are both tables of reads that are either 100% identical (and no gaps!) or 99% identical.
These are the sequences from the 100% and 99% tables.
You pose another great question:
You could! But vsearch clustering does not remove noisy reads the way dada2 or clustering does, so you would likely end up with 1000s extra, low abundance reads, that are just errors of real reads. But you could