Hello, everyone. While reading about vsearch clustering today, I came up with a question.
When we run:
qiime vsearch dereplicate-sequences \
--i-sequences seqs.qza \
--o-dereplicated-table table.qza \
--o-dereplicated-sequences rep-seqs.qza
We get table.qza and rep-seqs.qza. The tutorial then seems to ask us to do a subsequent clustering step.
That is, we need to cluster the FeatureTable[Frequency] and FeatureData[Sequence] artifacts.
After running this command (taking de novo clustering as an example):
qiime vsearch cluster-features-de-novo \
--i-table table.qza \
--i-sequences rep-seqs.qza \
--p-perc-identity 0.99 \
--o-clustered-table table-dn-99.qza \
--o-clustered-sequences rep-seqs-dn-99.qza
We get table-dn-99.qza and rep-seqs-dn-99.qza.
My question is: what is the difference between table.qza and table-dn-99.qza, and between rep-seqs.qza and rep-seqs-dn-99.qza? What if I stop after vsearch dereplicate-sequences, that is, use table.qza and rep-seqs.qza directly for the taxonomy analysis?
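To make the comparison concrete, here is a toy Python sketch of the two steps as they are generally described: dereplication collapses exactly identical reads, while de novo clustering merges similar sequences into a centroid. This is only an illustration under stated assumptions, not what vsearch actually does internally. The function names are hypothetical, identity is a simple position-by-position match on equal-length sequences rather than an alignment-based identity, and the threshold is 0.90 instead of 0.99 only because the toy reads are 10 bases long.

```python
from collections import Counter


def dereplicate(reads):
    """Collapse exactly identical reads into {sequence: count}."""
    return dict(Counter(reads))


def identity(a, b):
    """Toy identity: fraction of matching positions (equal lengths only).

    Assumption: a stand-in for alignment-based identity, for illustration.
    """
    if len(a) != len(b):
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cluster_de_novo(table, perc_identity):
    """Greedy centroid clustering, visiting the most abundant sequences first."""
    clustered = {}
    ordered = sorted(table.items(), key=lambda kv: (-kv[1], kv[0]))
    for seq, count in ordered:
        for centroid in clustered:
            if identity(seq, centroid) >= perc_identity:
                # Similar enough: fold this sequence's counts into the centroid.
                clustered[centroid] += count
                break
        else:
            # No centroid is close enough: this sequence starts a new cluster.
            clustered[seq] = count
    return clustered


# Three distinct sequences; the second differs from the first by one base.
reads = ["ACGTACGTAC"] * 5 + ["ACGTACGTAT"] * 2 + ["TTTTGGGGCC"] * 3

derep = dereplicate(reads)
clustered = cluster_de_novo(derep, perc_identity=0.90)

print(len(derep), "features after dereplication")
print(len(clustered), "features after clustering")
```

In this toy, the dereplicated table keeps every distinct sequence as its own feature, while the clustered table has fewer features because the near-identical sequence is merged into its centroid; the total read count is the same in both.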
Thank you for your reply!
Mavis
