I trained a classifier to classify V3-V4 region sequences of bacteria. However, there are two things I don't understand.
I ran the command "qiime feature-classifier classify-sklearn --i-classifier classifier_for_V3V4.qza --i-reads rep_seq.qza --o-classification greengenes_taxonomy.qza" and got the following results.
The results have three columns: Feature ID, Taxon and Confidence. My first question: the Taxon column contains assignments at different taxonomic levels, such as family, genus and species. Why are features assigned to different taxonomic levels? My second question: should I filter features based on their Confidence value? If so, what threshold should I use?
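
To make the second question concrete, here is a minimal sketch of the kind of filtering I have in mind. It assumes the taxonomy has been exported to a tab-separated table with the three columns named above. The sample rows, the helper names (`deepest_rank`, `filter_by_confidence`) and the 0.7 cutoff are placeholders for illustration only, not values from my data and not a recommended threshold.

```python
import csv
import io

# Hypothetical rows in the three-column layout (Feature ID, Taxon, Confidence).
# These are made-up examples, not real results.
SAMPLE_TSV = (
    "Feature ID\tTaxon\tConfidence\n"
    "feat1\tk__Bacteria; p__Firmicutes; c__Bacilli; o__Lactobacillales; "
    "f__Lactobacillaceae; g__Lactobacillus; s__\t0.98\n"
    "feat2\tk__Bacteria; p__Proteobacteria; c__Gammaproteobacteria; "
    "o__Enterobacteriales; f__Enterobacteriaceae\t0.81\n"
    "feat3\tk__Bacteria; p__Bacteroidetes\t0.55\n"
)

# Greengenes-style rank prefixes mapped to readable rank names.
RANK_NAMES = {
    "k": "kingdom", "p": "phylum", "c": "class", "o": "order",
    "f": "family", "g": "genus", "s": "species",
}


def deepest_rank(taxon):
    """Return the deepest rank in a taxon string that carries a non-empty label."""
    deepest = "unassigned"
    for part in taxon.split(";"):
        prefix, _, name = part.strip().partition("__")
        # An entry such as "s__" has a prefix but no label, so it is skipped.
        if name and prefix in RANK_NAMES:
            deepest = RANK_NAMES[prefix]
    return deepest


def filter_by_confidence(tsv_text, threshold):
    """Keep only rows whose Confidence is at least `threshold`."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return [row for row in reader if float(row["Confidence"]) >= threshold]


# 0.7 is an arbitrary placeholder cutoff for this sketch.
kept = filter_by_confidence(SAMPLE_TSV, threshold=0.7)
for row in kept:
    print(row["Feature ID"], deepest_rank(row["Taxon"]), row["Confidence"])
```

With these made-up rows, the loop reports the deepest labelled level for each feature that passes the cutoff, which are the two things I am asking about: why that level varies between features, and whether a cutoff like this is appropriate at all.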
