I am testing q2-sample-classifier on a subset of samples that includes control and pathogen-treated groups. All samples come from the same plant tissue, and the most important metadata parameter is pathogen treatment, so the taxonomic composition is quite similar across the tested samples; the major changes are a reduction in diversity and an increase in the frequencies of specific taxa. After running the commands from the q2-sample-classifier tutorial, I can see that the most important features are the same indicator taxa that I identified with DESeq2 in R.
So my question is: can I use these results as a confirmatory test for the indicator taxa I calculated?
N.B. The accuracy stats are as follows:
Accuracy ratio 1.44
Overall accuracy 0.76
control (AUC) 0.88
Pathogen group (AUC) 0.88
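
For context on these numbers, here is a small arithmetic sketch. It assumes that the accuracy ratio is the overall accuracy divided by a baseline accuracy (the accuracy obtained by always predicting the most frequent class); that definition is my assumption about how the ratio is reported, and the variable names are only illustrative.

```python
# Values reported above.
overall_accuracy = 0.76
accuracy_ratio = 1.44

# ASSUMPTION: accuracy_ratio = overall_accuracy / baseline_accuracy,
# where baseline_accuracy is the accuracy of always guessing the
# most frequent class.
implied_baseline = overall_accuracy / accuracy_ratio

print(f"Implied baseline accuracy: {implied_baseline:.3f}")  # -> 0.528
```

Under that assumption the two groups are close to balanced, and the classifier is about 44% more accurate than always guessing the larger group.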
One more question: for --p-n-estimators, I use roughly half of the total sample count, i.e. if I have 99 samples, I use 50. Is that a reasonable choice?
Also, how much can the value passed to --p-random-state influence the results?
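
To make the last two questions concrete, here is a minimal sketch of how the sensitivity to both settings could be checked. It uses scikit-learn's random forest directly rather than the plugin, on synthetic data standing in for a 99-sample feature table. The data, the seed list, the forest sizes and the helper `mean_cv_accuracy` are all hypothetical choices for illustration, and the plugin's own train/test splitting may differ from the 5-fold cross-validation used here.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Synthetic stand-in for a feature table: 99 samples, 2 classes (NOT real data).
X, y = make_classification(
    n_samples=99, n_features=40, n_informative=5, random_state=0
)


def mean_cv_accuracy(n_estimators, seed):
    """Mean 5-fold cross-validated accuracy for one forest size and one seed."""
    model = RandomForestClassifier(n_estimators=n_estimators, random_state=seed)
    folds = StratifiedKFold(n_splits=5, shuffle=True, random_state=seed)
    return cross_val_score(model, X, y, cv=folds, scoring="accuracy").mean()


seeds = [1, 2, 3]  # hypothetical seeds to compare
results = {n: [mean_cv_accuracy(n, s) for s in seeds] for n in (50, 500)}

# Report how much the accuracy moves when only the seed changes.
for n, scores in results.items():
    spread = max(scores) - min(scores)
    print(f"n_estimators={n}: min={min(scores):.3f} "
          f"max={max(scores):.3f} spread={spread:.3f}")
```

If the spread across seeds is large compared with the differences I care about, the result depends noticeably on the random state, and repeating the run with several seeds (or more trees) would be one way to see how stable the important features are.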