I think this question is great! There's a lot to learn about QIIME 2 and it takes a while to get used to it. If you have more questions after trying that tutorial, I'm always happy to help.
options. Even so, I trained two classifiers, one with a minimum length of 30 bp and the other with both the minimum and maximum length options, and compared the results. There were no differences between the results.
I am just wondering whether the minimum and maximum length options, as well as the truncate option, are an important requirement for the analysis. The samples I'm using are Illumina 2x300 paired-end reads of the V3 to V4 region of the 16S rRNA gene.
What questions did you have about these settings? We are always looking to make the documentation more helpful.
Comparing methods is always a good idea! I'm glad the results are consistent. But this is a little surprising because, just like you said, the read lengths you use for classification should impact performance, with longer reads usually being better.
Let's see if some of the QIIME 2 devs have any advice!
No, these are not critical parameters. What these do, and what effect they may have on results, is described in the tutorial that @colinbrislawn directed you to.
This is not surprising: these settings are not really changing the lengths of most (or all?) of the sequences that you extracted. The min/max lengths just filter out aberrant sequences that are probably mismatched hits. These occur with some databases but are not a common occurrence. The min/max length parameters are only used as a safety catch to avoid issues with misprimed hits.
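To illustrate the "safety catch" idea, here is a minimal Python sketch. It is not the plugin's actual implementation: the function name `filter_by_length`, the example sequences, and the convention that 0 disables a bound are all assumptions made for illustration. It shows why two different min/max settings can give the same set of reads when only an obviously aberrant hit falls outside the bounds.

```python
def filter_by_length(seqs, min_length=0, max_length=0):
    """Keep sequences whose length lies within the given bounds.

    A bound of 0 is treated as "disabled" (an assumption of this sketch).
    """
    kept = {}
    for seq_id, seq in seqs.items():
        if min_length and len(seq) < min_length:
            continue  # too short, e.g. a misprimed hit
        if max_length and len(seq) > max_length:
            continue  # too long, e.g. a mismatched hit
        kept[seq_id] = seq
    return kept


# Hypothetical extracted amplicons: three plausible V3-V4 lengths
# and one very short, misprimed hit.
amplicons = {
    "ref1": "A" * 420,
    "ref2": "C" * 440,
    "ref3": "G" * 465,
    "misprimed": "T" * 12,
}

loose = filter_by_length(amplicons, min_length=30)
strict = filter_by_length(amplicons, min_length=30, max_length=600)

# Both settings drop only the aberrant hit, so the kept sets match.
print(sorted(loose) == sorted(strict))
```

In this toy case the maximum length never removes anything, so a classifier trained on either set of reads would see the same reference sequences.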
@colinbrislawn I was not able to understand whether it is necessary to input values for:
--p-trunc-len
--p-min-length
--p-max-length
But as mentioned by Nicholas_Bokulich, they serve as safety measures when analysing the data. I think I finally understand the underlying reasons for these options. Later I will try varying the truncate, min-length and max-length values to further understand the differences.
I would also like to add a suggestion regarding the documentation: if possible, please add more explanation of why values should or should not be set for
--p-trunc-len
--p-min-length
--p-max-length
I am confident that the explanations in the tutorial were enough and very helpful. But more explanation on this part would be a big help, especially to a newbie like me and to others who are starting to do microbiome analysis.