I want to perform functional annotation using Picrust2 plugin.
Blockquote
First of all I want to train my classifier for V3-V4 region with 99% identity for latest release of greengenes-2022
Blockquote. I tried to check files from Index of /greengenes_release/2022.10 new release of GG-2022 but confused that which file is of my use!!!
Also, I am facing problem to use my QIIME2 outputs as a input file file for PICRUSt2. Please help me out for this, so that i can start my further data processing and analysis.
I think what you would want to do is use q2-feature-classifier to extract-reads based on your primers from the Greengenes2 backbone sequences, and then train a Naive Bayes classifier on the result
The following method should help point you in the right direction. First take the sequence files and trim them based on your primers (obviously I've just added a random sequence here!). You can also add truncation and min/max lengths of sequences based on your experimental design, for example:
Thank you so much for this example! I used this for my samples, but I was wondering how you know the % identity. Does this code default to 99% identity?
I read elsewhere in the forum that you can specify the % identity, but it was using a different code?
Any clarification would be greatly appreciated. Thank you!
Thanks so much for the explanation! So the "act of selecting an internal region" is like specifying the primers for the V3-V4 region, for example?
Also, after I commented I was looking at my taxonomy output file and the "Confidence" column ranges from 0.72-0.99 so is that in some way connected to the % identity? I ran a different code (qiime greengenes2 non-v4-16s) which has 99% identity and my "Confidence" column was all 1.0 so I was wondering if that's connected to the % identity.