Thanks a lot for your answer
I also conduct the analysis for my 16s amplicon (V1-V9) data using the same procedure
With minor adjustment to suit the 16s data
- Primer: 27F & 1492R
- fastp = * Length required = 200 & Maximum length = 800
MultiQC in Galaxy before quality control
MultiQC in Galaxy after quality control
Here is the taxa bar plot
I use the same SILVA database and closed reference at 90%
So here are my questions:
- I assume that I can apply the same method of BLAST and filtering data to eliminate the "d__Bacteria;__"?
- All of the samples are soil rhizosphere sample, is it normal have almost 90% of d__Bacteria;p__Proteobacteria? I look at some other literatures that using the same kind of sample, it have a huge portion of proteobacteria, but as high as mine. Somehow I am not very confident with my data.
- As I have a huge numbers of sequence to be BLAST manually, is there any way to run BLAST for all the "d__Bacteria;__" in one time?
Thanks a lot