My data are 16S amplicons (515F-806R primers), sequenced on Mi-Seq, processed by q2 with dada2 for QC, taxonomy by Greengenes 99%OTU database. Unfortunately, many amplicons are of host genomic DNA, according to NCBI Blast searches! These were assigned to the (level-7) taxon:
k__Bacteria;;;;;;
I am trying to use the feature-table filter to remove these based on taxonomy, but I cannot figure out what to enter for the --p-exclude parameter. I have tried:
–p-exclude “k__Bacteria;;;;;;”
yet, the filter fails to remove sequences with that taxonomic assignment. I think the issue is the syntax I am using, but I am not sure of the error I am making. I bet this is easy to solve, I just don’t know how… Any help would be much appreciated.