Since QIIME uses Greene genes data base, if I come up with some detected species and would like to browse them in NCBI finder, how do I find for example Klebsiella 297311 (read by greengenes) into searchable full genome sequence on NCBI?
It is by no means restricted to Greengenes — we provide pre-trained classifiers for Greengenes and SILVA on our website, but other commonly used databases that are compatible more or less “off the shelf” include GTDB, UNITE, and any other database that uses compatible file formats (FASTA and tab-delimited taxonomy files)
But it sounds like you have already processed data that was classified (or clustered) using Greengenes, and now want to map to NCBI sequences… you could use q2-feature-classifier to perform this mapping, e.g., to BLAST to find the top hit. Alternatively, just copy and paste the sequence that you want into NCBI BLAST to find the closest match…
Thanks Nicholas, yes I have already used greengenes, and we made a heat map with species of interests accompanied by numbers. Now for many who wish to identify those in NCBI from our published resource it wouldn’t be possible to look bck on the sequence. Any way they could decipher greenegenes species entry into NCBI without having sequence?
yes, based on the example you gave above it sounds like you used closed-reference OTU clustering against the greengenes database, so you could grab the greengenes representative sequence that maps to that greengenes ID. Then BLAST that to find the closest match in NCBI RefSeqs.