Import RDP database to .qza file to use it with feature-classifier

Hi!

I want to import the RDP database into a .qza file so that I can use it with feature-classifier classify-sklearn. When you download the Greengenes or SILVA database, you get two separate files, one with sequences and one with taxonomy, which you can import into .qza files and then use to train a classifier with fit-classifier-naive-bayes. I would like to do the same with the RDP database.

I’ve downloaded the RDP database from here, specifically the unaligned Bacteria 16S FASTA file. It is a single .txt file that contains both the sequences and their taxonomic assignments. I've attached an extract from this file.

exampleRDPdatabase.txt (2.3 KB)

How can I build a classifier.qza from this database?
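
In case it helps frame the question, here is a minimal sketch of the split I have in mind, so that the single file becomes a sequences file and a taxonomy file like the Greengenes and SILVA downloads. It rests on assumptions: that each header looks like `>ID description<TAB>Lineage=Root;rootrank;Bacteria;domain;...`, with names and ranks alternating, and that the function names `split_rdp_fasta` and `to_fasta_and_tsv` are just placeholders I made up. If the real headers differ, the parsing would need to change.

```python
def split_rdp_fasta(lines):
    """Yield (seq_id, taxonomy, sequence) tuples from RDP-style FASTA lines.

    Assumed header layout (not verified against the full database):
    >ID free text<TAB>Lineage=Root;rootrank;Bacteria;domain;...
    """
    seq_id, taxonomy, chunks = None, None, []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            # Emit the previous record before starting a new one.
            if seq_id is not None:
                yield seq_id, taxonomy, "".join(chunks).upper()
            header, _, lineage = line[1:].partition("Lineage=")
            seq_id = header.split()[0]
            # The lineage is assumed to alternate name;rank;name;rank...
            fields = [f.strip().strip('"') for f in lineage.split(";") if f.strip()]
            names = [
                name
                for name, rank in zip(fields[0::2], fields[1::2])
                if rank != "rootrank"  # drop the artificial "Root" level
            ]
            taxonomy = "; ".join(names) if names else "Unassigned"
            chunks = []
        elif line.strip():
            chunks.append(line.strip())
    if seq_id is not None:
        # Sequences are uppercased as a precaution before import.
        yield seq_id, taxonomy, "".join(chunks).upper()


def to_fasta_and_tsv(records):
    """Return (fasta_text, taxonomy_tsv_text) for the given records."""
    fasta_lines, tsv_lines = [], []
    for seq_id, taxonomy, sequence in records:
        fasta_lines.append(f">{seq_id}\n{sequence}")
        tsv_lines.append(f"{seq_id}\t{taxonomy}")
    return "\n".join(fasta_lines) + "\n", "\n".join(tsv_lines) + "\n"
```

My guess is that the two resulting texts, once written to disk, could be imported as FeatureData[Sequence] and FeatureData[Taxonomy] (the latter as a headerless TSV) and then passed to fit-classifier-naive-bayes, but I have not confirmed that this works for RDP.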

Finally, I have another question: are there big differences between the RDP database and Greengenes?

Thanks in advance,

SLa
