classification of non-overlapping reads treated with justConcatenate

Wow, this is great, @lukasbeule. Thank you for sharing this!

I agree that the initial results you've obtained with sklearn are what we expected. I hope the N length you have chosen works reasonably well across most taxa. However, as per:

I've discussed this previously with some of the other forum moderators, and I forgot to clarify in my reply to you that vsearch may only work if the N region is a reasonable length (otherwise the global alignment would score really badly).

One minor suggestion: we have updated SILVA 138 classifiers available on the Data Resources page. Both the pre-made classifiers and the files used to make them are there. If you'd like to curate your own SILVA database, you can give RESCRIPt a try:

-Cheers!
-Mike
