Invalid character in sequence: b'U'

Nicholas_Bokulich · January 14, 2021, 6:58am

Hi @LiyingXie,
This error occurs because your sequences are RNA, not DNA: the 'U' (uracil) in the error message is not a valid character in a DNA sequence. It looks like you are trying to import and use the raw SILVA sequences, so you are importing them as the wrong data type.
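
As a rough illustration of what converting such sequences to DNA involves, here is a minimal Python sketch that replaces U with T on the sequence lines of FASTA-formatted text. It is not the RESCRIPt workflow and not how QIIME 2 handles this internally; the function name `rna_to_dna_fasta` and the example records are hypothetical, made up for this illustration.

```python
def rna_to_dna_fasta(lines):
    """Yield FASTA lines with U/u replaced by T/t on sequence lines only.

    Hypothetical helper for illustration; header lines (starting with '>')
    are passed through unchanged so that IDs and taxonomy strings are kept.
    """
    table = str.maketrans("Uu", "Tt")
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            yield line  # header: leave untouched
        else:
            yield line.translate(table)  # sequence: U -> T, u -> t


# Made-up example records, not real SILVA entries.
example = [">seq1 Bacteria;Proteobacteria", "ACGUUGCA", "uuac"]
converted = list(rna_to_dna_fasta(example))
print("\n".join(converted))
```

For a real SILVA download, the tutorial recommended in this post is the safer route, since it also takes care of taxonomy formatting.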

As you are working with SILVA, I recommend using this tutorial for the RESCRIPt plugin, which will make downloading, formatting, and using SILVA much easier:

The outputs of this tutorial — RESCRIPt-formatted SILVA sequences, taxonomy, and taxonomy classifiers — are also available here, which will save you a lot of time:

Good luck!