About OTU clustering of fasta files

Hi @sooni ,

The error indicates that the file contains invalid lowercase characters. You could convert these to uppercase to continue, but there could be other issues with the file if you are trying to use data that have already been processed.
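
If you do want to try the uppercase conversion first, here is a minimal Python sketch of one way to do it. It assumes a plain-text FASTA file where header lines start with `>`; the function names and the file names in the usage comment are hypothetical, not part of QIIME 2. Header lines are left untouched, and only the sequence lines are uppercased.

```python
def uppercase_fasta_lines(lines):
    """Yield FASTA lines with sequence lines uppercased; headers are unchanged."""
    for line in lines:
        # Header lines start with ">" and may legitimately contain lowercase text.
        yield line if line.startswith(">") else line.upper()


def uppercase_fasta(in_path, out_path):
    """Write an uppercased copy of the FASTA file at in_path to out_path."""
    with open(in_path) as src, open(out_path, "w") as dst:
        dst.writelines(uppercase_fasta_lines(src))


# Hypothetical usage (file names are placeholders):
# uppercase_fasta("sequences.fasta", "sequences_upper.fasta")
```

This writes a new file rather than editing in place, so the original stays available if something else turns out to be wrong with it.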

I would recommend starting with the raw data. The raw data for this study appear to be deposited on SRA:
https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA683584

so you could download and automatically format the data from there using the QIIME 2 plugin q2-fondue.
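
As a rough sketch of the first step: as far as I know, q2-fondue takes its accession IDs from a small TSV file with a single `id` column, which is then imported as an `NCBIAccessionIDs` artifact before fetching the data. Please treat that format as an assumption and check the q2-fondue documentation for the exact requirements and commands. The BioProject ID comes from the link above; the function name and output file name are hypothetical.

```python
def write_accession_ids(ids, out_path):
    """Write accession IDs to a one-column TSV with an 'id' header.

    Assumption: this is the input layout q2-fondue expects; verify against its docs.
    """
    with open(out_path, "w") as handle:
        handle.write("id\n")
        for accession in ids:
            handle.write(f"{accession}\n")


# Hypothetical usage, with the BioProject ID from the SRA link:
# write_accession_ids(["PRJNA683584"], "accession_ids.tsv")
```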

This might be an easier approach, as you could then also follow the QIIME 2 tutorials from the start instead of figuring out the entry point for starting with FASTA data.

Good luck!