Hello everyone,
I am working on homologous amino acid sequences of a functional protein. I would like to analyze my data in amino acid level instead of nucleotide level. I know that QIIME2 does not take amino acid sequences but it can take biomtable and tree built outside of QIIME and run analysis: Import amino acid sequences?
I clustered my sequences using Ucluster fast and I got a uc file and a centroid Fasta file. Uc file was transformed to a biom file using uc2biom from the website: The Biological Observation Matrix (BIOM) format — biom-format.org. Sequences in the centroid file were used as OTUs. Phylogenetic tree was built using the OTUs by FastTree. The biomtable and phylogenetic tree were converted into QIIME2 artifacts successfully. However, building core matrix was failed and the error showed that some sequence names in biomtable are missing in the tree. I checked both of my biomtable and tree file and found that they should match with each other. I also could successfully import my OTU table (converted from the biomtable) and the phylogenetic tree into phyloseq in R. I wonder if it is a file format problem? What can I do to import my data into QIIME2? It would be great if I can use QIIME2 to do some downstream analysis.