I could not find any place (in the "Database Preparation" or "Sequences Reconstruction" parts) inside the documentation that indicates how I can obtain the database/sidle-db-aligned-sequences.qza file. Could you, please, help me?
The sidle-db-aligned-sequences.qza should be in the database folder you downloaded; please double check with ls. If it's not, then we can troubleshoot from there. The aligned sequence file is from the database and so it's something that gets provided during that step.
As always, it depends on the database. If you're working with greengenes 13_8, there should be a directory called rep_set_aligned which will have a list of files showing a percent identity; pick the one that matches your identity (probably 99). If you're using Silva 128, I think it's in the downloaded QIIME 2 folder. (Remember that tree building only works with greengenes 13_8 and Silva 128 right now. At somepoint we'll hopefully have more SEPP references, but this is where things are right now.
I think my question is also useful in this post, because it is related to another database. So that is the reason why I am posting this here and do not in a new post.
I would love to reproduce the last step of sidle documentation's Phylogenetic Tree Reconstruction. However I could not find the rep_set_aligned folder in NCBI 16S database or neither the SEPP references file. That’s the crux of the matter.
I have found some post as:
But didn't find a solution.
Maybe I need to merge-seqs? What approximation could I use?
As I think is documented in the Sidle docs (and if not I will add it): tree reconstruction only works for databases that have SEPP reference trees. This means you can reconstruct using Silva 128 or Greengenes 13_8 if you want a tree. If you'd like to build the backbone for fragment insertion with another database or database version, you're able to do that. I don't think there's a clean solution at the moment; the last thread i can find on building is from last year.