I have selected this pre-built tree to try & get further in my analysis, but I cannot proceed. I feel like I’m missing something obvious, but I cannot figure out where I’m going wrong. I am having difficulty getting back in the groove of things after recovering from “the crud”, so I’m probably overlooking something simple.
I just tried importing it and I get the green light from QIIME 2. I'm using QIIME 2019.1. I just wanted to make sure that file specifically works, but please follow up with Matt's questions if it's still not working for you. -Jennifer
I am using qiime2-2019.1. The error message I received was in my OP. That was all it said. I am about to head to a meeting, then home, but I will respond with the exact code first thing tomorrow morning.
I’m sure I just overlooked something. It’s good to know @Jennifer_Fouquier got it to go through. It’s most likely something silly on my end.
Unfortunately, I don’t have access to that at the moment. I am using the Mac in the lab & I didn’t have time to save my editor to Google Drive before I had to leave. I’ll look it over tomorrow & post the full command in the morning when I get back to the lab.
Sorry for the delayed response, but I got it to work. I’m not sure what I did differently from yesterday, but it converted from .nwk to .qza today. Maybe the Mac was just having a case of the Mondays or something.
Either way, thank you so much for the quick responses & the help!! Much appreciated!
Could you please provide the ghost_tree.nwk built from the latest version of UNITE (ver8 from 02.02.2019)? I was not able to install q2-ghost-tree successfully, so I can't build the trees myself.
I am interested in the dynamic version.
Yes, I will work on that early next week. Please note that there is currently an issue with using these ghost trees in QIIME 2 that may or may not be (TBD) related to the underscores in the accession IDs of recent UNITE databases (previous UNITE versions, which I used for testing, did not have these characters). I'm not entirely sure, but it's the only difference I can see that has arisen with the new UNITE files, and it was not something I would have expected to be an issue. You can follow here: Ghost tree filtering? - #8 by Jennifer_Fouquier. Thanks!
I forgot to update this thread as well. I had a family emergency and am behind. Sorry everyone! cc @helenaax2r
I do have time to work on making new trees today and tomorrow if you would like to do other things in the meantime. It appears that there are quite a few questions related to ghost-tree at the moment, so I need to make sure things are running smoothly in QIIME 2, which likely involves clarifying things in the tutorial.
ghost-tree was developed before the ASV process, which is why the OTU picking step exists. It also has a benefit, though: it allows you to regroup your ASVs at a lower percent identity threshold. Because ghost-tree partially works using nomenclature, this lets some of the "unidentified" organisms cluster with organisms that are "identified" taxonomically, in the hope that you keep some of their phylogenetic information rather than discarding them.
Example from the tutorial:
Group your rep seqs at 90% similarity.
This handles the abundant ‘unidentified’ organisms.
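To make the regrouping idea concrete, here is a rough toy sketch in Python. It is not what the tutorial or ghost-tree actually runs (real OTU picking uses alignment-based identity); it assumes equal-length sequences and a simple position-by-position identity, and the names `identity`, `greedy_cluster`, and the `asv*` IDs are made up for illustration.

```python
from typing import Dict, List


def identity(a: str, b: str) -> float:
    """Fraction of matching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("toy identity only supports equal-length sequences")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_cluster(seqs: Dict[str, str], threshold: float = 0.90) -> Dict[str, List[str]]:
    """Assign each sequence to the first centroid it matches at >= threshold."""
    clusters: Dict[str, List[str]] = {}  # centroid ID -> member IDs
    for seq_id, seq in seqs.items():
        for centroid in clusters:
            if identity(seq, seqs[centroid]) >= threshold:
                clusters[centroid].append(seq_id)
                break
        else:
            # No centroid was close enough, so this sequence starts a new cluster.
            clusters[seq_id] = [seq_id]
    return clusters


# Hypothetical rep seqs: asv2 differs from asv1 at one position (95% identical),
# asv3 is unrelated.
rep_seqs = {
    "asv1": "ACGTACGTACGTACGTACGT",
    "asv2": "ACGTACGTACGTACGTACGA",
    "asv3": "TTTTTTTTTTGGGGGGGGGG",
}
clusters = greedy_cluster(rep_seqs, threshold=0.90)
print(clusters)
```

At a 90% threshold, `asv1` and `asv2` end up in the same group while `asv3` stays on its own, which is the effect you want for near-identical "unidentified" and "identified" sequences.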
Still working on the trees. Will take me a few more days to get to the bottom of an issue that was recently identified.
Have you seen this tutorial? There are small files you can use as a demo, and it also mentions the "full size files"
I agree, the tutorial could use more description of what the files are, and not just the commands used. Thanks for bringing this up. I need to expand some of the documentation.
The paper might also help clarify things:
The foundation tree is built from an alignment database for an evolutionarily conserved marker gene (like 18S in the fungal example) that can be aligned across distant taxonomic lineages. It's the tree that you graft your extension sequences onto. Sequences such as ITS are good for taxonomic identification and align well among closely related organisms, but they make terrible alignments across all lineages, so you don't want to build a tree from all of your ITS seqs alone. Instead, ghost-tree grafts them onto the foundation to build a hybrid-gene tree that has the benefits of both the ITS and 18S gene regions. Please note that the foundation can be either a tree or an alignment, but you have to choose the right subcommand in ghost-tree.
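If it helps to picture the grafting, here is a very simplified Python sketch. It is not ghost-tree's implementation; it only swaps foundation tips for small extension subtrees at the Newick-string level. The genus names, the `SH*` tip IDs, the branch lengths, and the `graft` function are all hypothetical.

```python
import re
from typing import Dict

# Unquoted tip labels directly follow "(" or "," in a Newick string.
_TIP = re.compile(r"(?<=[(,])([^()',:;\s]+)")


def graft(foundation_newick: str, extension_subtrees: Dict[str, str]) -> str:
    """Replace each foundation tip that has an extension subtree with that subtree."""
    def swap(match):
        name = match.group(1)
        # Tips without an extension subtree are left untouched in this toy version.
        return extension_subtrees.get(name, name)
    return _TIP.sub(swap, foundation_newick)


# Toy foundation tree (think 18S) with one tip per genus.
foundation = "((Candida:0.2,Saccharomyces:0.3):0.1,Mucor:0.5);"

# Toy extension subtrees (think ITS), one per genus, built from closely related seqs.
extensions = {
    "Candida": "(SH001:0.01,SH002:0.02)",
    "Mucor": "(SH010:0.03,SH011:0.01)",
}

hybrid = graft(foundation, extensions)
print(hybrid)
# (((SH001:0.01,SH002:0.02):0.2,Saccharomyces:0.3):0.1,(SH010:0.03,SH011:0.01):0.5);
```

The deep branching comes from the foundation and the fine-scale branching comes from the extension subtrees, which is the general idea behind the hybrid-gene tree.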
Silva 18S is a good foundation tree for fungal ITS. For other organisms, I'm not sure. I'm guessing there are more applications but ghost-tree was designed with fungal diversity analysis in mind.
Here are the most recent UNITE database ghost trees.
There is an issue in ghost-tree that I will fix soon and you can find the details here if you are interested. Basically, the pre-built trees are good to go and formatted correctly, but the software needs to be modified so it adds quotes to the IDs in the Newick to allow underscores that are in the new UNITE accession IDs. I will do that next and improve docs.
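For anyone curious what "adding quotes" means here: in Newick, an underscore in an unquoted label is read as a space, while a single-quoted label keeps its underscores. Below is a minimal Python sketch of quoting tip labels. It is not the actual ghost-tree fix, the `quote_tip_labels` function and the IDs are made up, and it assumes any labels that are already quoted contain no commas or parentheses.

```python
import re

# Unquoted tip labels directly follow "(" or "," and end at ":", ",", ")" or ";".
_UNQUOTED_TIP = re.compile(r"(?<=[(,])([^()',:;\s]+)")


def quote_tip_labels(newick: str) -> str:
    """Wrap every unquoted tip label in single quotes so underscores are kept."""
    return _UNQUOTED_TIP.sub(lambda m: "'" + m.group(1) + "'", newick)


# Hypothetical IDs with underscores, like those in the newer UNITE releases.
tree = "((ID_1:0.1,ID_2:0.2):0.3,ID3:0.4);"
quoted = quote_tip_labels(tree)
print(quoted)
# (('ID_1':0.1,'ID_2':0.2):0.3,'ID3':0.4);
```

Labels that are already quoted are left alone, so running it twice gives the same result.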
If you have any questions, please let me know, especially while documentation is sparse!