Tree visualization

Lu_Yang · November 16, 2017, 3:23am

Hi,
I got the rooted.tree from qiime2, but it is a qza file. May I know how to change it into a qzv file, which can be visualized. OR change the qza file into a fasta file, which I can know the reads and draw a tree? Because all the qza file is not readable.
Thanks so much.

thermokarst · November 17, 2017, 2:56pm

Hi @Lu_Yang! Unfortunately we don't have any tree-viewing visualizations at this moment, but you can export your tree artifact and load that file up into the viewer of your choice! Let us know how it goes for you! Thanks!

Lu_Yang · November 17, 2017, 3:26pm

Hi, @thermokarst,
Thanks for your reply. Got that! Looking forward to qiime2's visualization of the tree. Haha ~:grin:

zejunyan · December 1, 2017, 8:29pm

Hi, thermokarst

I understand that Qimme2 cannot visualize rooted-tree.qza. So, I exported the rooted-tree.qza into a tree.nwk, which can be visualised as a tree. Obviously, this tree is built based on the rep sequences. So, instead of my Sample ID, hashed code for each rep sequence is present on the node of the tree.

My question is that if there is a file that contains information about all the matches between the hash codes and rep sequences? If so, where it can be found?

Bests

Dr yan

Lu_Yang · December 1, 2017, 10:31pm

Hi, @zejunyan,

I had the same problem as you before. Then I found a way out. I visualize the tree on this website. https://itol.embl.de/
I downloaded the taxonomy file, and save it as a txt file. And input the taxonomy file into the tree based on the module they provide.
Hope this can help you.
Good luck!

zejunyan · December 1, 2017, 10:58pm

Hi, @Lu_Yang

Thank you for your reply. I tried your idea, it worked to give me a tree. But, the nodes are stilled labelled with hash code, but not my sample ID. So, I cannot read out useful info from this tree. Do you know how the nodes in the tree can be labelled with my Sample IDs?

Thanks

Dr Yan

Lu_Yang · December 1, 2017, 11:07pm

Hi, @zejunyan,

I am so sorry that I haven't tried the sample ID label. I just tried the taxonomy table. Because for my understanding what the harsh codes you mentioned are the feature ID, each feature ID can only be represented by the taxonomy.
I think the sample ID labels you mentioned are another visualization method. But I am sorry that I have no idea about that kind of visualization. If you figure out, pls let me know.
Good luck.

zejunyan · December 1, 2017, 11:23pm

Hi, @Lu_Yang

Thank you for your quick response.

I think if we know how ro match each feature ID (i.e. the hash code) to its corresponding sampleID, we can just replace the feature ID with the sample ID, after which, a tree can be drawn with sample ID labelled. My problem is that how to find the match between feature ID and sampleID? Is there a file containing this info?

Cheers

Dr yan

Lu_Yang · December 4, 2017, 3:28am

Hi, @zejunyan,

I am sorry that I just use one sample to map the tree. Not all samples. Let's wait for @thermokarst reply.
Good luck!

thermokarst · December 4, 2017, 3:44pm

You can tabulate-seqs to determine which sequence belongs to which Feature ID.

Perhaps you mean "Feature ID" instead of "Sample ID"? The sequences used to generate this tree are per-feature, and features can show up in many different samples.

There is an option when running q2-deblur and q2-dada2 to disable feature-hashing: --p-no-hashed-feature-ids, but this would require you to re-run your denoising/quality-control step. Would that work for you?

zejunyan · December 4, 2017, 4:41pm

@thermokarst

Thank you for you reply.

What I mean is that how I can match each feature ID with my SampleID, but not with my sequences.

Cheers

Dr yan

zejunyan · December 4, 2017, 4:42pm

I have tried --p-no-hashed-feature-ids, it gave me the corresponding sequence of each feature ID, but not my sample ID

zejunyan · December 4, 2017, 4:43pm

@thermokarst
I have tried --p-no-hashed-feature-ids, it gave me the corresponding sequence of each feature ID, but not my sample ID

Jaroslaw_Grzadziel · December 5, 2017, 4:02am

There is any option to create a tree including either NCBI IDs or taxa names ? Hash code isn't very informative for further analyis. If so, can anyone give a clue ?
Thanks

thermokarst · December 6, 2017, 9:39pm

Hi @zejunyan, the relationship between features and samples are what are represented in the feature table, not the aligned sequences (which are used to create a phylogenetic tree). We don't have any functionality in QIIME 2 (that I can think of) that would allow you to relabel your features with arbitrary labels (in your case, you want to map them to samples, I think). Can you please confirm this, and let me know if we are on the same page? If so, I will create an issue to put this kind of relabeling on our radar. Thanks!

thermokarst · December 6, 2017, 9:40pm

@Jaroslaw_Grzadziel, can you please review my response here, and provide confirmation as well? Thanks!

Jaroslaw_Grzadziel · December 7, 2017, 8:18am

I mean that instead of hash ID it would be great to assing taxonomic ID (for example taxon name or somehow NCBI ID, greengenes ID etc.)

The example below is the tree in newick format generated in q2, the visualization (here in https://itol.embl.de/) is great but without taxonomic ID's is almos useless.

zejunyan · December 7, 2017, 11:02am

@thermokarst

Thank you for your reply.

Yes, I would like to draw the phylogenetic tree (UPGMA) labelled with my Sample IDs. This can be done in Qiime 1 through pick_de_novo_otus.py workfolw. However, this pipeline is very computationally costing when dealing with a large input.

Bests

Dr yan

zejunyan · December 7, 2017, 4:12pm

@thermokarst

Is there a file that contain information about which sample feature IDs belong to during Qiime2 analysis?
Because I can find the masked-aligned-rep-seqs.fasta file, if I can have another file storing information about which sample the feature IDs in this masked-aligned-rep-seqs.fasta file belong to, i should be able to draw a tree based on sample_IDs.

Bests

Dr yan