After running I created a biom file, and I converted to tsv file to run the krona chart. On the chart D_0__ , D_1__, D_2__, D_3__ and the other level appeared (“D_0__Bacteria;D_1__Cyanobacteria;D_2__Oxyphotobacteria;D_3__Synechococcales;D_4__Cyanobiaceae;D_5__Synechococcus”). I want to get rid of these D_ in front of the organisms but I don’t know tow to do it. If you help me I’ll be glad.
Redo taxonomy option
One option is to redo the taxonomy assignment using these prototype SILVA 138 classifiers:
Note, you may have to retain the classifiers yourself, with the raw files provided.
There will be more news on silva taxonomy soon. So, keep your peeled!
Find / replace option
Otherwise you can make use of any text editor that can make use of regular expressions, in order to find
D_[0-9][0-9]*__ and replace with
'' / nothing, or whatever.
Alternatively, if you do not mind the unix / linux command line, you can use
sed as follows:
sed 's/D_[0-9][0-9]*__//g' < input_taxonomy.txt > output_fixed_taxonomy.txt
I am sure others will double-check my sed command. But that should work.
I just wanted to refer the folks in this thread over to the tutorial linked below.
This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.