I would like to figure out which of my original samples those feature IDs came from. Can someone PLEASE step me through how I assign those IDs back to the original samples/metadata they came from. I don't understand the biom tutorial and how you link the metadata back to this tsv. I just want to know which of my samples these reads came from. Sidenote, why is it not possible to export with the metadata attached to each read?
Hey there,
Not sure if I well understood your question, but have a look in my answer here to find if it helps you.
You'll end up with a table with counts per sample for each feature and those counts represent the number of reads in that sample that are equal to the feature.
Cheers,
Thank you for getting back to me. I realized I am actually quite confused, everyone only ever shows the top-most of the table, but in fact if I just grep the ID there is a second entry that has the assigned sample the read is from.
To clarify: the second column is not the sample that the read is from - this is the taxonomic identifier that was found to match the sequence defined by the Feature ID in the first column.
There is no information in this FeatureData[Taxonomy] regarding the Sample Metadata - to do any cross-referencing with your samples you will need to use the FeatureTable[Frequency | etc] - the feature table is a 2D data type, one axis is the features in the analysis, while the other axis are the samples in the analysis (this is essentially a contingency matrix).