some confusion about the feature table of qiime2

jwdebelius · February 18, 2020, 9:41am

Let's take a step back. A "feature table" is a tabular representation of your data where we map samples to some kind of feature (ASVs, OTUs, species, metabolites, genes, cytokines, cities visited, nutrients... if you can represent it as a feature table, you can represent it as a feature table!). In QIIME (and microbiome bioinformatics specifically), the feature table typically looks something like this under the hood:


5	0	500
100	200	50
0	5000	30
0	0	10

Due to legacy reasons, we typically represent the features (here , , , ) as the rows and the samples (, , ) are the columns. We can also represent the data as relative abundance, rather than counts. But... those are details.

They primary ways to generate a feature table in QIIME 2 from amplicon data dada2, deblur, and clustering via vsearch to make OTUs (I recommend the tutorial here for more details on OTU clustering.) Essentially, though, after you take the dereplicated sequences from vsearch, you then cluster those to generate a table like the one above.

You can learn more about all the different kinds of features here:

A 100% OTU (also zOTU or sOTU [zeroOTU or subOTU]) is usually the same as an ESV (exact sequence variant) or ASV (amplicon sequence variant). This should be the output of one of the denosing algorithm: dada2 or deblur. You want to denoise to get this output because denoising addresses error in your sequences. If you use the directly dereplicated sequences, you'll introduce error from PCR and sequencing.

The ASVs or raw sequences can then be clustered into operational taxonomic units (OTUs) which are just clusters of sequences with some threshold of similarity.

Here's a really nice discussion about OTUs and ASVs, etc

ASVs and OTUs are one kind of feature a table can contain. But if you used something like q2-PICRUSt, your feature table might contain genes. If you use q2-taxa and collapsed your taxonomy, your feature table would contain the collapsed taxonomy. If you have metabolomics and you have q2-metabolomics then your features are molecules. You could even make a feature table out of metagenomics data using q2-metaphlan or q2-shogun.

Best,
Justine