Hello, I am trying to determine how richness changes as a function of O2 availability in a marine setting. However, I am unsure whether I should use observed features or Chao1 as my richness metric. I am also confused at the different stories these metrics are telling me -- why are they so different?
For example, let's look at how each metric correlates with O2 using qiime diversity alpha-correlation.
First, observed features:
qiime diversity alpha-correlation
--i-alpha-diversity core-metrics-results-no-complexmulticellularity-4/observed_features_vector.qza
--m-metadata-file SI_2015_18S_PF_metadata_pal_all_bins.tsv
--o-visualization core-metrics-results-no-complexmulticellularity-4/observed_features_correlation_PAL.qzv
Raw data:
alpha-correlation-observed-features.tsv (2.4 KB)
Next, Chao1:
qiime diversity alpha-correlation
--i-alpha-diversity core-metrics-results-no-complexmulticellularity-4/chao1.qza
--m-metadata-file SI_2015_18S_PF_metadata_pal_all_bins.tsv
--o-visualization core-metrics-results-no-complexmulticellularity-4/chao1_correlation_PAL.qzv
Raw data:
alpha-correlation-chao1.tsv (3.8 KB)
My questions:
-
First off, the numbers on the y-axis in both plots refer to the number of taxa (ASVs) in each sample, right?
-
I think I understand how some samples in the Chao1 plot have higher values -- that means we missed some of the predicted diversity in our sampling, right?
-
Why do some of the samples in the Chao1 plot have such low values (e.g. 1 or 2)? And why do these samples not show up in the observed features analysis (e.g. Sample ID: MAR15_150_PF). Is it really likely that these samples only have 1 taxon?
Anyway, any help in sifting through all this is much appreciated!

