How to account for uneven sampling

Mahima_y · July 9, 2025, 6:26am

Hello! I have my ASV relative abundance data (normalized per sample), and I'm visualizing the taxonomic classifications across the various months I sampled. I'm working in R, and I realized that I haven't accounted for the uneven sampling depth per month (different months have a different number of samples). Is there a way I could account for this? Right now, I'm using the normalized abundances to calculate monthly Phyla abundance and using that for a stacked barplot.

colinbrislawn · July 10, 2025, 7:25pm

Hello Mahima,

EDIT: I mistakenly though you were asking about sampling depth during sequencing, and not balance/unbalanced number of samples in each cohort!

I've been able to publish imperfectly balanced cohorts, as long as we had enough reps. PMC5319707 CD n=49, UC n=60, HC = 9
PMC6775999 n = 20 samples taken, got n=4 at the first timepoint due to low-biomass

If you want to prove to your self (or reviewer three!) that you have enough reps, consider a power analysis as described in PMC10297957

Normalizing sampling depth been much debated over the years.

I recommend this paper:

"repeatedly rarefying" == rarefaction == bootstrapping

I recommend this plugin, which implements bootstrapping for use with Qiime2!

Mahima_y · August 9, 2025, 7:18pm

Thank you so much!! I appreciate it!