Is there any functionality to subset randomly from a set of reference sequences? For instance, qiime taxa filter-seqs is great when I want to retain (or exclude) some taxonomic group within a reference database, but what I I want to select just a subset of references?
Maybe this is more of a feature request for RESCRIPt @SoilRotifer, @Nicholas_Bokulich, and @thermokarst? For now I’m resorting to exporting the sequences as a .fasta and using seqkit sample to perform the subsampling, collecting those featureID’s as a text file, and filtering the original dataset with
qiime feature-table filter-seqs.
Thanks for any creative thoughts if you know of a way to use existing tools within QIIME to avoid exporting/importing!