Confirmation on code that removes Homopolymers

tamardigrade · July 22, 2023, 5:19am

Hi,

I am using QIIME 2 OTU/Naive Bayes classifier to process ITS amplicons. I am looking for confirmation on what/which code actually removes homopolymers. I've looked through the website/tutorials and cannot find it (the closest thing I've found is --p-max-ambiguous from q-score (q-score: Quality filter based on sequence quality scores. — QIIME 2 2023.5.1 documentation) btu this is for ambiguous bases not homopolymers. Can someone with better knowledge of QIIME 2 code please provide confirmation and link to any code that does remove homopolymers? Thanks so much!

Nicholas_Bokulich · July 22, 2023, 5:29pm

Hi @tamardigrade ,

See here:

The cull-seqs action is part of the RESCRIPt plugin, which must be installed separately. Note that this will only remove homopolymer from FeatureData[Sequence] or RNASequence artifacts, i.e., after reads have been denoised or clustered.

Good luck!

peebeenojay · August 11, 2023, 5:04pm

Hello,

Just to confirm, in order to remove reads that have homopolymers with n > x (either user-defined x, or by default x), one would have to install and apply the RESCRIPt plugin? I.e., QIIME2 does not by default remove reads with homopolymers?

I believe this should be clarified, since in QIIME 1 had split_libraries.py that had the option to exclude reads with --max_homopolymer over a certain number. I could not find any doc in QIIME2 that refers to homopolymers.

I just want to make sure reads with homopolyers are not being removed in the background in some hidden step.
This is important in the analysis of ITS sequences, and is refered in a 2022 review paper that QIIME2 does remove reads with homopolymers n>5 in its script.

Kind regards,
Cátia Fidalgo

SoilRotifer · August 14, 2023, 1:12pm

Hi @peebeenojay,

QIIME 1 & QIIME 2 are quite different, and many of the QIIME 1 commands do not have an analogue in QIIME 2.

I am not sure why the authors of the review paper think this is the case. It is possible that the authors confused QIIME 1 and QIIME 2 functionality.

Yes, as already mentioned in this thread, RESCRIPt is a QIIME 2 plugin that must be installed separately.

Correct, QIIME 2 does not remove homopolymers by default.

-Cheers!
-Mike

system · September 14, 2023, 7:13pm

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.