Discard/remove reads from fastq files by IDs (txt file)

sbarvaux · August 13, 2020, 3:53pm

Dear All,

I try to remove/discard (NOT extract a subset) reads from FASTQ files with IDs. I have a .txt file with IDs I want to remove (1 line = 1 ID).
Miniconda3 and Qiime2 are installed, but I do not find the right command to do that.

Can someone help me with that issue ?

Thank you in advance

SoilRotifer · August 13, 2020, 4:29pm

Hi @sbarvaux, welcome to !

The general documentation is here. But more specifically check out the filtering tutorials.

-Mike

sbarvaux · August 14, 2020, 7:35am

Hi SoilRotifer !

Thank you for your reply !
Actually, I am quiet new in bioinformatic stuffs, and I’ve already seen that filtering tutorial before asking my question, but it is like theses tutorials are used to filter (keep) some sequences and not really used to delete them from a file.
Moreover, I have fastq files and text files, I do not really understand how to apply any of these tutorials on my format files.
In other words, I am looking for a similar command to the filter_fasta.py function of QIIME1 ( I can not download it …).

SoilRotifer · August 14, 2020, 2:53pm

Hi @sbarvaux,

Actually, you can use these commands to keep or remove (delete) features, samples, etc... as referenced throughout the filtering tutorial page.

I mis-read your initial post, and then realized you were looking to filter fastq and not fasta. So, I pinged my wonderful colleagues for help and they referred me to qiime demux filter-samples. It may not be exactly what you are looking for, as this command simply removes samples of fastq files based on the Sample ID and not sequence / feature ID.

system · September 14, 2020, 8:53pm

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.