Since DADA2 uses the information from a single (full?) MiSeq run, I am not sure whether I should import only the reads assigned to samples, or the undetermined reads as well?
When importing "Casava 1.8 paired-end demultiplexed fastq", does it import ALL the fastq.gz files in the --input-path dir? If yes, do I need to keep only those files that I plan to import (as per your answer to Q1) in that folder?
DADA2 will still work with a subset of samples, so you could skip importing these. If you did want to import the unassigned, you could lump them into an "unassigned" sample, which you could then filter out once you have a feature table in hand.
Yes, this will import all fastq.gz files that match the CASAVA 1.8 naming convention. If there are files in that dir you don't want imported, remove them first.
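If it helps, here is a rough Python sketch for checking a directory listing before you import. It does not call QIIME 2 at all. The regular expression is only my approximation of the CASAVA 1.8 layout (sample ID, barcode or sample number, lane, read direction, set number), and the file names and the `split_for_import` helper are made up for illustration.

```python
import re

# Approximation (an assumption, not QIIME 2's own validator) of the
# CASAVA 1.8 layout: <sample-id>_<barcode>_L<lane>_R<1|2>_001.fastq.gz
CASAVA_18 = re.compile(r"^(?P<sample>.+)_[^_]+_L\d{3}_R[12]_001\.fastq\.gz$")


def split_for_import(filenames, skip_samples=("Undetermined",)):
    """Split file names into those to keep for import and those to move away.

    A file is kept only if it matches the approximate CASAVA 1.8 pattern
    and its sample ID is not listed in skip_samples.
    """
    keep, skip = [], []
    for name in sorted(filenames):
        match = CASAVA_18.match(name)
        if match and match.group("sample") not in skip_samples:
            keep.append(name)
        else:
            skip.append(name)
    return keep, skip


# Hypothetical directory listing, for illustration only.
files = [
    "sampleA_S1_L001_R1_001.fastq.gz",
    "sampleA_S1_L001_R2_001.fastq.gz",
    "Undetermined_S0_L001_R1_001.fastq.gz",
    "Undetermined_S0_L001_R2_001.fastq.gz",
    "notes.txt",
]
keep, skip = split_for_import(files)
print("keep:", keep)
print("move out of the import dir:", skip)
```

To run it on a real folder you could pass in `os.listdir(your_dir)` and then move the `skip` files elsewhere before importing.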
Hope that helps!
thermokarst
(Matthew Ryan Dillon)