Hello everyone!
I’m very new to qiime2, and I’ve been running into an issue demultiplexing my sequence data. When I take my data through a EMP paired ends concatenation and demultiplexing process, the resulting qzv file, which should show me an overview and quality score for 50 sequences only shows me three sequences… I’m wondering where the other 47 went?
Essentially, I’ve been working with a sample that contains sequences for 50 sample (each has a forward, reverse and barcode associated with them).
The order of commands I took were:
- I imported data from our remote illumina server into our campus remote qiime2 server:
mv ORIGINALFILENAME.fastq.gz forward.fastq.gz
mv ORIGINALFILENAME.fastq.gz reverse.fastq.gz
mv ORIGINALFILENAME.fastq.gz barcodes.fastq.gz
I saved these in their own directory folder.
2.I imported my mapping file as a .txt file and changed it into a .tsv file type.
-I saved this in a level about my directory folder containing the forward, reverse and barcode reads.
-
I concatenated this data:
qiime tools import
–type EMPPairedEndSequences
–input-path DIRECTORY
–output-path FILENAME.qza -
I demultiplexed data:
qiime demux emp-paired
–m-barcodes-file FILENAME.tsv
–m-barcodes-category BarcodeSequence
–i-seqs FILENAME.qza
–o-per-sample-sequences demux
–p-rev-comp-mapping-barcodes -
I visualized this qza file:
qiime demux summarize
–i-data demux.qza
–o-visualization demux.qzv -
I copied this qzv file to my desktop and visualized it in the qiime2 viewer website… this is where I noticed only 3 samples came up and the other 47 were missing.
What we have done to trouble shoot:
- We know from qiime1 workflow with this set that there isn’t an issue with the mapping file, at least that we are aware of.
- The internet could have went down during the concatenation steps, but if I felt that was the case, I removed the file in question and re-imported it into qiime2 again… ie I’m fairly sure that it wasn’t an issue with broken data getting imported.
- I’ve tried re-importing this data twice, under different folders. Both times I’ve ended up with the same qzv missing files visualization issues.
- We have confirmed our barcode, forward and reverse read files look okay.
What we are wondering:
- Is there a “built in” filtering step that occurs with the demux command? So that the resulting qzv file will only show reads of a certain quality and other samples are removed? If that is the case, is there a way I can work around this?
- Has anyone encountered a similar issue and what did you do?
- Is there a better way to concatenate and demultiplex paired end data? (we used earth microbiome project primers).
- Could it possible be something with the qiime2 view website that is only showing me part of my data?
- Should the mapping file be formatted differently for qiime2 as compared to qiime1?
Thank you all for your assistance!!