When I look at the distance matrix that was also created, all of the values are 1, indicating that there are no shared features between any of my samples. These results seem like something is wrong at some point in my analysis. I've looked at this thread Bray-Curtis PCoA results visualisation problem - #5 by Sapsanas to try and figure out a solution to my issue, but the issue there was the presence of NA values in their metadata set, and to my knowledge qiime2 now accepts NAs in the metadata file.
Thanks for posting that screenshot. Yes, something is wrong!
That's correct. Let's see if we can figure out what removed shared microbes between samples. Did you perform any filtering or merging on your feature table?
I've also seen this issue when demultiplexing failed, leaving barcodes inside the reads. These unique barcodes made common features seam distinct, leading to totally different samples (which looks similar once the barcode issue was fixed).
Yes, I originally had to merge two datasets (same amplicon, they were just sequenced at two different times). I thought that was causing the issue so I went back and re-ran everything with only one of my datasets and still had the same issue.
These are the steps that I followed (I used the Moving Pictures tutorial as a guide):
I extracted barcodes using Qiime 1’s extract barcodes command (another colleague did this for his data with no issues)
I used the artifact creation step for paired end data
I demultiplexed and used the demux visualization file to determine where to trim the reads
I used DADA2 for quality control
I originally used the feature merge command to merge my datasets, but I also tried running everything on my datasets separately and that did not fix the issue
I followed the commands to create a tree which I used for the diversity analysis
Finally, I ran the core metrics phylogeny command and visualized the Bray-Curtis PCOA.
Is there a way that I can check to see if the barcodes were left within the reads?
Have you had a chance to check out the cutadapt community tutorial? You could start with your reads before your step 1 listed above, import following the cutadapt guide suggested above, then demultiplex and/or trim any additional known adapters.
Note that this assumes that the barcode is at the beginning of the sequence. By the way, you could do the same for primers as long as you "expand" the ambiguous nucleotides; for example for GTGCCAGCMGCCGCGGTAA, you could search for:
Thank you for the reply! I have been trying to follow the cutadapt community tutorial for paired-end reads but it keeps timing out. Do you know approximately how long this should take? (Right now, it it fails after 4 days, which makes me think there is an issue.)
It hasn’t given us an error, but it has been running for 12 days and based on how much computational resources are currently being used on the node its “running” on, I don’t think its actually doing anything. It did start properly though, and according to the outfile it is running:
This is cutadapt 1.15 with Python 3.5.4
Command line parameters: --front file:/tmp/tmphvl3za80 --error-rate 0.1 -o /tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-2e15vgt8/{name}.1.fastq.gz --untrimmed-output /tmp/q2-MultiplexedPairedEndBarcodeInSequenceDirFmt-p7jg0mux/forward.fastq.gz -p /tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-2e15vgt8/{name}.2.fastq.gz --untrimmed-paired-output /tmp/q2-MultiplexedPairedEndBarcodeInSequenceDirFmt-p7jg0mux/reverse.fastq.gz /tmp/qiime2-archive-qliunjuj/575bb642-0496-44e9-b34c-b9cb010c5e3a/data/forward.fastq.gz /tmp/qiime2-archive-qliunjuj/575bb642-0496-44e9-b34c-b9cb010c5e3a/data/reverse.fastq.gz
Running on 1 core
Trimming 48 adapters with at most 10.0% errors in paired-end legacy mode …
We installed version 2018.6 and tried to rerun the command, but we had the same issue- it hasn’t given us an error, but it also hasn’t finished running (its been running for more than 5 days). Based on the computational resources being used, we don’t think its actually doing anything.
How many reads are in this file you are trying to demultiplex?
This plugin uses cutadapt directly - are you able to demultiplex using that tool independently, then import into QIIME 2?
Its possible this is an issue with the computation environment — it sounds like this might be running on a cluster - can you provide some details about the configuration there? Also, maybe it is worth trying to run the command on another machine.
Since I posted the last comment, I was able to figure out the issue with cutadapt- it says demultiplexing paired-end files is not available, so this could be where we are running into all of the issues. Do you have any suggestions about this?