I just got my first batch of MiSeq data and I spent a bunch of time reading the qiime2. org website, and I did all the tutorials. However, I’m having a hard time getting my own data into qiime2. I have paired-end multiplexed data. I do not have a separate file for each sample, instead I have one .fastq file ( not fastq.gz) for the forward reads, one for the reverse reads and one for the indexing barcodes. As far as I can tell, I don’t have any information on the quality (phred scores) of the sequences. Given this information can you advise me on which approach I should use to import my data. Any insights are much appreciated.
thermokarst
(Matthew Ryan Dillon)
assigned thermokarst
#2
This sounds a lot like the EMP format, with the exception of your files not being gzipped (.gz).
Really? That would surprise me, since fastq is designed to have that kind of information (the quality is the “q” in fastq*) - how about you run the head command on your forward reads and copy-and-paste that information here?
head my-forward-reads.fastq
In the meantime, it sounds like you can probably gzip your files, and use the EMP paired-end import format. To gzip your files, you could run something like this:
Thank so much for you help! You’re right, my fastq files of course had quality data contained within. Zipping my files with the script you provided did the trick and got me started. Thanks so much!