I'm totally new to bioinformatics. I have a degree in computer science and very basic knowledge in biology. I have been mandated to start to learn QIIME2 for a project. So I started the "Moving pictures" tutorial, so far so good, but I need to do something specific.
I have two files which I think are PairEnded, because the files ends with xyz_R1.fastq.gz and xyz_R2.fastq.gz. I tried to load the files in examples, but I get a wrong format error.
You are off to a good start! Have you had a chance to review any of the other tutorials on the User Docs site? In particular, have you had a chance to look at these two:
Thanks for giving a hand. I already tried the Moving Pictures Tutorial. But I get stuck when it comes to provide a ".tsv" file. I don't have this kind of file.
You need to create a metadata file with information about your sample IDs and your barcodes. Did you check out the cutadapt tutorial I linked to above? Once you have that file, you can proceed to demultiplexing.
I think maybe you misunderstood me --- I said you need to create a metadata file with information about which barcode maps to which sample --- this has nothing to do with a sequencing barcode file. There are at least 2 examples of these kinds of files in the many resources I have linked you to above.
Care to share? It isn't really helpful to any future readers of this thread if you don't provide some details about what worked for you.
Is that a valid metadata file that can describe the samples? If so, I feel a bit dumb. I have built this file a long time ago by hand. I just did not associate this to metadata... I just called it a manifest.
I think I getting to something. I just find out that the name has a meaning which followed the Illumina standards and the lines inside the FASTQ file are some kind of descriptor. Maybe I do not need the metadata.
@nbourre --- I suggest you take a step back and regroup.
Is your source data multiplexed or demultiplexed? You mentioned above that you only had two files. My understanding is that you have two, multiplexed files --- one for forward reads, one for reverse reads. This kind of data will need some list of which barcodes map to which samples, it is the only way you can demultiplex these data. Please note, this isn't specific to QIIME 2, this is going to be a required step with any tool you use. So, you will need to assemble the list of which barcodes belong to which samples. There is no way forward without that, assuming your data is multiplexed. Once you have that, you can follow the q2-cutadapt demultiplexing protocol I linked to above, but, let's wait on that and instead sort out the confusion around your source data.