Hii!
I'm facing a conceptual question.. I'm using Illumina Miseq 2x300bp files.
When preparing libraries, three main "concepts" are combined: primers for amplifying the specific gene, adapters for compatibility with the sequencer, and barcodes for sample identification. Is this correct?
I receive one FASTQ file per sample, so the sequencing is already demultiplexed and I understand that the barcodes are no longer within the sequences I receive, right?
It remains to understand what to do with the adapters and primers... In the FASTQ files we receive from the sequencer, would the adapter and primer sequences be present? I've checked different posts and tutorials and everyone talks about primers, but does those primers refer to PCR primers or are referring to the adapters?
From the genomics platform, they have told us that they used the following protocol:
On page 3, it talks about primers:
16S Amplicon PCR Forward Primer = 5'
TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCCTACGGGNGGCWGCAG
16S Amplicon PCR Reverse Primer = 5'
GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGACTACHVGGGTATCTAATCC
But also about adapters:
Forward overhang: 5’ TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG‐[locus‐ specific sequence]
Reverse overhang: 5’ GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG‐[locus‐ specific sequence]
And if we look, the adapter sequence matches the beginning of the primers above...
Knowing these sequences.. How should I use cutadapt? Should I remove the adapter sequence or the adapter+primer? We are unsure if it's better to remove everything or if removing just the adapter is sufficient, as the primer is part of the gene...
Thank you so much in advance!