Hi thanks for join me in this forum. I have a problem to import paired end raw data to qiime2 and I want to use the EMPPairedEndSequences pipeline in the Atacama tutorial, i have 2 fastq.gz forward and reverse fastq for each sample, a list of barcodes reverse and forward in .xlsx file in this format:
Sample_Name I7_Index_ID index I5_Index_ID index2
2740_1 N704 TCCTGAGC S502 CTCTCTAT
2740_1 N704 TCCTGAGC S502 CTCTCTAT
Fastq file are in this format:
88B@CE=@=A581==-;;,;8,-;,<E,6-8++8C<,;@++++,;E,6,:,:=++6++88CFF+4+,4,:,4+,++,59B,5,:9,:+,<,955@?,>,4,+,8,+3+5+3,3,::,5,3,3,8,37,314,*41,7,7,+2*,7+,31++,+,45+2++++,34/33++++/1+++0<+2*+2001)/+1*+20)00-/8++1/)0**(.1))02).)0))))))((-(((),()(
@M05836:3:000000000-BP6KJ:1:1101:18741:1482 1:N:0:16
CCTACGGGAGGCAGCAGTGAGGAATATTGGTCAATGGACGCAAGTCTGAACCAGCCATGCCGCGTGCAGGATGACGGCTCTATGATTTGTAAACTTCTTTTTTACGGGGGTAAACTCTGATACTTTTATCTGTCTTTATGTACCTTTCGTATATGGATCGGCTAACTCCGTTCCAGCTGCCGCGGTTATTCGGAGGTTCCAAGCGTTATCCGGATTTATTTGGTTTATACTGTTCGTATGCGTTTTGTTAAGTTAGTTCTTAAATCCCGTTTCTTAACTCCCGCACTGCCTCTCATT
+
88BBCD=@AF5:)==;==,;;,6C,CF9,CF@@FFE9C+8+++;EF@@-;CCFE?,6,6CCFFF7@,:,6,:,++76C,C<,<C,<9,5<,59CEC,>,5++++4@+,85,8,5,<,<,8@@,5A,:7,3,:,3+,:,9D7+B>+,3@>,+6,36,33,363>8*>6,2,62,+243;;<92;>+>@++23:C)++++210))/077)2+.9***++1***61))((),.:++300)()((–(…548().)
@M05836:3:000000000-BP6KJ:1:1101:9032:1555 1:N:0:16
CCTACGGGGGGCAGCAGTGGGGAATATTGCACAATGGGGGAAACCCTGATCCAGCGACGCCGCGTGGAGGATGAAGGTTTTCTGTTTTTTAACTCCTGTCTTCGTGGACGTTAATTACCCTTCCCGTTCTTGTTTCCACCTCTTTCTTCTTTCCACCTGCCGCGGTTATACGTACGTTGCAAGCGTTTTCCGGAATTTCTTGGTGTTCTGCGAGCTCATGCGGTTCGGCTTGTTTGTGTTTTAAGCTCTCGGCTCATCCCCCTAACTTCCTTCTTCACTCCCGCTCTTCTGTTGTGCAGTT
ect…
Primers Illumina:
forward: CCTACGGGNGGCWGCAG
reverse: GACTACHVGGGTATCTAATCC
Please can someone help me? I’m new on bioinformatic, Thanks in advance.
Lisa