I am very new to using Qiime2, and I don’t have any coding experience.
I have single end, already demultiplexed reads produced by an Ion Torrent PGM. I have been able to import these reads using the fastq manifest format, and everything went fine. I was also able to import them as Casava format. The below error happens with both formats.
I have tried to trim my primers using this command:
qiime cutadapt trim-single --i-demultiplexed-sequences MC1_run1_demux.qza --p-adapter CTTCCGGTACACTTACCATG --p-front ACACCGCCCGTCACTCT --p-error-rate 0.1 --o-trimmed-sequences MC1_run1_Teleo_trimmed.qza --verbose
and I get this error message:
Plugin error from cutadapt:
/tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-40fz3afg/Teleo_1_L001_R1_001.fastq.gz is not a(n) FastqGzFormat file:
Missing sequence for record beginning on line 5
See above for debug info
Here is the complete output:
Running external command line application. This may print messages to stdout and/or stderr.
The commands to be run are below. These commands cannot be manually re-run as they will depend on temporary files that no longer exist.
Command: cutadapt --cores 1 --error-rate 0.1 --times 1 --overlap 3 -o /tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-fxl7g674/MC1_run1_12S_1_L001_R1_001.fastq.gz --adapter CTTCCGGTACACTTACCATG --front ACACCGCCCGTCACTCT /tmp/qiime2-archive-25f11crf/0a91b822-d856-4f7b-9489-add0061b5fed/data/MC1_run1_12S_1_L001_R1_001.fastq.gz
This is cutadapt 1.15 with Python 3.5.4
Command line parameters: --cores 1 --error-rate 0.1 --times 1 --overlap 3 -o /tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-fxl7g674/MC1_run1_12S_1_L001_R1_001.fastq.gz --adapter CTTCCGGTACACTTACCATG --front ACACCGCCCGTCACTCT /tmp/qiime2-archive-25f11crf/0a91b822-d856-4f7b-9489-add0061b5fed/data/MC1_run1_12S_1_L001_R1_001.fastq.gz
Running on 1 core
Trimming 2 adapters with at most 10.0% errors in single-end mode ...
Finished in 5.26 s (36 us/read; 1.65 M reads/minute).
=== Summary ===
Total reads processed: 145,009
Reads with adapters: 246 (0.2%)
Reads written (passing filters): 145,009 (100.0%)
Total basepairs processed: 18,525,079 bp
Total written (filtered): 18,524,316 bp (100.0%)
=== Adapter 1 ===
Sequence: CTTCCGGTACACTTACCATG; Type: regular 3'; Length: 20; Trimmed: 235 times.
No. of allowed errors:
0-9 bp: 0; 10-19 bp: 1; 20 bp: 2
Bases preceding removed adapters:
A: 19.6%
C: 40.9%
G: 30.6%
T: 8.9%
none/other: 0.0%
Overview of removed sequences
length count expect max.err error counts
3 220 2265.8 0 220
4 10 566.4 0 10
5 1 141.6 0 1
6 4 35.4 0 4
=== Adapter 2 ===
Sequence: ACACCGCCCGTCACTCT; Type: regular 5'; Length: 17; Trimmed: 11 times.
No. of allowed errors:
0-9 bp: 0; 10-17 bp: 1
Overview of removed sequences
length count expect max.err error counts
3 10 2265.8 0 10
4 1 566.4 0 1
Command: cutadapt --cores 1 --error-rate 0.1 --times 1 --overlap 3 -o /tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-fxl7g674/MC1_run1_MiFish_2_L001_R1_001.fastq.gz --adapter CTTCCGGTACACTTACCATG --front ACACCGCCCGTCACTCT /tmp/qiime2-archive-25f11crf/0a91b822-d856-4f7b-9489-add0061b5fed/data/MC1_run1_MiFish_2_L001_R1_001.fastq.gz
This is cutadapt 1.15 with Python 3.5.4
Command line parameters: --cores 1 --error-rate 0.1 --times 1 --overlap 3 -o /tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-fxl7g674/MC1_run1_MiFish_2_L001_R1_001.fastq.gz --adapter CTTCCGGTACACTTACCATG --front ACACCGCCCGTCACTCT /tmp/qiime2-archive-25f11crf/0a91b822-d856-4f7b-9489-add0061b5fed/data/MC1_run1_MiFish_2_L001_R1_001.fastq.gz
Running on 1 core
Trimming 2 adapters with at most 10.0% errors in single-end mode ...
Finished in 5.33 s (47 us/read; 1.27 M reads/minute).
=== Summary ===
Total reads processed: 112,529
Reads with adapters: 581 (0.5%)
Reads written (passing filters): 112,529 (100.0%)
Total basepairs processed: 21,585,797 bp
Total written (filtered): 21,583,924 bp (100.0%)
=== Adapter 1 ===
Sequence: CTTCCGGTACACTTACCATG; Type: regular 3'; Length: 20; Trimmed: 483 times.
No. of allowed errors:
0-9 bp: 0; 10-19 bp: 1; 20 bp: 2
Bases preceding removed adapters:
A: 17.2%
C: 12.2%
G: 48.9%
T: 21.7%
none/other: 0.0%
Overview of removed sequences
length count expect max.err error counts
3 374 1758.3 0 374
4 98 439.6 0 98
5 7 109.9 0 7
6 1 27.5 0 1
7 2 6.9 0 2
10 1 0.1 1 0 1
=== Adapter 2 ===
Sequence: ACACCGCCCGTCACTCT; Type: regular 5'; Length: 17; Trimmed: 98 times.
No. of allowed errors:
0-9 bp: 0; 10-17 bp: 1
Overview of removed sequences
length count expect max.err error counts
3 98 1758.3 0 98
Command: cutadapt --cores 1 --error-rate 0.1 --times 1 --overlap 3 -o /tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-fxl7g674/MC1_run1_Teleo_0_L001_R1_001.fastq.gz --adapter CTTCCGGTACACTTACCATG --front ACACCGCCCGTCACTCT /tmp/qiime2-archive-25f11crf/0a91b822-d856-4f7b-9489-add0061b5fed/data/MC1_run1_Teleo_0_L001_R1_001.fastq.gz
This is cutadapt 1.15 with Python 3.5.4
Command line parameters: --cores 1 --error-rate 0.1 --times 1 --overlap 3 -o /tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-fxl7g674/MC1_run1_Teleo_0_L001_R1_001.fastq.gz --adapter CTTCCGGTACACTTACCATG --front ACACCGCCCGTCACTCT /tmp/qiime2-archive-25f11crf/0a91b822-d856-4f7b-9489-add0061b5fed/data/MC1_run1_Teleo_0_L001_R1_001.fastq.gz
Running on 1 core
Trimming 2 adapters with at most 10.0% errors in single-end mode ...
Finished in 1.56 s (29 us/read; 2.08 M reads/minute).
=== Summary ===
Total reads processed: 54,129
Reads with adapters: 53,627 (99.1%)
Reads written (passing filters): 54,129 (100.0%)
Total basepairs processed: 5,085,118 bp
Total written (filtered): 1,378,531 bp (27.1%)
=== Adapter 1 ===
Sequence: CTTCCGGTACACTTACCATG; Type: regular 3'; Length: 20; Trimmed: 36399 times.
No. of allowed errors:
0-9 bp: 0; 10-19 bp: 1; 20 bp: 2
Bases preceding removed adapters:
A: 0.0%
C: 0.2%
G: 0.0%
T: 0.1%
none/other: 99.6%
Overview of removed sequences
length count expect max.err error counts
3 11 845.8 0 11
4 3 211.4 0 3
5 1 52.9 0 1
24 1 0.0 2 0 1
25 166 0.0 2 126 27 13
26 124 0.0 2 94 19 11
27 120 0.0 2 93 15 12
28 83 0.0 2 59 14 10
29 52 0.0 2 33 12 7
30 55 0.0 2 36 10 9
31 61 0.0 2 54 3 4
32 52 0.0 2 34 13 5
33 65 0.0 2 56 2 7
34 39 0.0 2 35 3 1
35 59 0.0 2 45 10 4
36 48 0.0 2 42 4 2
37 257 0.0 2 238 17 2
38 72 0.0 2 50 20 2
39 54 0.0 2 43 8 3
40 69 0.0 2 64 3 2
41 109 0.0 2 95 7 7
42 129 0.0 2 116 8 5
43 145 0.0 2 130 13 2
44 94 0.0 2 81 11 2
45 107 0.0 2 100 3 4
46 131 0.0 2 112 15 4
47 108 0.0 2 90 12 6
48 134 0.0 2 116 15 3
49 132 0.0 2 104 20 8
50 109 0.0 2 96 12 1
51 150 0.0 2 130 16 4
52 74 0.0 2 46 25 3
53 97 0.0 2 82 11 4
54 178 0.0 2 160 14 4
55 79 0.0 2 53 21 5
56 108 0.0 2 78 21 9
57 86 0.0 2 57 24 5
58 65 0.0 2 44 16 5
59 62 0.0 2 45 10 7
60 66 0.0 2 48 16 2
61 55 0.0 2 39 16
62 43 0.0 2 30 11 2
63 72 0.0 2 61 9 2
64 57 0.0 2 35 18 4
65 44 0.0 2 29 12 3
66 43 0.0 2 30 11 2
67 71 0.0 2 47 18 6
68 49 0.0 2 43 6
69 111 0.0 2 96 12 3
70 60 0.0 2 42 17 1
71 82 0.0 2 70 7 5
72 49 0.0 2 37 10 2
73 43 0.0 2 36 7
74 51 0.0 2 44 5 2
75 38 0.0 2 25 12 1
76 37 0.0 2 30 4 3
77 53 0.0 2 44 7 2
78 50 0.0 2 37 11 2
79 40 0.0 2 34 4 2
80 40 0.0 2 35 3 2
81 64 0.0 2 56 4 4
82 45 0.0 2 35 8 2
83 108 0.0 2 102 6
84 43 0.0 2 38 5
85 54 0.0 2 49 4 1
86 72 0.0 2 66 3 3
87 61 0.0 2 50 9 2
88 62 0.0 2 48 10 4
89 73 0.0 2 61 8 4
90 53 0.0 2 44 9
91 46 0.0 2 39 4 3
92 55 0.0 2 48 5 2
93 59 0.0 2 52 4 3
94 39 0.0 2 29 10
95 116 0.0 2 107 8 1
96 148 0.0 2 140 4 4
97 251 0.0 2 241 8 2
98 523 0.0 2 453 23 47
99 989 0.0 2 888 78 23
100 18487 0.0 2 18351 120 16
101 8385 0.0 2 8297 83 5
102 877 0.0 2 796 75 6
103 335 0.0 2 278 46 11
104 209 0.0 2 167 34 8
105 123 0.0 2 86 33 4
106 65 0.0 2 36 27 2
107 45 0.0 2 23 21 1
108 52 0.0 2 29 21 2
109 36 0.0 2 21 11 4
110 41 0.0 2 22 16 3
111 17 0.0 2 10 6 1
112 16 0.0 2 11 3 2
113 16 0.0 2 10 5 1
114 10 0.0 2 7 3
115 10 0.0 2 8 1 1
116 3 0.0 2 3
117 6 0.0 2 6
118 7 0.0 2 4 0 3
119 7 0.0 2 6 1
120 7 0.0 2 6 1
121 8 0.0 2 7 1
122 7 0.0 2 4 2 1
123 5 0.0 2 5
124 8 0.0 2 8
125 3 0.0 2 3
126 3 0.0 2 0 1 2
127 3 0.0 2 3
129 3 0.0 2 2 0 1
130 4 0.0 2 3 1
131 3 0.0 2 3
132 2 0.0 2 0 1 1
133 2 0.0 2 0 2
135 4 0.0 2 4
136 3 0.0 2 3
137 3 0.0 2 3
138 3 0.0 2 3
140 2 0.0 2 2
141 4 0.0 2 4
142 3 0.0 2 1 2
143 4 0.0 2 3 1
144 2 0.0 2 2
145 5 0.0 2 4 1
147 1 0.0 2 1
148 1 0.0 2 1
149 1 0.0 2 1
150 1 0.0 2 1
151 1 0.0 2 1
152 1 0.0 2 1
153 2 0.0 2 2
157 2 0.0 2 2
162 1 0.0 2 1
163 1 0.0 2 1
164 3 0.0 2 3
166 1 0.0 2 1
170 1 0.0 2 1
173 1 0.0 2 1
175 2 0.0 2 2
181 1 0.0 2 0 0 1
182 1 0.0 2 1
183 1 0.0 2 1
187 1 0.0 2 1
190 1 0.0 2 1
193 1 0.0 2 1
198 2 0.0 2 2
199 1 0.0 2 1
203 1 0.0 2 1
206 1 0.0 2 1
207 1 0.0 2 1
208 1 0.0 2 1
215 1 0.0 2 1
218 1 0.0 2 1
219 1 0.0 2 1
221 1 0.0 2 1
225 1 0.0 2 1
258 1 0.0 2 1
266 1 0.0 2 1
297 3 0.0 2 3
298 1 0.0 2 1
299 1 0.0 2 1
300 1 0.0 2 1
301 1 0.0 2 1
309 1 0.0 2 1
312 1 0.0 2 1
338 1 0.0 2 1
347 1 0.0 2 1
348 2 0.0 2 2
349 1 0.0 2 0 0 1
354 2 0.0 2 1 1
360 1 0.0 2 1
=== Adapter 2 ===
Sequence: ACACCGCCCGTCACTCT; Type: regular 5'; Length: 17; Trimmed: 17228 times.
No. of allowed errors:
0-9 bp: 0; 10-17 bp: 1
Overview of removed sequences
length count expect max.err error counts
9 1 0.2 0 1
10 1 0.1 1 1
11 5 0.0 1 5
12 9 0.0 1 9
13 79 0.0 1 75 4
14 193 0.0 1 183 10
15 90 0.0 1 62 28
16 485 0.0 1 378 107
17 15298 0.0 1 15169 129
18 1058 0.0 1 57 1001
19 5 0.0 1 2 3
24 1 0.0 1 1
30 1 0.0 1 1
33 1 0.0 1 1
42 1 0.0 1 1
Traceback (most recent call last):
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/qiime2/plugin/model/file_format.py", line 24, in validate
self._validate_(level)
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/q2_types/per_sample_sequences/_format.py", line 159, in _validate_
self._check_n_records(record_count_map[level])
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/q2_types/per_sample_sequences/_format.py", line 130, in _check_n_records
% (i * 4 + 1))
qiime2.plugin.model.base.ValidationError: Missing sequence for record beginning on line 5
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/q2cli/commands.py", line 224, in __call__
results = action(**arguments)
File "<decorator-gen-356>", line 2, in trim_single
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/qiime2/sdk/action.py", line 228, in bound_callable
output_types, provenance)
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/qiime2/sdk/action.py", line 391, in _callable_executor_
spec.qiime_type, output_view, spec.view_type, prov)
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/qiime2/sdk/result.py", line 239, in _from_view
result = transformation(view)
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/qiime2/core/transform.py", line 57, in transformation
self.validate(view)
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/qiime2/core/transform.py", line 131, in validate
view.validate('min')
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/qiime2/plugin/model/directory_format.py", line 171, in validate
getattr(self, field)._validate_members(collected_paths, level)
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/qiime2/plugin/model/directory_format.py", line 101, in _validate_members
self.format(path, mode='r').validate(level)
File "/home/qiime2/miniconda/envs/qiime2-2017.12/lib/python3.5/site-packages/qiime2/plugin/model/file_format.py", line 29, in validate
) from e
qiime2.plugin.model.base.ValidationError: /tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-fxl7g674/MC1_run1_Teleo_0_L001_R1_001.fastq.gz is not a(n) FastqGzFormat file:
Missing sequence for record beginning on line 5
Plugin error from cutadapt:
/tmp/q2-CasavaOneEightSingleLanePerSampleDirFmt-fxl7g674/MC1_run1_Teleo_0_L001_R1_001.fastq.gz is not a(n) FastqGzFormat file:
Missing sequence for record beginning on line 5
See above for debug info.
Any suggestions on how I can get this to produce a trimmed output file? It looks like it’s able to trim the primers and then something goes wrong.
Thanks!
Erin