Issue while importing BOLD database TSV taxonomy file

Hello,

I'm new to qiime2 and facing a problem importing the taxonomy tsv file from BOLD database. I'm using qiime2 2024.5 (amplicon distribution). I downloaded BOLD_Public.25-Oct-2024.tsv from Bold Systems v4

I used the command:
qiime tools import
--type 'FeatureData[Taxonomy]'
--input-format HeaderlessTSVTaxonomyFormat
--input-path BOLD_Public.25-Oct-2024.tsv
--output-path BOLD_Public.25-Oct-2024.tsv.qza

And I received following error:
Traceback (most recent call last):
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/q2cli/builtin/tools.py", line 852, in _import
artifact = qiime2.sdk.Artifact.import_data(
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/qiime2/sdk/result.py", line 332, in import_data
return cls.from_view(type, view, view_type, provenance_capture,
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/qiime2/sdk/result.py", line 360, in _from_view
result = transformation(view, validate_level)
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/qiime2/core/transform.py", line 70, in transformation
new_view = transformer(view)
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/qiime2/core/transform.py", line 221, in wrapped
file_view = transformer(view)
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/q2_types/feature_data/_transformer.py", line 224, in _20
_taxonomy_formats_to_dataframe(str(ff), has_header=False))
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/q2_types/feature_data/_transformer.py", line 62, in _taxonomy_formats_to_dataframe
df = pd.read_csv(filepath, sep='\t', skip_blank_lines=True,
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/pandas/io/parsers/readers.py", line 1026, in read_csv
return _read(filepath_or_buffer, kwds)
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/pandas/io/parsers/readers.py", line 626, in _read
return parser.read(nrows)
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/pandas/io/parsers/readers.py", line 1923, in read
) = self._engine.read( # type: ignore[attr-defined]
File "/dss/work/fitu9124/qiime2/miniconda3/envs/qiime2-amplicon-2024.5/lib/python3.9/site-packages/pandas/io/parsers/c_parser_wrapper.py", line 234, in read
chunks = self._reader.read_low_memory(nrows)
File "parsers.pyx", line 838, in pandas._libs.parsers.TextReader.read_low_memory
File "parsers.pyx", line 905, in pandas._libs.parsers.TextReader._read_rows
File "parsers.pyx", line 874, in pandas._libs.parsers.TextReader._tokenize_rows
File "parsers.pyx", line 891, in pandas._libs.parsers.TextReader._check_tokenize_status
File "parsers.pyx", line 2061, in pandas._libs.parsers.raise_parser_error
pandas.errors.ParserError: Error tokenizing data. C error: Expected 76 fields in line 1334102, saw 77

An unexpected error has occurred:

Error tokenizing data. C error: Expected 76 fields in line 1334102, saw 77

See above for debug info.

I really don't understand what the problem is. I even tried to cut off column 77 from the file, still same error.

Any help would be greatly appreciated :slight_smile:
Thanks in advance

Hi @Val,

I'd recommend reading through the following tutorial:

1 Like

Hi @SoilRotifer,
thanks a lot for your reply.

I've read this tutorial. I was just hoping that 4 years later it would be possible to import the BOLD database directly into qiime2.

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.