Hi,
I had a follow-up question for this thread. I've used QIIME2's CLI interface complete an analysis using the same parameters used below (so I know it works) but am now hoping to build my pipeline using the Artifact API and the python plugins as much as possible.
My question is related to passing the artifact generated by Artifact.import_data() into q2_dada2.denoise_paired().
I am passing in demultiplexed, paired-end, Illumina FASTQs. I use the following python code to import the data into python and save it as an artifact (QZA):
from qiime2 import Artifact
import q2_dada2 as dada2
# Import data from demultiplexed FASTQs and save as an artifact
demux_unfiltered = Artifact.import_data('SampleData[PairedEndSequencesWithQuality]',
'markerXXX_manifest.csv',
view_type='PairedEndFastqManifestPhred33')
demux_unfiltered.save('markerXXX_manifest.qza')
When I ask python for the help file for DADA2's denoise_paired algorithm, I get the following. Am I correct is reading that it expects only single lane per sample data?
help dada2.denoise_paired
Help on function denoise_paired in module q2_dada2._denoise:
denoise_paired(demultiplexed_seqs:q2_types.per_sample_sequences._format.SingleLanePerSamplePairedEndFastqDirFmt, trunc_len_f:int, trunc_len_r:int, trim_left_f:int=0, trim_left_r:int=0, max_ee:float=2.0, trunc_q:int=2, chimera_method:str='consensus', min_fold_parent_over_abundance:float=1.0, n_threads:int=1, n_reads_learn:int=1000000, hashed_feature_ids:bool=True) -> (<class 'biom.table.Table'>, <class 'q2_types.feature_data._transformer.DNAIterator'>)
Just to check, I try to use the artifact I previously created in denoise_paired() and am presented with the following error:
# Filter, denoise, pair, and remove chimeras
dada2.denoise_paired(Artifact.load('markerXXX_manifest.qza'),
trunc_len_f[idx], trunc_len_r[idx],
trim_left_f[idx], trim_left_r[idx], trunc_q[idx],
n_threads=cpus)
AttributeError: 'Artifact' object has no attribute 'sequences'
I must be missing something basic here - like a conversion to a different artifact type. Casava 1.8, perhaps? I can't find anything clear in the documentation and when I inspect the _denoise.py source, it clearly requires that my artifact class have a .sequences object, which it does not. Maybe there is a format that constructs a .sequences object.
Any advice would be appreciated.
Thanks!
Josh