I think I'd better to fully introduce my condition.
My data are PGM data. detailed information is here Demultiplex joined reads sequences
then my work route is as follow:
-
download and setup qiime1 as tutorial
-
validate mapping file “sample-mapping.txt”
-
demux and quality control the "full-files" (not the pr-file because little sequence pass the filter)
split_libraries.py -m sample-mapping.txt -f full.fasta -q full.qual -b 8 -o split_library_output” -
construct unrooted tree with code “
pick_de_novo_otus.py -i split_library_output/seqs.fna -o otus" -
enter q2:
-
import data
import sequences
qiime tools import \
--input-path seqs.fna \
--output-path seqs.qza \
--type 'SampleData[Sequences]’
import FeatureTable
qiime vsearch dereplicate-sequences
--i-sequences seqs.qza \
--o-dereplicated-table table.qza \
--o-dereplicated-sequences rep-seqs.qza
import unrooted tree (phylogenetic tree)
qiime tools import \
--input-path unrooted-tree.tre \
--output-path unrooted-tree.qza \
--type 'Phylogeny[Unrooted]'
unrooted to rooted
qiime phylogeny midpoint-root \
--i-tree unrooted-tree.qza \
--o-rooted-tree rooted-tree.qza
#generate core metrics: sampling depth could vary. Here 50000 was chosen since the least counts are 6XXX, 3XXXX, 6XXXX, 7xxxx, and chinese version tutorial indicates that usually 30000/50000 are suitable depth.
#generate core metrics
qiime diversity core-metrics-phylogenetic --i-phylogeny rooted-tree.qza --i-table table.qza --p-sampling-depth 50000 --m-metadata-file sample-metadata.txt --output-dir core-metrics-results
Did I do any thing wrong?