Silva_132 16S database creation with RESCRIPt

AndreaC · September 11, 2021, 6:05pm

Hi!

I want to create a database from SILVA_132 that contains only 16S sequences.

I have read the RESCRIPt tutorial on creating a database from SILVA, but I cannot figure out how to do it for SILVA 132 rather than the SILVA 138 shown in the example. If I download the Silva_132_release.zip folder from the Archive link, I cannot work out how to import its contents into qiime2 as in the first part of the tutorial ...

Instead of:

tax_slv_ssu_138.txt.gz
taxmap_slv_ssu_ref_nr_138.txt.gz
tax_slv_ssu_138.tre.gz

and the sequence file:

SILVA_138_SSURef_Nr99_tax_silva_trunc.fasta.gz

what should I use?

Thanks for your help

A.C.

crusher083 · September 11, 2021, 6:12pm

Hi, Andrea!

Switch to SILVA 132/Exports and continue with "Getting SILVA data: Hard Mode". Use the same filenames as in the tutorial, with 138 replaced by 132.
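
As a rough sketch of that substitution, the snippet below rewrites the four SILVA 138 filenames quoted in the question for release 132. The function name `silva_names` is made up for illustration, and the resulting names simply follow the suggestion above; they have not been checked against the actual 132 directory listing, where the compression of individual files may differ.

```shell
# Rewrite the tutorial's SILVA 138 filenames for another release (here 132).
# silva_names is a hypothetical helper, not part of RESCRIPt or QIIME 2.
silva_names() {
    version="$1"
    for name in \
        tax_slv_ssu_138.txt.gz \
        taxmap_slv_ssu_ref_nr_138.txt.gz \
        tax_slv_ssu_138.tre.gz \
        SILVA_138_SSURef_Nr99_tax_silva_trunc.fasta.gz
    do
        # Replace the "138" in each name with the requested version.
        printf '%s\n' "$name" | sed "s/138/${version}/"
    done
}

silva_names 132
```

If one of the printed names is missing from the Exports folder, it is worth looking for the same file without the .gz suffix before assuming it does not exist.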

Alternatively, use the built-in command and run:

qiime rescript get-silva-data \
    --p-version '132' \
    --p-target 'SSURef_NR99' \
    --p-include-species-labels \
    --o-silva-sequences silva-132-ssu-nr99-seqs.qza \
    --o-silva-taxonomy silva-132-ssu-nr99-tax.qza

Cheers

AndreaC · September 11, 2021, 6:36pm

Thanks for your help!

But if I wanted to extract the entire 16S sequence, as in the archive from the other link, how could I do that?
Or can I only select a region of the sequence, based on the primers used, with:
qiime feature-classifier extract-reads
?

Thanks!

crusher083 · September 11, 2021, 8:30pm

Hello Andrea,

The SILVA database in that file contains full-length 16S rRNA (Small Subunit, SSU) sequences. The later steps in the tutorial only optimize performance for the sequenced target region.

May I ask what your final goal is? Taxonomic classification?
If so, you can download a classifier trained on full-length SILVA_132 for older versions of QIIME2 here.
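
If you would rather train a full-length classifier yourself than use a pre-trained one, a minimal sketch might look like the following. It assumes the two artifacts produced by the `get-silva-data` command above, assumes that the sequences artifact holds RNA and so needs reverse-transcribing first, and skips the tutorial's optional filtering and dereplication steps. The `run` wrapper and the two new output filenames are hypothetical; by default the script only prints the commands, and it executes them only when `RUN=1` is set and `qiime` is on the PATH.

```shell
# Print each command; execute it only when RUN=1 and qiime is installed.
run() {
    echo "+ $*"
    if [ "${RUN:-0}" = "1" ] && command -v qiime >/dev/null 2>&1; then
        "$@"
    fi
}

build_full_length_classifier() {
    # Convert the downloaded RNA sequences to DNA (output name is hypothetical).
    run qiime rescript reverse-transcribe \
        --i-rna-sequences silva-132-ssu-nr99-seqs.qza \
        --o-dna-sequences silva-132-ssu-nr99-dna-seqs.qza

    # Train a naive Bayes classifier on the untrimmed, full-length sequences.
    run qiime feature-classifier fit-classifier-naive-bayes \
        --i-reference-reads silva-132-ssu-nr99-dna-seqs.qza \
        --i-reference-taxonomy silva-132-ssu-nr99-tax.qza \
        --o-classifier silva-132-ssu-nr99-classifier.qza
}

build_full_length_classifier
```

Because no `extract-reads` step is involved, the reference keeps the whole SSU sequence rather than a primer-delimited region.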

AndreaC · September 11, 2021, 9:02pm

My goal is to replicate metagenomic analyses conducted on bacterial 16S with qiime2.

Using the complete SILVA database with qiime, I get percentages totally different from those provided, whereas if I repeat the analysis with the SILVA 16S database in kraken2, I get practically identical results.

Surprised by how much the values differ between databases, I want to create a database containing only 16S, like the kraken2 one, hoping to get similar results...

Any advice?
