Hi I am doing meta-analysis (sample size - 80 files) downloaded from NCBI and ENA.I have done the initial sequence analysis in dada2 in R and i have saved the files in RDS formate after makeSequenceTable command.
Now my question is - can I import the individual rds files into qiime2 and merge the seq tables and perform chimera. so that I can use fragment insertion to overcome the difference of primers used.
I tried to merge the rds files in dada2 r but its showing error as ; cannot allocate vector size of 76 GB.
Another difficulty might be importing the RDS data into QIIME 2, though this older community tutorial might be helpful here.
I can't say that I am overly optimistic about this working, but if you can't simply start by pulling all of the data into QIIME 2 initially, this might let you move forward! The easiest solution may be to merge the RDS files in R using a machine with more than 76 GB of RAM, though I am not familiar enough with that operation to know how much more you would need.
Sorry for taking your time, I just want to make sure the following workflow for 16S rDNA meta-analysis (different primer regions) is good.
so far I have the ASV tables in rds format (80 files) analyzed in R dada2, now I will convert and import it to qiime2 and merge the table and sequence, then I will do remove chimeras using vsearch.
followed by fragment insertion with the chimera-free files and downstream analysis.
I need some clarification regarding the merging of sequence and table and the issue of duplicate ASVs
I read some Q&A in the qiime2 forum, but I couldn't understand. Kindly clarify me, if I use the fragment insertion it can overcome the duplication of ASVs and downstream diversity analysis.
What concern about duplicate ASVs do you have? When you merge tables QIIME will recognize duplicate ASVs and merge them in the expected way. What is it about the process of fragment insertion that you expect to influence/interact with the presence of duplicate ASVs?
As a note, DADA2 performs chimera removal, if that is useful or unknown information to you.
If it would simplify things for you, running dada2 within qiime is a possibility. Then you wouldn't have to worry about importing into qiime afterwards. Has your issue with merging ASV tables been resolved?