This is not much of a problem...
- They are using proprietary primers? Often companies are willing to tell you how many bases to trim. Did you ask them for these details?
- Are you sure the primers are even in the resulting sequence data? I ask because not all protocols sequence "through" the primer.
- Mixed orientation:
- Trimming primers from mixed orientated reads is not a problem for cutadapt.
- You can also simply re-orient all of your reads to a reference database (e.g. SILVA ), using the
orient-seqs
command from the RESCRIPt plugin.
This is not true. You can simply use the full-length SILVA classifier for all variable regions.
This will work too of course. But you might be better off with my next suggestion.
Have you looked into the q2-sidle plugin? You'll be able to leverage all of your amplicon regions to obtain a better taxonomy.
-Cheers!
-Mike