Hi @Brandon,
I’ll give my point of view on but please remember that other may disagree.
I’ll start on your point (2). The 16S primers you using produce an amplicon roughly of 300bp. Of course there is a variability for this length depending on the species in the samples, may be short but may be a bit longer as well (where the limiting factor is the performance for the polymerase). Considering at the moment the length of 300bp, as you rightly saying, using the 2x150bp sequencing none of the tool you mention would be able to join R1 and R2 because there is no overlap between the two. I am not aware of any sensible methods to joint R1 and R2 in this case, but I would be suspicious if I found one: it probably would imply to predict a joining sequences, not sure from where and of which length. Also I would not how to treat possible chimeric sequences, in case you have, within a pair, R1 from one 16S and R2 from a different one (not an impossible case from the PCR step). Not considering the fact that I try to remove from the begin of the analysis any artificial sequence and any of this joining method would go in the opposite direction for me.
Considering shorter length amplicons, you may get joined sequences (with the length distribution you are seeing), but where they are coming from? It may derive from real short 16S but also from PCR artefacts. You will end up with a pool of these, and if we trust deblur results with a ratio about half and half.
Concluding, my point is that joining the reads you taking the risk of getting a resulting dataset enriched for the short sequence 16S and for PCR artefact, losing longer 16S. Now, if you have any source of information that in your samples you expect species with short 16S, you may not find this as problem.
On your point (1), I am not used to the 2x150 bp sequences, however I know there are other users that fairly commonly work with this read length (at least some using R1 only for the analysis). I understand it may be difficult to discuss this in a possible paper but you have equally to discuss the possible bias due to the non-overlapping reads. I suppose controls would be very useful in both cases.
On the difference between dada2 and deblur, given these are different methods I have no problem on the fact that they give slightly different results. For me just pick one, you have plenty of references for both. Again, only using a known community would help in the choice.
I hope I did not create more confusion.
Best wishes,
Luca