Using tax-credit for 18S rRNA classifier evaluation

:blush:

Awesome! Glad to hear tax-credit will be useful for you.

I am not really planning any tutorials at the moment... but see the instructions here for using the jupyter notebooks included in tax-credit. Here are a few tips:

  1. All mock communities used in tax-credit are derived from mockrobiota, which does contain one 18S mock community (it's a bit old and small so yours are probably better! If so, please consider contributing to mockrobiota so others can re-use). You can format your mock community datasets in the same way as this 18S mock community (mock-11) to get them in the correct format for tax-credit.
  2. Check out the jupyter notebooks for processing/analyzing 16S mock communities in tax-credit. You can analyze 18S datasets that are in the same format using the same notebooks just by changing the paths.

One reason why a formal tutorial is not being made is because the jupyter notebooks are sort of one part reproducible analysis, one part tutorial for anyone who can read a bit of python code (if that's not you, don't be too discouraged — most of the heavy lifting is done by other code behind the scenes, and the code in the notebooks should be reasonably easy to follow and mostly just requires altering file paths to match those on your system).

If following the tax-credit notebooks is too daunting, you may want to check out the methods in q2-quality-control for evaluating mock communities. These methods are not as suited as tax-credit for large-scale method testing/optimization, but expose many of the same functions in a much more user-friendly way (i.e., via QIIME2), so should get the job done, particularly if you are not planning on doing the same parameter tuning that we did in that preprint.

Thanks for your interest in tax-credit! I hope that it or q2-quality-control as useful for you. Please let me know if you run into any more trouble with tax-credit / q2-quality-control.

I hope that helps!