Error from feature-classifier: indices and data should have the same size

You are correct — this is related to the number of sequences, but not due to memory constraints.

@BenKaehler reported this bug to scikit-learn — did you figure out a way to address this in q2-feature-classifier @BenKaehler?

If you have not done so already, I would recommend dereplicating your sequences to reduce the number of unique sequences, rather than training on the full sequence set.

Otherwise, let’s see what Ben has to say!