Always predicting the most common taxonomy (baseline for comparison)
The dummy classifier is a baseline model that always predicts the most common taxonomy class from the training data. This provides a simple benchmark to compare against more sophisticated models. Any model that performs worse than this baseline is essentially useless.
For each fold, the model identifies the most frequent taxonomy in the training set and predicts it for every test example, regardless of tags or other features.
| Fold | Predictions | Accuracy | Mean p-adic loss |
|---|---|---|---|
| Fold 0 | 659 | 8.50% | 0.0013 |
| Fold 1 | 682 | 5.72% | 0.0013 |
| Fold 2 | 678 | 6.93% | 0.0013 |
| Fold 3 | 666 | 6.76% | 0.0014 |
| Fold 4 | 680 | 9.12% | 0.0013 |