Latest UMLLR Tag-order Ablation

One ordering change at a time, with the regressor held fixed.

Last rendered 2026-04-20 01:12 UTC

← Back to Latest benchmark summary

Rolling nightly benchmark compiled from the live operational runs. Best strategy in this bundle: taxonomy_association (0.275238 mean p-adic loss).

For these UMLLR ablations, Avg active params / classification is the mean number of active coefficients touched while classifying one product.

Ordering methods

The ablation keeps the greedy p-adic regressor fixed and changes only the tag ordering heuristic used before coefficient fitting.

taxonomy_association

Taxonomy-peaked tags first

For each tag, measure the share of its training occurrences that land in its single most common taxonomy. Tags with the strongest one-taxonomy association are scored first.

frequency

Most common tags first

Ranks tags by how often they appear in the training products.

random

Seeded random control

Uses a seeded random shuffle of the training tag vocabulary as a control condition.

battle_elo

Pairwise battle ranking

Ranks tags by fold-local Elo scores estimated from tag battles, while excluding the holdout fold from the ranking fit.

mean_title_position

Average title position

Ranks tags by their average recorded title position in the training products.

UMLLR tag-order ablation chart for latest benchmark view
Bar chart generated from the same bundle rows consumed by the notebook.
StrategyMean p-adic lossΔ vs battle_eloFold winsExact acc.Prefix-2 acc.Avg active params / classification
taxonomy_association0.275238-0.0589885/550.70%64.11%1.09
frequency0.321831-0.0123945/544.09%59.55%1.87
random0.325144-0.00908121/2543.45%58.18%1.44
battle_elo0.3342250.0000000/543.97%58.36%1.45
mean_title_position0.3354990.0012742/544.36%58.45%1.69