Latest UMLLR Tag-order Ablation

One ordering change at a time, with the regressor held fixed.

Last rendered 2026-05-19 22:18 UTC

← Back to Latest benchmark summary

Rolling nightly benchmark compiled from the live operational runs. Best strategy in this bundle: taxonomy_association (0.288520 mean p-adic loss).

For these UMLLR ablations, Avg active params / classification is the mean number of active coefficients touched while classifying one product.

Ordering methods

The ablation keeps the greedy p-adic regressor fixed and changes only the tag ordering heuristic used before coefficient fitting.

taxonomy_association

Taxonomy-peaked tags first

For each tag, measure the share of its training occurrences that land in its single most common taxonomy. Tags with the strongest one-taxonomy association are scored first.

frequency

Most common tags first

Ranks tags by how often they appear in the training products.

random

Seeded random control

Uses a seeded random shuffle of the training tag vocabulary as a control condition.

battle_elo

Pairwise battle ranking

Ranks tags by fold-local Elo scores estimated from tag battles, while excluding the holdout fold from the ranking fit.

mean_title_position

Average title position

Ranks tags by their average recorded title position in the training products.

UMLLR tag-order ablation chart for latest benchmark view
Bar chart generated from the same bundle rows consumed by the notebook.
StrategyMean p-adic lossΔ vs battle_eloFold winsExact acc.Prefix-2 acc.Avg active params / classification
taxonomy_association0.288520-0.0458725/548.50%62.81%1.10
frequency0.329248-0.0051435/542.62%58.81%1.90
random0.330284-0.00410719/2542.35%57.75%1.41
battle_elo0.3343910.0000000/542.99%58.21%1.41
mean_title_position0.3503600.0159680/541.91%56.60%1.74