`docs/design/search-quality-eval.md` Phase 1 (Class A canonical lookup + Class B framework-root, …

Search-quality version diff: v1.1.0 → v1.2.0

This is the Phase 1.8 version-to-version comparison KPI specified in issue #830, applied to the v1.1.0 → v1.2.0 jump. The measurement is end-to-end: both the binary and the DB swap between arms, so it captures the full user-felt delta rather than a binary-held-constant or schema-held-constant slice.

Measured 2026-05-21 · Strong

Headline result
+20 / 50 queries newly rank-1
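
As a rough illustration of how a headline like this and the cited metrics can be derived from paired per-query ranks, here is a minimal sketch. It is not the audit's actual harness: the function names and the toy rank lists are hypothetical, each query is assumed to have a single expected result, and "newly rank-1" is assumed to mean rank 1 in the new arm but not in the old arm.

```python
from typing import Optional, Sequence

# A rank is the 1-based position of the expected result, or None if it was
# not returned at all. (Assumption: one expected result per query.)
Rank = Optional[int]


def reciprocal_rank(rank: Rank) -> float:
    """Return 1/rank, or 0.0 when the expected result was not returned."""
    return 0.0 if rank is None else 1.0 / rank


def mean_reciprocal_rank(ranks: Sequence[Rank]) -> float:
    """Average the reciprocal ranks over all queries."""
    return sum(reciprocal_rank(r) for r in ranks) / len(ranks)


def precision_at_1(ranks: Sequence[Rank]) -> float:
    """Fraction of queries whose expected result is ranked first."""
    return sum(1 for r in ranks if r == 1) / len(ranks)


def newly_rank_1(before: Sequence[Rank], after: Sequence[Rank]) -> int:
    """Count queries that are rank 1 in the new arm but were not in the old arm."""
    return sum(1 for b, a in zip(before, after) if b != 1 and a == 1)


# Hypothetical toy data for five queries, NOT the audit's measurements.
ranks_old = [1, 3, None, 2, 5]   # stand-in for the older arm
ranks_new = [1, 1, 4, 1, 5]      # stand-in for the newer arm

print("newly rank-1:", newly_rank_1(ranks_old, ranks_new))
print("MRR old/new:", mean_reciprocal_rank(ranks_old), mean_reciprocal_rank(ranks_new))
print("P@1 old/new:", precision_at_1(ranks_old), precision_at_1(ranks_new))
```

Because both arms are scored on the same queries, the per-query rank pairs are also the natural input for a paired test such as the Wilcoxon signed-rank test listed in the sources.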

Read in detail

Each card opens its own page. The headline and charts above are all you need at a glance; the cards are for the why and how.

Sources cited in this measurement

Every metric and method this audit relies on, with a link to the foundational source. Auto-collected from the audit text.

Mean Reciprocal Rank

Voorhees (1999), TREC-8 QA Report


P@k (Precision at k)

Manning, Raghavan, Schütze (2008) IIR §8.4


Wilcoxon signed-rank test

Wilcoxon (1945), Biometrics Bulletin
