What is self-measurement for a marketing technology vendor?

It is the discipline of scoring your own commercial claims — positioning, pricing, competitive lines, benchmark results — by the same public methodology you apply to everyone else. The vendor runs its own canon through the same measurement axes, with no preferential treatment. Source: Assay Commercial Truth Index v0 spec.

How is self-measurement different from a benchmark report?

A benchmark report measures other vendors. Self-measurement includes the author in the same run, under the same rules, with the same published audit trail. The benchmark is only credible if the operator can lose on it. Source: Assay Commercial Truth Index v0 spec.

What does a vendor actually measure about itself?

Four axes: groundedness (can a buyer inspect each claim's source), calibration (does stated confidence match measured outcomes), coherence (does a change reach every surface), and auditability (is there a defensible record). Each maps to a brand pillar. Source: Assay Commercial Truth Index v0 spec.

Does Assay measure itself this way today?

Its own positioning canon — roughly 700 typed nodes, regenerated from the live database before publication — already lives in the Truth Graph the product sells, carrying source types and confidence ceilings. The public self-scored benchmark run is in development. Source: Assay dogfood-loop case study.

Self-measurement: when a marketing technology vendor scores its own claims

A VP of GTM evaluating martech in 2026 reads a lot of vendor benchmarks. Every one of them measures the customer’s data, the customer’s pipeline, the customer’s content. Almost none of them measure the vendor’s own claims by the same yardstick.

That asymmetry is the quiet tell. A vendor that will score your positioning but never publishes a score of its own is asking for trust it has not staked anything on.

The Commercial Truth manifesto argues that marketing is the last commercial function without infrastructure — engineering got version control, finance got the ledger, and the claims a company makes about itself never got a substrate (Source: Assay brand canon, §10 Truth Graph manifesto). Self-measurement is what happens when a vendor turns that substrate on itself first.

The concept, named plainly

Self-measurement is the discipline of scoring your own commercial claims — every positioning statement, every pricing tier, every competitive line, every benchmark result — by the same public methodology you apply to everyone else (Source: Assay Commercial Truth Index v0 spec). The vendor is one row in its own benchmark, under the same rules, with no preferential treatment.

The word doing the work is same. Not a separate internal QA pass, not a softer bar for the home team, not a marketing-approved highlight reel. The identical methodology, the identical test sets, the identical published audit trail that every other measured vendor receives (Source: Assay Commercial Truth Index v0 spec).

The honesty test is built into the design. A benchmark is credible only if its operator can lose on it (Source: Assay Commercial Truth Index v0 spec). A vendor that always tops its own index has not measured anything; it has marketed.

Why this matters now

This is not a tidiness argument. It became load-bearing because the substrate underneath GTM changed.

AI agents now read a company’s claims and re-emit them at machine speed, so an ungrounded or stale claim no longer waits for a human to catch it (the confident wrong answer). When five tools ground from five copies of your positioning, the buyer hears five versions of your company (5 AI tools, 5 versions of your company). The cost of an unmeasured claim moved from embarrassing to operational.

In that environment, a vendor’s own commercial truth is the thing under the most strain — and the thing buyers can least afford to take on faith (garbage in, gospel out). Self-measurement answers the procurement question that now precedes the demo: can you show me your own numbers, scored the way you score mine?

The primitives it introduces

Self-measurement is not a slogan; it rests on a small set of structural primitives that an ordinary benchmark never carries. Each maps to one axis of the framework Assay is developing, and each maps in turn to a brand pillar (Source: Assay Commercial Truth Index v0 spec).

Groundedness, measured (PIL-B1). For each fact the vendor publishes, can a buyer follow a link or verifier chain to an inspectable origin (Source: Assay Commercial Truth Index v0 spec)? The metric is source-coverage rate against a stratified sample, reported alongside a source-type distribution — what share of claims are first-party-verified versus AI-extracted (source-type taxonomy for marketing claims).

Calibration, measured (PIL-B2). When the vendor expresses confidence, is it calibrated against measured outcomes (Source: Assay Commercial Truth Index v0 spec)? The metric is expected calibration error, reported with a credible interval rather than a point estimate (expected calibration error in lead scoring). The companion metric is a “not yet” honesty rate — of the cases where the data is too thin to make a calibrated claim, what share does the vendor flag as insufficient rather than answering with false confidence (Source: Assay Commercial Truth Index v0 spec)?

Coherence, measured (PIL-B3). When the vendor’s information changes, does every downstream surface that consumes it reflect the change (Source: Assay Commercial Truth Index v0 spec)? The metrics are cross-surface consistency, propagation lag, and version-skew rate — the share of consumer-facing artifacts carrying a fact older than the canonical version (the knowledge decay curve).

Auditability, measured (PIL-B4). Can the vendor produce a defensible record of every decision, change, and AI inference (Source: Assay Commercial Truth Index v0 spec)? The metrics are audit-trail completeness, retention compliance, and tamper-evidence — a verifiable integrity mechanism a buyer can independently check (your AI made 50,000 claims last month).

The primitive that earns the word self-measurement is governance. The methodology constrains the operator equally — methodology changes pass a public comment window, result updates require regeneration from primary records rather than in-place edits, and the operator’s own scores run through the same pipeline as every other vendor’s (Source: Assay Commercial Truth Index v0 spec).

A worked example

The clearest example is not hypothetical. Assay already operates its own commercial canon on the substrate it sells, and the behavior is documented in the dogfood-loop case study.

The company’s positioning, pricing, competitive talk tracks, and proof points live as roughly 700 typed nodes — a figure the case study reports as approximate, regenerated from the live database before publication, not as a fixed point claim (Source: Assay dogfood-loop case study). Each node carries a source type and a confidence score that cannot exceed the source-type ceiling, the same constraint the groundedness axis measures in any vendor (Source: Assay dogfood-loop case study).

The self-measurement shows up honestly in the distribution. As of an approximate 2026-05-27 snapshot, regenerated from the live database before publication, the canon was roughly 60% first-party-verified and roughly 25% first-party-unverified — drafts and candidate frameworks not yet promoted (Source: Assay dogfood-loop case study). A vendor practicing self-measurement publishes that 25% rather than rounding it away; the unverified share is a measured number, not a blemish to hide (ai doesn’t hallucinate about your product).

Coherence is observable in the same canon. When one positioning line changed on that snapshot date, the cascade map flagged fourteen downstream nodes — the manifesto section, the positioning canon, a website section, the pricing copy, and several message variants — and the single decision propagated to all fourteen in one pass (Source: Assay dogfood-loop case study). The exact propagation-lag improvement at customer scale is not yet measured publicly; the case study reports the cascade as an approximate, regenerated count, not a benchmarked figure (Source: Assay dogfood-loop case study).

What the case study explicitly does not claim is the other half of self-measurement. It does not claim every node is perfectly maintained, it does not claim the loop is fully automatic, and it does not claim parity with customer-scale operations at 5,000 to 20,000 nodes (Source: Assay dogfood-loop case study). Stating the gaps in the same breath as the strengths is the discipline (consistency beats brilliance).

Where this lives in the substrate

Self-measurement is not a report a vendor writes once a year and files. It is a property of running your commercial claims on a substrate that can be measured at all — typed nodes, source ceilings, cascade edges, a hash-chained audit trail (what CRM did for pipeline, commercial truth does for knowledge).

For the VP of GTM, the takeaway is an evaluation lens, not a purchase. Ask any martech vendor for its own four-axis profile — groundedness, calibration, coherence, auditability — and watch whether the answer is a number with an audit trail or a deflection (the architect of truth). For the analyst, self-measurement is the structural credibility signal that separates an infrastructure play from a tool: the platform’s first customer is itself, scored by the standard it built (the infrastructure layer).

Closes / opens

Closes LSO §F.20 — the dogfood-loop cluster, and specifically the question of what it means for a marketing technology vendor to measure itself rather than only its customers.

Opens the next question: once a vendor publishes its own four-axis profile, what stops every competitor from publishing a self-flattering score under its own private methodology — and how does an open, operator-constrained standard with a public dispute queue become the one buyers trust over vendor self-report?

The methodology Assay is developing for the Commercial Truth Index is exactly this: an annual public benchmark that scores Assay alongside every other vendor with no preferential treatment, measuring whether each is grounded, calibrated, coherent, and auditable.

This essay is grounded in the Assay Commercial Truth Index v0 specification (CTI-V0) and the dogfood-loop case study (DLC-01). Methodology for the Commercial Truth Index is in development.