What Is Incrementality Testing?
Incrementality testing is a measurement method that estimates the true causal effect of a marketing activity by comparing outcomes between a group exposed to the activity and a group that was held out. Unlike attribution, which infers credit from observational data, incrementality testing creates a controlled comparison that isolates lift from baseline.
The method comes from clinical trial design. The exposed group is the treatment arm; the held-out group is the control. The difference in outcomes between the two is the incremental lift, the part of the result that would not have happened without the activity. Anything that would have converted regardless is excluded.
Why Incrementality Testing Matters
Last-click attribution and even multi-touch attribution chronically overstate the value of channels that capture in-market demand, especially branded search and retargeting. A user who has already decided to buy will click whatever appears in front of them, and the attribution system gives the credit to the last touch. Incrementality testing reveals that much of this "attributed" revenue would have happened anyway, while genuinely additive channels like upper-funnel video or AI search visibility may be undercredited.
For brands evaluating generative engine optimization spend, incrementality testing is the only honest answer to the board question "is this real?" An MMM will give a number; an incrementality test will tell you whether that number is causal.
How Incrementality Testing Works
The dominant designs are geo experiments, where some geographic regions receive the activity and matched regions do not, and conversion lift studies, where a randomly held-out audience is suppressed from a campaign. Both designs produce a difference-in-differences estimate of lift. Geo tests work for channels that cannot be targeted at the user level, including AI search, TV, OOH, and most podcast advertising. User-level lift studies work for digital channels where the platform can randomize delivery, which is how Meta and Google run their own lift products.
Statistical rigor depends on geo-matching quality, holdout sizing, and minimum detectable effect calculations done before the test starts. Synthetic control methods, including those used in tools like Aryma's DiDetective, can extract causal estimates even when randomization is not feasible.
In Practice
For AI search, the natural design is a geo holdout on the inputs that drive AI visibility, for example, pausing PR and content syndication in matched regions while continuing in the rest of the country. The lift in branded search, direct traffic, and AI-attributed referrals in the exposed regions, relative to controls, is the incrementality of the visibility investment.
Incrementality tests are not always running. They are periodic experiments, often two to six weeks, that calibrate the always-on MMM. A practical measurement program runs MMM continuously and incrementality tests on a rolling basis, one channel per quarter, to keep the model honest.
How Presenc AI Helps
Presenc AI provides the geo-level AI visibility data that incrementality tests need: weekly share of voice and citation frequency segmented by region. When a brand runs a geo holdout on AI visibility inputs, Presenc tracks whether the AI signal actually moved in exposed regions and stayed flat in controls, which is the precondition for the test to produce a clean lift estimate.
