How-To Guide

Causal Inference for Marketing Measurement

A practical introduction to causal inference methods for marketing measurement: RCTs, geo experiments, synthetic control, and difference-in-differences. When to use each.

By Ramanath, CTO & Co-Founder at Presenc AI · Last updated: May 17, 2026

Why Causal Inference Belongs in the Marketing Stack

Most marketing measurement is observational: it reports what was touched before a conversion and assigns credit by rule. Causal inference asks a different question: what would have happened to the outcome without the intervention? The difference between observed and counterfactual is the causal effect. Anything that would have happened anyway does not count.

For AI search and other dark-funnel channels, causal inference is the only honest measurement framework. Attribution systems do not see these channels at all; MMM sees correlations that need experimental calibration to become causal claims.

Method 1: Randomized Controlled Trials (User-Level)

When the platform supports randomization (Meta, Google Ads, Reddit, some ad networks), conversion lift studies randomize which users are exposed to the campaign and measure the difference in conversion rate between exposed and held-out groups. This is the cleanest causal evidence available and should be the default for digital channels with platform-side randomization.

Limitations: only available where the platform offers it, requires enough conversion volume to detect the expected lift, and does not work for channels with no user-level signal (TV, OOH, AI search).

Method 2: Geographic Lift Tests

For channels with no user-level signal, geographic regions are the unit of randomization. Treatment regions get the intervention; control regions do not. Difference in outcomes after pre-period adjustment is the causal estimate. This is the standard approach for TV, OOH, podcast, PR, and AI search.

Modern practice uses synthetic control rather than strict matched pairs. Tools include Google CausalImpact, Meta GeoLift, and Aryma's DiDetective. The synthetic control approach produces tighter confidence intervals and is more robust to imperfect market matching.

Method 3: Synthetic Control for Single Events

When a single intervention happens in a single unit (a brand campaign launches nationally, a Wikipedia article goes live, a new product is featured by a major AI platform), conventional comparison is impossible. Synthetic control constructs a weighted combination of comparable units to serve as the counterfactual for the single treated unit.

The result is a causal estimate even when randomization was infeasible, with the caveat of stronger assumptions about pre-period comparability. CausalImpact from Google is the standard implementation.

Method 4: Difference-in-Differences

When a treatment is applied to some units and not others, and pre-period data is available, difference-in-differences estimates the causal effect as the difference between pre-to-post change in treated units and pre-to-post change in controls. Less sophisticated than synthetic control but often sufficient for quick causal estimates.

Common application: a marketing program rolled out in some markets but not others, observed for at least one quarter pre and one quarter post. The diff-in-diff estimate is a simple regression that any analyst can run.

Method 5: Instrumental Variables

When randomization is impossible and synthetic control is not credible, an exogenous shock that affects the treatment but not the outcome directly can serve as an instrument. Examples: a competitor's outage that temporarily inflates your AI visibility, a regulatory change that affects PR coverage. Instrumental variable methods can extract causal estimates from these natural experiments but require strong assumptions and are mainly an academic tool.

How to Choose

The decision tree: if user-level randomization is available, use RCT. If not, and geographic targeting is possible, use geographic lift testing with synthetic control. If only a single event in a single unit, use Google CausalImpact for synthetic control. If a natural treatment-control split exists with pre and post data, use difference-in-differences. If an exogenous shock is available, consider instrumental variables. If none of the above, MMM with informative priors, calibrated against periodic lift tests when the opportunity arises.

Integrating Causal Inference With MMM

MMM is correlational on its own. The integration is to run periodic causal tests (RCTs where available, geo lift tests otherwise) on each material channel on a rolling basis. The MMM coefficient on each channel should agree with the causal test estimate within confidence interval; persistent disagreement is a signal that the MMM spec needs revisiting.

Mature programs run one causal test per quarter on a different channel each time, cycling through the channels every six to eight quarters. AI search visibility belongs in the rotation, with the test designed as a geographic lift on AI visibility inputs.

How Presenc AI Helps

Presenc AI provides the AI visibility signal that causal inference work on AI search depends on. Both the always-on MMM use case (needs weekly visibility series) and the periodic lift test use case (needs DMA-level visibility data) are supported by Presenc's default exports. For marketing science teams running rigorous causal measurement programs, Presenc is the missing data layer for the AI channel.

Frequently Asked Questions

Not on its own. MMM is a regression on observational data; the coefficients become causal when calibrated against periodic experimental ground truth (lift tests or RCTs). Without calibration, MMM is decision-support rather than causal evidence. The combination of MMM plus periodic causal tests is what produces channel-level causal claims that survive scrutiny.
Some methods, yes. Geographic lift testing with synthetic control is accessible via open-source tools (Google CausalImpact, Meta GeoLift) and can be run by a competent analyst with a strong stats background. Sophisticated MMM and instrumental variable methods typically need dedicated marketing science capacity. The right starting point for non-specialist teams is a vendor-led lift test.
A geographic lift test typically costs 5 to 15 percent of the campaign budget being tested (the cost of running the holdout, plus analyst time). The cost is more than offset by the budget-allocation improvements the test enables. Tests that confirm the channel works are nearly as valuable as tests that show it does not, because both produce the calibration data MMM needs.
Underpowered tests. Running a geo holdout for four weeks because "we needed to see something" produces null results that get misread as "the channel does not work" when in fact the test was too short to detect the real effect. The fix is mechanical: run the power calculation before the test starts, and commit to the duration the calculation requires.

Track Your AI Visibility

See how your brand appears across ChatGPT, Claude, Perplexity, and other AI platforms. Start monitoring today.