The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲
← Papers

A simple budget-split that adds an explicit 'unknown' gender allocation meaningfully reduces gender skew in Google Ads delivery without shutting out users platforms can't infer; it offers resource-constrained public advertisers a lower-cost compromise between ignoring demographics and expensive granular targeting.

Into the Unknown: Accounting for Missing Demographic Data when Mitigating Ad Delivery Skew
Isabel Corpus, Allison Koenecke · May 12, 2026
arxiv quasi_experimental medium evidence 7/10 relevance Source PDF
Allocating campaign budgets across male, female, and an explicit unknown gender bucket reduces gender-based under-delivery on Google Ads without excluding users whose demographics are not inferred, offering a practical middle ground between ignoring demographics and costly granular targeting.

Online advertising platforms use algorithmic systems to power the process of matching ads to users, termed ad delivery. Prior audits have demonstrated that ad delivery can be skewed by demographic attributes, such that ads are systematically under-delivered to certain groups despite advertiser intent to reach groups proportionally. This under-delivery raises a serious concern in the context of ads promoting public services, which might prevent certain groups of individuals from accessing information about resources on the basis of their demographic identity. In the absence of platform-provided solutions to skewed ad delivery, advertisers can counteract skew by targeting demographic groups directly. However, direct targeting excludes users whose demographics the platform cannot infer ("unknown users") if advertising platforms do not provide a way to target unknown users directly, as is the case on Google Ads. We collaborate with a state-level government agency to reduce gender-based skew in ad delivery with an intervention that accounts for unknown users while incorporating gender-based targeting. In particular, we design a budget split intervention that directly incorporates unknown users and targets users with Google-inferred gender labels (i.e., male, female). We find that this intervention is a valuable approach to addressing ad delivery skew without excluding unknown users, and serves as a middle ground in the trade-off between higher costs (from more granular demographic targeting) and skew (from ignoring demographics entirely). This approach is responsive to the needs of real-world, resource-constrained advertisers who are committed to the equitable distribution of public service outreach via online advertising. We conclude with recommendations for government advertisers, online advertising platforms, and researchers.

Summary

Main Finding

A field case study of a government Google Search advertising campaign found persistent gender-based ad delivery skew (over-delivery to users labeled male). Implementing a novel "unknown-aware" budget split intervention — splitting the original campaign into separate campaigns that target Google-inferred male, female, and unknown users with tailored budget allocations — reduced delivery skew while mitigating the cost increases typically associated with demographic budget-splitting. Including the platform’s "unknown" demographic category avoided excluding users with missing labels (who are non-random and likely to include socioeconomically disadvantaged and non-binary individuals) and provided a practical middle ground between ignoring demographics and expensive granular targeting.

Key Points

  • Problem: Algorithmic ad delivery can introduce demographic skew (predicted relevance, competition, and baseline engagement drive this), undermining public-interest outreach.
  • Data limitations: Advertisers often must rely on platform-inferred demographics (male, female, unknown) rather than ground-truth sources (voter files) or purchased broker data; "unknown" is common and non-random.
  • Intervention: Designed and deployed a budget split that explicitly included an "unknown" campaign in addition to male/female campaigns, allocating budget to enforce population-proportional delivery goals while preserving reach to unknown users.
  • Outcome (qualitative): The unknown-aware budget split reduced gender delivery skew and moderated cost increases relative to more granular demographic targeting approaches that exclude unknowns or rely on external data.
  • Trade-offs: Using platform-inferred labels raises fairness-privacy and fairness-utility tensions — improving parity may raise costs and involves operating on imperfect/invasive demographic inferences.
  • Broader significance: The approach is pragmatic for resource-constrained public-sector advertisers and illustrates how handling missing demographic data changes mitigation design and outcomes.

Data & Methods

  • Setting and partnership
    • Collaborated with a state-level government agency promoting a website with small-business resources.
    • Focus platform: Google Search (dominant paid-search market share).
  • Timeline
    • Pre-intervention evaluation: April–November 2024 (historical campaigns).
    • Intervention deployed: January–February 2025.
  • Metrics used
    • Impressions (primary measure of ad delivery), clicks, conversions (site actions indicating meaningful engagement).
    • CTR (clicks/impressions), CVR (conversions/impressions), CPM (cost per 1,000 impressions), spend attributed to clicks.
    • Demographic attribution via Google-inferred gender labels: male, female, unknown.
  • Audit and intervention design
    • Audit: measured delivery shares by Google-inferred gender and compared to target distribution (e.g., population parity).
    • Baseline finding: consistent over-delivery to users labeled male.
    • Intervention: "Unknown-aware budget split" — replaced the single campaign with multiple targeted campaigns (male, female, unknown), allocating budgets to achieve desired delivery shares while allowing the platform’s auction and relevance scoring to operate within each targeted sub-campaign.
  • Rationale for unknown inclusion
    • Platform "unknown" users are common on search platforms (higher logged-out share), and missingness is correlated with low online footprint, lower socioeconomic status, and non-binary gender identities. Excluding them risks systematic exclusion of disadvantaged groups.
  • Limitations
    • Demographic labels are platform inferences (measurement error, unknown accuracy).
    • Single case study on one platform and one state campaign; quantitative effect sizes are context-dependent.

Implications for AI Economics

  • Market structure and platform power
    • Platforms’ inference systems and delivery optimization create externalities: advertisers’ objectives interact with platform auction dynamics and relevance models, yielding unpriced distributional effects (who sees ads).
    • The presence of a sizable "unknown" population (and opaque inference quality) affects bidders’ effective demand and competition across audience segments.
  • Pricing and competition
    • Targeting splits campaigns into separate auctions for subgroups; this changes competition and prices (CPM/CPV) across segments. Unknown-aware splitting can mitigate price spikes caused by attempting to force parity across costly subpopulations.
    • Platforms that offer or withhold controls (e.g., ability to target unknown) influence advertisers’ feasible strategies and thus surplus distribution between advertisers and platforms.
  • Valuation of demographic data
    • Demographic inferences are economically valuable but noisy. The uncertainty and missingness of such signals reduce the precision of targeting, affecting advertiser returns and the social value of outreach (especially for public-good advertising).
    • Data-broker alternatives are costly and ethically fraught; reliance on platform labels creates a market for demographic inference that has distributional and privacy externalities.
  • Welfare and regulatory implications
    • Skew in ad delivery creates allocative harms (inequitable information distribution) that are not internalized by platforms’ objective functions. Economic analysis should account for these social externalities when evaluating platform policies or interventions like VRS (variance reduction).
    • Interventions (platform-side algorithmic mediation vs. advertiser-side budget splits) have different cost, transparency, and control properties. Budget splits give advertisers control but may increase costs; platform mediation may be opaque and costly.
  • Design and mechanism implications
    • Auction and relevance mechanisms should be studied under missing attribute information (how auctions allocate and price impressions when demographic labels are missing or noisy).
    • Mechanism design solutions could incorporate parity constraints, priced fairness, or subsidization for public-interest campaigns to internalize distributional goals without imposing undue cost on advertisers.
  • Research directions for AI economics
    • Formal models of ad auctions with missing/noisy demographic signals: characterize equilibrium outcomes and cost-fairness trade-offs.
    • Welfare analysis comparing platform-side fairness enforcement (e.g., VRS) versus advertiser-side strategies (budget splits), including cost transfers and impacts on user privacy.
    • Empirical work on how missingness correlates with socioeconomic attributes and how that changes the marginal value of impressions.
    • Policy analysis on mandates for platform transparency (accuracy of inferences) and controls (explicit unknown targeting), and the economic effects of such regulation on small and public-sector advertisers.

Practical takeaways for economists and policymakers: missing demographic information materially affects mitigation strategies and economic outcomes in ad markets; designing auctions and regulation that account for missingness and distributional externalities can better align platform incentives with public-interest outreach goals.

Assessment

Paper Typequasi_experimental Evidence Strengthmedium — The study uses real-world campaign deployments in partnership with a government advertiser and demonstrates practical reductions in gender delivery skew; however, the lack of explicit randomization, likely small and single-agency sample, platform-specific dynamics, and potential confounding from auction mechanics and temporal effects limit strong causal inference and external validity. Methods Rigormedium — The intervention is well-motivated and implemented in operational settings with relevant outcome measurement (impressions, spend, delivery shares), but the paper appears to rely on quasi-experimental comparisons without randomized assignment, has limited detail (or control) over platform auction and inference processes, and faces measurement issues around platform-inferred gender and the 'unknown' category. SampleCampaign-level data from a state-level government agency's Google Ads public-service campaigns, including impressions, clicks, spend, and delivery broken down by Google-inferred gender labels (male, female, and unknown); intervention consists of campaigns where budget was explicitly split across those targeting buckets; sample size and time window not specified in the summary. Themesgovernance inequality IdentificationImplement a budget-split intervention that allocates campaign spend across Google-inferred gender buckets (male, female) plus an explicit 'unknown' allocation, and compare ad delivery outcomes (impression share, spend, reach) across these allocations versus baseline campaigns that do not incorporate unknowns; identification rests on within-campaign or across-campaign comparisons of delivery shares under different budget allocations rather than randomized assignment, with adjustment for campaign-level constraints and auction dynamics. GeneralizabilitySingle government advertiser at state level — may not generalize to private-sector advertisers or other government contexts, Results rely on Google Ads' specific demographic inference and auction mechanics — other platforms (Facebook, TikTok, programmatic exchanges) may behave differently, Gender labels are limited to platform-inferred categories (male/female/unknown) and do not capture nonbinary identities or inference errors, Findings pertain to public-service outreach campaigns; commercial ad dynamics and value-based bidding could alter outcomes, Scale and geographic/demographic composition of the campaign audience may limit transferability to other regions or campaign sizes

Claims (8)

ClaimDirectionConfidenceOutcomeDetails
Ad delivery can be skewed by demographic attributes, such that ads are systematically under-delivered to certain groups despite advertiser intent to reach groups proportionally. Task Allocation negative high gender-based (and demographic) skew in ad delivery / under-delivery of ads to certain demographic groups
0.48
Skewed ad delivery of public-service ads can prevent certain groups of individuals from accessing information about resources on the basis of their demographic identity. Consumer Welfare negative high access to public-service information due to demographic skew in ad delivery
0.08
In the absence of platform-provided solutions to skewed ad delivery, advertisers can counteract skew by targeting demographic groups directly. Task Allocation positive high ability of advertisers to mitigate ad delivery skew via direct demographic targeting
0.24
Direct demographic targeting excludes users whose demographics the platform cannot infer ('unknown users') if advertising platforms do not provide a way to target unknown users directly, as is the case on Google Ads. Adoption Rate negative high inclusion/exclusion of 'unknown' users under direct demographic targeting on Google Ads
0.48
We design a budget split intervention that directly incorporates unknown users and targets users with Google-inferred gender labels (male, female). Task Allocation neutral high design and implementation of a budget split intervention incorporating unknown users and Google-inferred gender labels
0.48
The budget split intervention is a valuable approach to addressing ad delivery skew without excluding unknown users. Task Allocation positive high reduction of gender-based ad delivery skew while maintaining inclusion of unknown users
0.48
The intervention serves as a middle ground in the trade-off between higher costs (from more granular demographic targeting) and skew (from ignoring demographics entirely). Firm Productivity mixed high trade-off between advertising cost and magnitude of ad delivery skew
0.48
This budget-split approach is responsive to the needs of real-world, resource-constrained advertisers committed to equitable distribution of public service outreach via online advertising. Organizational Efficiency positive high practical suitability / responsiveness of the intervention for resource-constrained advertisers
0.24

Notes