Limit, label and let users contest: the paper argues that LLM-powered robots allocating scarce, real-time help should use a governance-vetted menu of prioritization modes, clearly communicate which mode is active, and offer an easy contest pathway; this reduces hidden value skews and ad-hoc user configuration but requires governance, adjudication capacity, and careful design to avoid new inequities.

Designing for Disagreement: Front-End Guardrails for Assistance Allocation in LLM-Enabled Robots

Carmen Ng · March 17, 2026

arxiv descriptive n/a evidence 7/10 relevance Source PDF

The paper proposes 'bounded calibration with contestability' — constrain LLM-enabled robots to a governance-approved menu of prioritization modes, make the active mode legible at deferral, and provide outcome-specific contest channels to manage pluralistic values and LLM unpredictability in real-time assistance allocation.

LLM-enabled robots prioritizing scarce assistance in social settings face pluralistic values and LLM behavioral variability: reasonable people can disagree about who is helped first, while LLM-mediated interaction policies vary across prompts, contexts, and groups in ways that are difficult to anticipate or verify at contact point. Yet user-facing guardrails for real-time, multi-user assistance allocation remain under-specified. We propose bounded calibration with contestability, a procedural front-end pattern that (i) constrains prioritization to a governance-approved menu of admissible modes, (ii) keeps the active mode legible in interaction-relevant terms at the point of deferral, and (iii) provides an outcome-specific contest pathway without renegotiating the global rule. Treating pluralism and LLM uncertainty as standing conditions, the pattern avoids both silent defaults that hide implicit value skews and wide-open user-configurable "value settings" that shift burden under time pressure. We illustrate the pattern with a public-concourse robot vignette and outline an evaluation agenda centered on legibility, procedural legitimacy, and actionability, including risks of automation bias and uneven usability of contest channels.

Summary

Main Finding

The paper proposes "bounded calibration with contestability" as a design pattern for LLM-enabled robots that must allocate scarce, real-time assistance among multiple people. The pattern constrains prioritization to a governance-approved set of admissible modes, makes the active mode legible at the moment of deferral, and provides an outcome-specific, easy-to-use contest channel. This approach treats pluralistic values and LLM behavioral variability as persistent conditions and aims to avoid hidden value skews and burdensome, ad-hoc user configuration under time pressure.

Key Points

Problem framed: LLM-mediated, multi-user assistance allocation combines pluralistic (reasonable-disagreement) value tradeoffs with unpredictable LLM behavior across prompts and contexts, making onsite allocation decisions socially fraught and technically brittle.
Design pattern (bounded calibration with contestability) has three core elements:
Governance-approved menu of admissible prioritization modes (limits the policy space to vetted options).
Legibility at the contact point (the robot clearly communicates which mode is active and why when deferring or prioritizing).
Outcome-specific contest pathway (users can contest a particular decision without renegotiating global rules).
Rationale: Avoids two problematic alternatives—opaque silent defaults that mask value choices, and wide-open user-configurable value sliders that offload moral choice under stress.
Illustration and evaluation: The pattern is shown via a public-concourse robot vignette and an evaluation agenda focused on legibility, procedural legitimacy, and actionability.
Risks identified: automation bias (over-reliance on robot decisions), uneven usability/access to contest channels, potential gaming or strategic behavior, and governance/design costs.

Data & Methods

Nature of work: Conceptual/design proposal rather than an empirical study.
Evidence in paper: A concrete vignette (public-concourse robot) used to illustrate how the pattern plays out in practice, plus a proposed evaluation agenda.
Proposed evaluation methods (as outlined by the authors):
- User studies and field pilots measuring legibility (do people understand the active mode?), procedural legitimacy (do people see the process as fair?), and actionability (can users effectively contest outcomes?).
- Metrics to collect: contest frequency and outcomes, time-to-help for different groups, user satisfaction, perceived fairness, incidence of automation bias, and usability/access disparities for contest pathways.
- Potential empirical tools: A/B experiments, logs/audit trails of decisions, surveys/interviews, and controlled scenarios to test manipulation/gaming.
Limitations: No empirical data reported; evaluation agenda remains to be operationalized and tested in deployments.

Implications for AI Economics

Allocation efficiency vs. equity tradeoffs: The pattern recognizes that resource allocation in real time is both an efficiency and distributional problem. Constraining policy options can reduce costly misallocations due to LLM unpredictability but may also lock in distributional outcomes chosen by governance.
Transaction and enforcement costs: Implementing governance-approved menus, legibility interfaces, and contest systems creates administrative and operational costs (design, monitoring, adjudication). These are economic frictions that affect overall system welfare and scale.
Externalities and strategic behavior: Readily contestable decisions create incentives for strategic contesting (false claims, gaming) and may increase congestion of the assistance system; conversely, transparent rules can reduce costly disputes and increase trust.
Incentives for governance design: Who designs the admissible menu matters—regulators, platform operators, or local stakeholders—and their incentives shape distributional outcomes. Economic policy analyses should account for capture, political economy, and welfare implications of menu selection.
Measurement and auditing value: Legible decision modes and recorded contest pathways improve verifiability and lower information asymmetries. This has value for regulators and platforms in reducing litigation, reputational risk, and monitoring costs.
Adoption and labor effects: Devices that transparently allocate help and offer contest routes may increase user trust and uptake but could reduce some on-site human discretion (affecting jobs that currently triage); conversely, contest resolution may create new labor or adjudication roles.
Welfare and inequality: If contest channels are unevenly usable (digital literacy, language, physical access), the pattern could exacerbate inequities unless contest pathways are designed and resourced to be inclusive.
Research and policy priorities: Empirical evaluation (field experiments, logged outcomes, cost–benefit analyses) is needed to quantify tradeoffs—efficiency gains from constrained, legible systems versus costs of governance and contest handling; distributional impacts across demographic groups; and effects on subsequent behavior (gaming, reliance).

If you want, I can (a) draft an experimental design and metrics to evaluate bounded calibration with contestability in a field pilot, or (b) map the likely welfare tradeoffs into a simple economic model (social planner vs. platform designer) to quantify when the pattern improves expected welfare. Which would be most useful?

Assessment

Paper Typedescriptive Evidence Strengthn/a — The paper is a conceptual/design proposal with no empirical data or causal estimates; it provides a vignette and a proposed evaluation agenda but does not present tested evidence. Methods Rigorlow — Arguments are conceptual and illustrated by a single vignette; proposed empirical methods (user studies, field pilots, A/B tests, logs) are not implemented or reported. SampleNo empirical sample — the work is illustrated with a single public-concourse robot vignette and supplemented by a proposed evaluation agenda that suggests future user studies, field pilots, logs/audit trails, surveys, and controlled experiments. Themeshuman_ai_collab governance inequality GeneralizabilityNo empirical validation — unknown performance or effects in deployed settings or at scale, Illustration is domain-specific (public concourse); unclear transferability to other contexts such as healthcare, manufacturing, or online platforms, Relies on governance capacity to define, vet, and enforce admissible mode menus — may not hold where governance is weak or contested, Assumes equitable access and usability of contest channels; outcomes may differ across users with varying digital literacy, language ability, or physical access, Effectiveness depends on LLM behavior and system architecture; variability across models and prompts could limit applicability

Claims (14)

Claim	Direction	Confidence	Outcome	Details
Bounded calibration with contestability is a viable design pattern for LLM-enabled robots that must allocate scarce, real-time assistance among multiple people. Task Allocation	positive	medium	feasibility/viability of the design pattern (qualitative)	0.02
The pattern constrains prioritization to a governance-approved menu of admissible modes, limiting the policy space to vetted options. Governance And Regulation	null_result	high	existence of governance-approved admissible modes (system design property)	0.03
The pattern requires legibility at the contact point so the robot clearly communicates which active mode is in use and why when deferring or prioritizing. Worker Satisfaction	null_result	high	legibility of active mode (user understanding at time of deferral)	0.03
The pattern provides an outcome-specific, easy-to-use contest channel allowing users to contest particular decisions without renegotiating global rules. Governance And Regulation	null_result	high	availability and specificity of contest channels (system functionality)	0.03
Bounded calibration with contestability avoids opaque silent defaults that mask value choices and avoids wide-open user-configurable value sliders that offload moral choice under stress. Ai Safety And Ethics	positive	medium	reduction in hidden value-skews and offloaded moral choice (qualitative assessment)	0.02
The pattern improves legibility, procedural legitimacy, and actionability compared to systems without these elements (proposed as evaluation goals). Worker Satisfaction	positive	medium	legibility (user comprehension), procedural legitimacy (perceived fairness), actionability (successful contest rates, time-to-resolution)	0.02
Implementing governance-approved menus, legibility interfaces, and contest systems imposes administrative and operational costs (design, monitoring, adjudication). Organizational Efficiency	negative	medium	administrative/enforcement costs (design time, ongoing monitoring/adjudication workload)	0.02
The pattern can reduce costly misallocations caused by LLM unpredictability by constraining policy options, improving overall allocation efficiency in expectation. Task Allocation	positive	medium	allocation efficiency (time-to-help, correct-priority assignments, resource utilization)	0.02
Readily contestable decisions create incentives for strategic contesting (false claims, gaming) and may increase congestion of the assistance system. Organizational Efficiency	negative	medium	contest frequency, incidence of strategic/gaming behavior, system congestion (delays)	0.02
Legible decision modes and recorded contest pathways improve verifiability and lower information asymmetries, aiding regulators and platforms in monitoring and reducing litigation/reputational risk. Regulatory Compliance	positive	medium	verifiability/auditability (availability of logs), regulator/platform monitoring costs, incidence of disputes	0.02
If contest channels are unevenly usable (due to digital literacy, language, physical access), the pattern could exacerbate inequities unless contest pathways are designed inclusively. Inequality	negative	medium	equity measures (time-to-help by demographic group, contest access/use rates, usability scores)	0.02
Adoption of devices that transparently allocate help and offer contest routes may increase user trust and uptake but could reduce on-site human discretion, affecting jobs that triage assistance. Adoption Rate	mixed	low	user trust/adoption rates, change in human triage roles/employment	0.01
The paper does not report empirical data; instead it provides a vignette and a proposed evaluation agenda (user studies, field pilots, A/B tests, logs, surveys). Other	null_result	high	presence/absence of empirical data in the paper (binary)	0.03
Key measurable metrics for future evaluation include contest frequency and outcomes, time-to-help for different groups, user satisfaction, perceived fairness, incidence of automation bias, and usability/access disparities. Other	null_result	high	the specified metrics (contest frequency/outcomes, time-to-help, satisfaction, perceived fairness, automation bias incidence, usability/access disparities)	0.03