HARMONY: a proposed operating model for agentic R&D that seeks to reverse the productivity decline by shifting researchers' time from hidden coordination work to strategic science through bounded autonomous agents, orchestration metrics, and a new sciencepreneur role; evidence is conceptual and based on four expert interviews and foresight scenarios.

From Replacement to Orchestration: A Socio-Technical Architecture for Agentic AI in Corporate R&D

Haithem Boussaid, Marc Heemskerk, Jimmy Siméon, Adam Breen, Merouane Debbah · May 23, 2026

arxiv theoretical low evidence 7/10 relevance Source PDF

The paper proposes HARMONY, a four‑pillar socio‑technical operating model that reallocates researchers' cognitive load from coordination to high‑value scientific work via hybrid human‑agent systems, ethics‑by‑design, strategic visibility, and new sciencepreneur roles.

Purpose: Corporate R&D faces a persistent productivity paradox: rising investment and expanding scientific knowledge have not translated into proportional innovation output. In pharmaceuticals this is captured as Eroom's Law; analogous patterns appear across engineering, materials science, and healthcare. The core cause is not insufficient tools but cognitive saturation: researchers spend an increasing share of their effort on coordination, documentation, and data governance -- hidden work that displaces high-value hypothesis formation, interpretation, and strategic synthesis. Design/Methodology/Approach: The paper uses a Design Science Research (DSR) methodology. The artifact is the HARMONY operating model. Evidence is triangulated from four semi-structured expert interviews with senior R&D leaders across industrial, healthcare, and academic settings; a foresight scenario analysis projecting four plausible 2040 R&D futures; and pattern matching with documented agentic R&D deployments. Two non-negotiable design requirements guide the architecture: cognitive-load redistribution (DR1) and bounded autonomy with alignment (DR2). Findings: We propose HARMONY -- Hybrid Agentic Research Model for Organisational New Yield -- a four-pillar socio-technical architecture comprising ResOps (Industrialized Execution), the Control Tower (Strategic Visibility and Drift Detection), the Ethics Fabric (Bounded Autonomy by Design), and the Talent Studio (Sciencepreneur Capability). The model introduces the Sciencepreneur as the central human archetype in agentic R&D, and Orchestration Leverage as a candidate productivity metric suited to human-agent hybrid systems.

Summary

Main Finding

HARMONY (Hybrid Agentic Research Model for Organisational New Yield) is a socio-technical operating architecture that reframes agentic AI in corporate R&D from a “replacement” problem to an “orchestration” problem. Deploying agentic systems without a purpose-built organizational architecture creates Research Debt and strategic drift. HARMONY’s four interdependent pillars — ResOps, the Control Tower, the Ethics Fabric, and the Talent Studio — plus a new human archetype (the Sciencepreneur) are designed to (a) absorb hidden work, (b) preserve bounded autonomy and alignment, and (c) unlock productivity gains by shifting researcher effort toward framing, synthesis, and strategy. The model implies that agentic automation will increase, not decrease, the strategic value of human researchers (the authors frame this as a Jevons Paradox of R&D).

Key Points

R&D productivity paradox: decades of rising R&D inputs have not produced proportional innovation output (Eroom’s Law / Bloom et al.). A major cause is “hidden work” (coordination, data governance, documentation) that displaces high-value scientific thinking.
Agentic AI capability: modern agent architectures and self-driving labs can perform multi-step delegated execution and can absorb hidden work if integrated correctly.
Replacement fallacy vs orchestration logic:
- Replacement hypothesis (linear substitution of humans with agents) is empirically and conceptually flawed.
- Jevons Paradox of R&D: lower marginal costs of execution (via agents) will expand experimental volume and therefore increase demand for human strategic roles (framing, interpretation, governance).
Research Debt: misaligned or ungoverned agentic outputs accumulate costs (errors, churn, compliance risks) analogous to technical debt.
Two non-negotiable design requirements:
- DR1 — Cognitive-Load Redistribution: systematically absorb hidden work to free human cognitive bandwidth for exploration and synthesis.
- DR2 — Bounded Autonomy and Alignment: enable high agent autonomy while maintaining real-time control on strategy, ethics, and resource use.
The HARMONY pillars:
- ResOps (Execution): version-controlled, reusable research pipelines and workflow abstraction to offload routine, mechanical, and coordination tasks.
- Control Tower (Coordination): real-time portfolio visibility, Research Drift detection, and dynamic reallocation of agent capacity to maintain strategic alignment.
- Ethics Fabric (Governance): governance-by-design (autonomy wallets, guardian agents, audit trails) that bind agent action to constraints (compute, data access, risk class).
- Talent Studio (Capability): training and career design to develop Sciencepreneurs — researchers with orchestration skills (hypothesis architecture, orchestration fluency, meaning-making, ethical judgment).
System coherence: pillars are interdependent; removing one produces Research Debt and systemic failure. The model is meant to jointly redesign technical and social subsystems (socio-technical approach).
Practical indicators and early signals: authors propose Orchestration Leverage as a productivity metric tailored to human-agent hybrids and cite early empirical signals (2025–2026) of rising compute costs, increased code churn in AI-heavy projects, and organizational budget strains.

Data & Methods

Research paradigm: Design Science Research (DSR) — artifact creation and evaluation against design requirements.
Evidence triangulation:
Four semi-structured expert interviews (45–75 min) with senior R&D leaders from diverse sectors: - Prof. Merouane Debbah (6G Research Center, Khalifa University) — academic/frontier AI - Marc Heemskerk (ASML) — high-tech manufacturing R&D - Jimmy Siméon (IA Medical / CHU Pointe-à-Pitre) — healthcare/clinical AI - Joyce Lansen (Transdev) — public transport/service
Foresight / scenario analysis: four 2040 scenarios varying regulatory intensity and decentralization to stress-test design choices: - A: Hyper-Regulated World - B: Decentralized Innovation - C: Talent Scarcity - D: Collaborative AI Ecosystems
Pattern matching with documented agentic R&D deployments and contemporary software engineering signals (e.g., compute cost pressures, code churn reports).
Analytical approach: derive design requirements from literature and interviews (thematic coding), construct HARMONY artifact, demonstrate via pattern matching, evaluate against requirements and scenarios.
Limitations (implicit in method): small interview sample, qualitative/DSR focus, foresight scenarios are hypothetical, early empirical evidence is preliminary.

Implications for AI Economics

Labor complementarity and revaluation of human roles:
- Agentic automation increases demand for higher-order human tasks (framing, synthesis, governance). Economically, this implies substitution limited to routine tasks and complementarity for skilled orchestration roles (Sciencepreneurs). Wages and scarcity premia may shift toward orchestration-capable researchers.
Productivity measurement and metrics:
- Traditional R&D productivity metrics (outputs per researcher or headcount) are inadequate. The paper proposes Orchestration Leverage — a metric to capture value added by human direction relative to autonomous execution — and suggests tracking Research Debt accumulation and compute vs personnel cost ratios.
Capital and operating cost structure:
- Agentic R&D may shift cost structures toward larger variable compute and tooling expenditures. Early signals (compute consuming budgets, token costs) suggest firms must model compute as a first-order operating cost and incorporate it into ROI and capacity planning models.
Organizational capital and returns to governance:
- Investments in governance (Ethics Fabric) and coordination (Control Tower) are productive rather than purely compliance costs; they reduce Research Debt and increase leverage of agentic systems. Economically, returns to organizational design (socio-technical capital) may rise relative to pure algorithmic investment.
Jevons-like demand effects:
- Lower marginal cost of execution can expand the scope and scale of experiments (more hypotheses tested). This can increase total resource consumption (compute, materials) and create novel externalities (data use, environmental footprint). Policymakers and firms should anticipate scale effects rather than assuming automation reduces aggregate resource use.
Policy and market structure:
- Regulation scenarios (e.g., Hyper-Regulated World) make embedded accountability and auditability economically valuable. Firms that standardize governance-by-design may gain first-mover advantages in regulated sectors (pharma, healthcare).
Measurement and investment implications for economists and managers:
- Track orchestration-related KPIs (Orchestration Leverage, Research Drift incidents, Research Debt accumulation, compute-to-personnel cost ratios).
- Portfolio allocation models should internalize dynamic agentic throughput and the risk of drift; decision rules and discounting should account for governance costs and potential rework from Research Debt.
- Human capital investment should prioritize orchestration competencies; returns to training and career redesign may be higher than incremental automation spending alone.
Risks and externalities:
- Mis-specified deployments risk increased churn, opaque outputs, and costly rework. Market signals (compute cost spikes, increased debugging/churn) can precede value destruction. Economically, firms must treat governance and orchestration as risk mitigation investments with measurable expected value.

Overall, HARMONY reframes the economics of agentic R&D away from headcount substitution toward investments in orchestration capacity, governance, and new human capital, with implications for how R&D productivity, costs, and returns are measured and managed.

Assessment

Paper Typetheoretical Evidence Strengthlow — Claims are supported by a small set of qualitative sources (four semi‑structured interviews), foresight scenarios, and pattern matching with documented deployments rather than systematic empirical tests or causal inference; there is no quantitative measurement of productivity gains or counterfactual comparison. Methods Rigormedium — The paper follows a recognized Design Science Research approach and triangulates across interviews, scenario analysis, and documented examples, but the primary empirical input is four expert interviews (limited sample and potential selection bias), with no systematic coding protocol, limited transparency about interview selection and analysis, and no validation of the proposed model through pilot implementations or quantitative evaluation. SampleQualitative data from four semi‑structured expert interviews with senior R&D leaders spanning industrial, healthcare, and academic settings; a foresight exercise constructing four plausible R&D futures for 2040; and pattern matching against documented agentic R&D deployments (case examples). No large‑scale survey, administrative, or experimental data are used. Themesinnovation human_ai_collab org_design productivity GeneralizabilityVery small, non‑representative interview sample limits external validity, Insights drawn from specific sectors (pharma, engineering, healthcare, academia) may not transfer to other industries, Foresight scenarios are inherently speculative and depend on assumptions about technology, policy and organizational change, Proposed model is conceptual and untested empirically — performance and adoption barriers are unknown, Cultural and regulatory differences across firms/countries may limit applicability

Claims (9)

Claim	Direction	Confidence	Outcome	Details
Corporate R&D faces a persistent productivity paradox: rising investment and expanding scientific knowledge have not translated into proportional innovation output (Eroom's Law); analogous patterns appear across engineering, materials science, and healthcare. Research Productivity	negative	high	innovation output relative to R&D investment	0.12
The core cause of the R&D productivity paradox is cognitive saturation: researchers spend an increasing share of their effort on coordination, documentation, and data governance—hidden work that displaces high-value hypothesis formation, interpretation, and strategic synthesis. Task Allocation	negative	high	researchers' allocation of effort between hidden/administrative work and high-value scientific tasks	n=4 0.02
Empirical evidence for the design is triangulated from four semi-structured expert interviews with senior R&D leaders across industrial, healthcare, and academic settings. Other	null_result	high	qualitative expert insights informing design	n=4 0.06
The study includes a foresight scenario analysis projecting four plausible 2040 R&D futures to stress-test design choices. Other	null_result	high	plausibility and robustness of design across future scenarios	n=4 0.06
Evidence also includes pattern matching with documented agentic R&D deployments. Other	null_result	high	similarity between proposed design and existing agentic R&D deployments	0.06
We propose HARMONY (Hybrid Agentic Research Model for Organisational New Yield), a four-pillar socio-technical architecture comprising ResOps (Industrialized Execution), the Control Tower (Strategic Visibility and Drift Detection), the Ethics Fabric (Bounded Autonomy by Design), and the Talent Studio (Sciencepreneur Capability). Research Productivity	positive	high	organizational capability to conduct agentic R&D / R&D productivity	0.02
The model introduces the 'Sciencepreneur' as the central human archetype in agentic R&D. Skill Acquisition	null_result	high	role definition and skill profile for human operators in agentic R&D	0.02
The model introduces 'Orchestration Leverage' as a candidate productivity metric suited to human–agent hybrid systems. Research Productivity	positive	high	productivity of human–agent hybrid research teams (via proposed metric)	0.02
Two non-negotiable design requirements guide the architecture: cognitive-load redistribution (DR1) and bounded autonomy with alignment (DR2). Task Allocation	positive	high	degree to which design reduces researcher cognitive load and constrains agentic autonomy	0.02