The Commonplace

Generative AI widens the space of feasible ideas and speeds early-stage R&D, but gains crystallize only when humans vet, curate, and implement outputs; unchecked use invites low-quality, biased, or harmful innovations.

ChatGPT as an Innovative Tool for Idea Generation and Problem-Solving: Nano Review
Horn Sarun · March 27, 2026 · Zenodo (CERN European Organization for Nuclear Research)
openalex · review_meta · medium evidence · relevance 8/10 · DOI · Source · PDF
Generative AI functions as a cognitive co-pilot that increases idea quantity/diversity and accelerates early-stage problem decomposition and prototyping, but its economic value depends on human curation, domain expertise, and downstream implementation.

Abstract

The advent of generative artificial intelligence, particularly advanced large language models like ChatGPT, is fundamentally reshaping the cognitive architecture of creativity and systematic problem-solving. This nano review examines its emergent, dual function as both a high-volume catalyst for divergent idea generation and a structured assistant for deconstructing and navigating complex problems. It synthesizes empirical findings on its utility in overcoming cognitive fixation, generating cross-domain analogies, and providing scaffolded support for hypothesis formulation and solution prototyping. The review critically confronts the model's inherent limitations, including its reliance on recombinative rather than deeply original insight, its susceptibility to bias and mediocrity, and its lack of contextual, domain-specific wisdom. It concludes that the true potential of these tools is unlocked not through autonomous ideation, but through a synergistic "cognitive co-pilot" model. This framework strategically leverages AI to expand the solution space and challenge assumptions, while reserving for human intelligence the critical roles of curation, strategic evaluation, and the application of experiential judgment to navigate ambiguity and ethical implications.

Keywords: ChatGPT, innovation, ideation, creative problem-solving, divergent thinking, cognitive augmentation, human-AI collaboration, design thinking, creativity support tools

Summary

Main Finding

Generative AI (e.g., large language models like ChatGPT) serves a dual cognitive role: (1) a high-volume catalyst for divergent idea generation and cross-domain analogy-making, and (2) a structured assistant for deconstructing complex problems and scaffolding hypotheses and prototypes. Its economic value arises when it is used as a "cognitive co-pilot"—expanding the solution space and challenging assumptions—while humans retain roles in curation, strategic evaluation, experiential judgment, and ethical oversight.

Key Points

  • Dual function: simultaneously boosts ideational fluency (quantity/diversity of ideas) and supports systematic problem breakdown and prototyping.
  • Empirical benefits: helps overcome fixation, produces cross-domain analogies, accelerates hypothesis generation and early-stage prototyping, and can increase creative output in lab and field tasks.
  • Limitations of the tool:
    • Recombination bias: tends to remix existing patterns rather than produce deeply original, paradigm-shifting insight.
    • Quality variability: frequently generates mediocre, plausible-sounding outputs that require human filtering.
    • Bias and errors: susceptible to social/representational biases and hallucinations; lacks contextual, tacit, domain-specific wisdom.
  • Interaction model: effectiveness depends on human-AI workflows—prompt design, iterative refinement, and human vetting are critical.
  • Net value is contingent: gains are largest where breadth of ideas and rapid iteration matter; gains are smaller or riskier where deep domain expertise, tacit knowledge, or high-stakes judgments are required.
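The "generate many, then curate" logic running through these points can be made concrete with a toy simulation. This is a hypothetical model, not an analysis from the review: if idea quality is drawn independently from a Uniform(0, 1) distribution, then cheap AI volume plus human selection raises the expected quality of the best surviving idea, even though the average draw stays mediocre.

```python
import random

def best_of_n(n, rng):
    """Quality of the best idea among n independent draws.

    Idea quality is modeled as Uniform(0, 1) -- a toy assumption,
    not an estimate from the reviewed studies.
    """
    return max(rng.random() for _ in range(n))

def average_best_of_n(n, trials=10_000, seed=0):
    """Monte Carlo estimate of the expected best-of-n quality."""
    rng = random.Random(seed)
    return sum(best_of_n(n, rng) for _ in range(trials)) / trials

# For Uniform(0, 1), E[max of n] = n / (n + 1): one draw averages 1/2,
# while curating the best of 20 draws averages 20/21 -- volume pays off
# only because a human (or some filter) picks the winner.
if __name__ == "__main__":
    for n in (1, 5, 20):
        print(n, round(average_best_of_n(n), 2))
```

The sketch also shows why the gains are contingent: the simulation assumes the curator can rank ideas costlessly and correctly, which is exactly the human filtering effort the review flags as a real cost.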

Data & Methods

  • Nature of the paper: a nano review synthesizing empirical findings across experimental, lab, and applied studies rather than presenting new primary data.
  • Evidence base summarized:
    • Controlled experiments and user studies measuring idea fluency, novelty, and problem-solving performance with/without AI assistance.
    • Field and case studies of AI-supported prototyping and design tasks in organizational settings.
    • Qualitative analyses of interaction patterns (prompting, iteration, curation).
  • Typical outcome measures in the literature: number and diversity of ideas, judged creativity/novelty, time-to-prototype, user-perceived usefulness, error rates.
  • Methodological caveats in the literature: small or convenience samples, short-term interventions, domain-specific contexts, heterogeneous evaluation metrics, and potential publication bias toward positive results.
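As an illustration of one outcome measure mentioned above, a "diversity index" for a brainstorm can be computed as the mean pairwise lexical distance among its ideas. The sketch below uses Jaccard distance over word sets; this is a hypothetical minimal metric, not one attributed to the reviewed studies (which more often use embedding-based or human-rated diversity).

```python
def jaccard_distance(a: str, b: str) -> float:
    """1 minus the Jaccard similarity of the two ideas' word sets."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    if not wa and not wb:
        return 0.0
    return 1.0 - len(wa & wb) / len(wa | wb)

def idea_diversity(ideas: list[str]) -> float:
    """Mean pairwise Jaccard distance across a list of ideas.

    0.0 means every idea uses identical wording; 1.0 means no two
    ideas share any vocabulary.
    """
    pairs = [(i, j) for i in range(len(ideas)) for j in range(i + 1, len(ideas))]
    if not pairs:
        return 0.0
    return sum(jaccard_distance(ideas[i], ideas[j]) for i, j in pairs) / len(pairs)
```

For example, `idea_diversity(["solar roads", "solar roads"])` is 0.0 and `idea_diversity(["solar roads", "wind kites"])` is 1.0; a fluency-plus-diversity evaluation would report both the count of ideas and a score like this one.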

Implications for AI Economics

  • Productivity and innovation:
    • Lowered cost and time of ideation and early-stage R&D may accelerate innovation cycles and reduce search costs for firms.
    • AI can raise per-worker productivity for tasks that involve brainstorming, drafting, and prototyping, but measured gains will depend on downstream filtering and implementation costs.
  • Labor market effects:
    • Complementarity: increased returns to skills in evaluation, curation, synthesis, domain expertise, and managerial judgment that supervise or integrate AI outputs.
    • Substitution: potential displacement of entry-level or routine cognitive work that centers on generation or drafting without significant evaluative responsibility.
    • Occupational reallocation: demand grows for "AI-savvy" roles (prompt engineers, human-in-the-loop evaluators, domain curators).
  • Returns to skill and organizational capital:
    • Economic rents may shift toward agents who control data, computing, and organizational processes that effectively integrate AI as a co-pilot.
    • Firms that build strong human-AI complementarities (training, workflows, culture) can realize larger productivity gains.
  • Market structure and competition:
    • Platform effects and scale economies in data/compute could increase concentration among AI providers, affecting entry, pricing, and bargaining power.
    • Lower ideation costs may reduce barriers for startups in some domains, but model-as-platform dominance could reintroduce entry frictions.
  • Measurement challenges:
    • Standard productivity metrics (e.g., TFP) may undercount the value of ideation and creative augmentation; disentangling AI contribution from human curation is empirically difficult.
    • Quality-adjusted output and longer-term impact (e.g., which AI-generated ideas are implemented and succeed) are crucial but hard to observe.
  • Innovation quality and externalities:
    • Proliferation of low-quality or biased ideas can impose filtering and reputational costs; false or misleading outputs introduce social externalities.
    • Rapid idea generation without robust evaluation may increase risk (poor product designs, ethical lapses, regulatory violations).
  • Policy and business implications:
    • Education and training should emphasize evaluative, domain, and integrative skills to complement AI.
    • Firms should invest in governance: human-in-the-loop processes, evaluation metrics, and data stewardship.
    • Regulatory focus areas: intellectual property boundaries for AI-generated content, liability for downstream harms, competition policy for model/data concentration, and standards for bias/robustness.
  • Research priorities for AI economics:
    • Quantify net welfare impacts across sectors and skill groups, incorporating implementation and evaluation frictions.
    • Develop methods to attribute innovation and productivity gains between human and AI contributions.
    • Study long-run effects on market structure, wage distribution, and the nature of tasks.
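The measurement challenge noted above can be seen in the standard growth-accounting identity: ideation gains that leave measured capital and labor unchanged land entirely in the Solow residual, with nothing in the accounting to split the gain between the model and the humans curating its output. A minimal sketch, where the 1/3 capital share and the output figures are illustrative assumptions, not numbers from the paper:

```python
import math

def solow_residual(output, capital, labor, alpha=1/3):
    """Log TFP residual: log Y - alpha*log K - (1 - alpha)*log L.

    alpha is capital's output share; 1/3 is a conventional textbook
    value, not an estimate from the review.
    """
    return math.log(output) - alpha * math.log(capital) - (1 - alpha) * math.log(labor)

# Same measured K and L; AI-assisted ideation lifts output by 5%.
# The whole gain appears as an unexplained residual.
baseline = solow_residual(100.0, 300.0, 50.0)
with_ai = solow_residual(105.0, 300.0, 50.0)
print(round(with_ai - baseline, 4))  # prints 0.0488, i.e. log(1.05)
```

The residual records *that* productivity rose, not *why*; attributing the 0.0488 log points between human and AI contributions requires exactly the micro-level task data the review says is scarce.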

Overall, generative AI expands the feasible space of ideas and accelerates early-stage problem solving, but its economic promise is realized mainly through complementarity with human judgment, curation, and domain expertise. Policies and firm strategies that cultivate those complementarities will determine how gains are distributed across workers, firms, and society.

Assessment

  • Paper Type: review_meta
  • Evidence Strength: medium — Draws on controlled experiments, lab/user studies, and field case studies that consistently show short-term gains in ideation, prototyping speed, and creative fluency, but the underlying studies are often small, short-term, domain-specific, and heterogeneous, and there is limited causal evidence on downstream implementation, firm-level productivity, and long-run welfare.
  • Methods Rigor: medium — The paper synthesizes empirical findings across multiple study designs (RCT-like user experiments, lab tasks, and field case studies) and discusses methodological caveats, but it is a compact/narrative synthesis rather than a preregistered systematic review or meta-analysis and is constrained by the varying rigor and metrics of the cited literature.
  • Sample: A synthesis of evidence from controlled experiments and user studies (typically small or convenience samples, short tasks measuring idea fluency/novelty), qualitative analyses of human-AI interaction patterns, and a set of field/case studies of AI-supported prototyping and design in organizational settings; studies span multiple domains but are often domain-specific and use heterogeneous outcome measures.
  • Themes: human_ai_collab, productivity, innovation, labor_markets, skills_training, org_design, governance
  • Generalizability:
    • Many primary studies use small, convenience, or student samples, limiting external validity.
    • Most interventions are short-term, with immediate outputs (ideas/prototypes) rather than long-run implemented outcomes.
    • Domain-specific tasks (design, writing, brainstorming) may not generalize to high-stakes or tacit-knowledge-intensive work.
    • Heterogeneous and nonstandardized outcome measures (fluency, judged novelty, time-to-prototype) impede comparability.
    • Rapid evolution of models means findings for one generation of LLMs may not hold for later ones.
    • Potential publication bias toward positive results and selective case reporting.

Claims (19)

Claim · Outcome · Direction · Confidence · Details
  • Generative AI serves a dual cognitive role: (1) a high-volume catalyst for divergent idea generation and cross-domain analogy-making, and (2) a structured assistant for deconstructing complex problems and scaffolding hypotheses and prototypes.
    Creativity · positive · medium · ideational fluency/diversity; incidence of cross-domain analogies; quality/speed of problem decomposition and prototyping (0.14)
  • When used as a 'cognitive co-pilot' that expands the solution space and challenges assumptions while humans curate and evaluate, generative AI generates economic value.
    Innovation Output · positive · medium · idea space breadth; time-to-prototype; downstream implemented/valued ideas (largely not directly measured in many studies) (0.14)
  • Generative AI boosts ideational fluency—the quantity and diversity of ideas produced in brainstorming tasks.
    Creativity · positive · medium · number of ideas generated; diversity indices of ideas (0.14)
  • Generative AI supports systematic problem breakdown and early-stage prototyping, accelerating hypothesis generation and prototype development.
    Task Completion Time · positive · medium · time-to-prototype; number/quality of generated hypotheses/prototypes; user-perceived usefulness (0.14)
  • AI assistance helps people overcome fixation and produces cross-domain analogies that they might not generate alone.
    Creativity · positive · medium · measures of fixation (e.g., repetition of prior solutions); count/quality of cross-domain analogies (0.14)
  • Generative AI can increase creative output in both lab and field tasks as judged by external raters.
    Creativity · positive · medium · rated creativity/novelty scores; externally judged idea quality (0.14)
  • Generative models exhibit recombination bias: they tend to remix existing patterns rather than produce deeply original, paradigm-shifting insights.
    Creativity · negative · medium · degree of novelty vs. recombination in generated outputs; incidence of paradigm-shifting ideas (rare) (0.14)
  • The quality of AI-generated outputs is highly variable; models frequently produce mediocre but plausible-sounding content that requires human filtering.
    Output Quality · negative · high · output quality distributions; user-perceived quality; time/effort for human filtering (0.24)
  • Generative AI is susceptible to social and representational biases and to factual errors or hallucinations; it lacks tacit, contextual domain expertise.
    AI Safety And Ethics · negative · high · incidence of biased content; factual error/hallucination rate; performance on domain-specific tasks requiring tacit knowledge (0.24)
  • The effectiveness of generative AI depends critically on human-AI workflows: prompt design, iterative refinement, and human vetting materially affect outcomes.
    Task Allocation · mixed · medium-high · variation in output quality based on prompt design; changes in output after iterative refinement; measures of human vetting effort (0.02)
  • Net value from generative AI is contingent: gains are largest where breadth of ideas and rapid iteration matter, and smaller or riskier where deep domain expertise, tacit knowledge, or high-stakes judgments are required.
    Task Allocation · mixed · medium · task-dependent differences in idea quantity/quality; implementation success rates in high-stakes vs. exploratory tasks (0.14)
  • Lowered cost and time of ideation and early-stage R&D due to generative AI may accelerate innovation cycles and reduce firms' search costs.
    Innovation Output · positive · low · time-to-prototype; search costs; firm-level innovation cycle length (largely unmeasured in reviewed literature) (0.07)
  • Generative AI can raise per-worker productivity for tasks involving brainstorming, drafting, and prototyping, but realized gains depend on downstream filtering and implementation costs.
    Firm Productivity · positive · medium · task output (ideas/drafts) per worker; downstream filtering effort; implemented outcomes (0.14)
  • Generative AI will create complementarity: increasing returns to skills in evaluation, curation, synthesis, and domain expertise that integrate AI outputs.
    Skill Acquisition · positive · low · demand for evaluative/curation skills; wage premia for such skills (not directly measured) (0.07)
  • Generative AI poses substitution risk for entry-level or routine cognitive work focused on generation or drafting without evaluative responsibility.
    Job Displacement · negative · low · task automatability; employment/demand for routine-generation roles (largely unmeasured) (0.07)
  • Economic rents and advantages may accrue to agents who control large datasets, computing resources, and organizational processes that effectively integrate AI as a co-pilot, potentially increasing market concentration among AI providers.
    Market Structure · negative · low · market concentration measures; returns to data/compute ownership (not fully measured in the review) (0.07)
  • Standard productivity metrics (e.g., TFP) may undercount the value of ideation and creative augmentation provided by generative AI, making attribution between human and AI contributions difficult.
    Firm Productivity · negative · medium · coverage/accuracy of productivity metrics for ideation-related gains; attribution of output to human vs. AI (0.14)
  • Proliferation of low-quality or biased AI-generated ideas creates externalities: increased filtering and reputational costs for firms and risks of poor product designs, ethical lapses, or regulatory violations if evaluation is insufficient.
    Organizational Efficiency · negative · medium · filtering effort/costs; incidence of reputational/regulatory incidents tied to AI-generated outputs (sparse) (0.14)
  • Policy and firm responses should emphasize human-in-the-loop governance, training in evaluative/domain skills, data stewardship, and regulatory attention to IP, liability, competition, and robustness standards.
    Governance And Regulation · positive · speculative · effectiveness of governance/training/regulation in mitigating harms and enhancing benefits (recommended but not empirically demonstrated within the review) (0.02)

Notes