AI assistant Gemini did not improve code security in a controlled developer experiment; developers' own programming experience—rather than the tool—drove more secure outcomes.
The ongoing shortage of skilled developers, particularly in security-critical software development, has led organizations to increasingly adopt AI-powered development tools to boost productivity and reduce reliance on limited human expertise. These tools, often based on large language models, aim to automate routine tasks and make secure software development more accessible and efficient. However, it remains unclear how developers' general programming and security-specific experience, and the type of AI tool used (free vs. paid) affect the security of the resulting software. Therefore, we conducted a quantitative programming study with software developers (n=159) exploring the impact of Google's AI tool Gemini on code security. Participants were assigned a security-related programming task using either no AI tools, the free version, or the paid version of Gemini. While we did not observe significant differences between using Gemini in terms of secure software development, programming experience significantly improved code security and cannot be fully substituted by Gemini.
Summary
Main Finding
- In a quantitative study of 159 software developers given a security-related programming task, use of Google's Gemini (free or paid) did not produce significant differences in code security compared to no-AI assistance.
- By contrast, developers' general programming experience was a significant predictor of more secure code; AI assistance could not fully substitute for that experience.
Key Points
- Context: Organizations are adopting AI-powered development tools to mitigate shortages of skilled developers and to speed secure software development.
- Experimental conditions: participants were assigned to one of three conditions — no AI, Gemini free, or Gemini paid.
- Outcome: No significant security improvements from using Gemini (free or paid) relative to no-AI. No significant difference between the free and paid Gemini conditions was reported.
- Human capital matters: greater general programming experience materially improved security outcomes; reliance on Gemini alone did not close that gap.
Data & Methods
- Sample: n = 159 software developers.
- Design: Participants completed a security-related programming task while using either no tool, Gemini (free), or Gemini (paid). Assignment to conditions was experimental (participants were assigned to one of the three conditions).
- Measurement: Code security was assessed from the task submissions (summary does not provide specific metrics or statistical tests used).
- Limitations (implicit from design): single AI tool (Gemini), single task domain (security-focused programming), and lab/experimental context—these constrain external validity.
Implications for AI Economics
- Complementarity > Substitution for skilled labor: Results indicate AI assistance (Gemini) did not replace the value of programming experience for producing secure code. Investments in human capital (training, hiring experienced developers) remain economically important.
- Pricing and product-market implications: No observed security advantage for paid vs. free Gemini suggests that, at least for security outcomes on this task, willingness to pay for premium model access may not translate into better security, affecting firms’ ROI calculations for paid developer tools.
- Labor market effects: Widespread adoption of LLM-based dev tools may not substantially reduce demand for experienced developers in security-critical roles; instead, tools may shift the task composition toward less-experienced labor supplemented by supervision from experienced staff.
- Procurement and regulation: Buyers of AI-assisted development tools should not assume these tools reduce the need for experienced engineers in security-sensitive contexts; procurement decisions and regulatory guidance should account for persistent reliance on human expertise.
- Policy and training: Public and private investment in developer training and security education remains warranted; subsidizing upskilling could be more effective for improving software security than subsidizing tool adoption alone.
- Research priorities: Economists should study heterogeneous effects across task types, tool capabilities, team settings, and long-run dynamics (learning effects, model improvements) to better estimate the substitutability between AI tools and developer experience.
Assessment
Claims (8)
| Claim | Direction | Confidence | Outcome | Details |
|---|---|---|---|---|
| We conducted a quantitative programming study with software developers (n = 159) exploring the impact of Google's AI tool Gemini on code security. Output Quality | null_result | high | impact of Gemini on code security (security of code produced in the study) |
n=159
0.6
|
| Participants were assigned a security-related programming task using either no AI tools, the free version, or the paid version of Gemini. Output Quality | null_result | high | experimental condition (tool used) as it relates to subsequent code security outcomes |
n=159
0.6
|
| We did not observe significant differences between using Gemini (free or paid) and not using Gemini in terms of secure software development. Output Quality | null_result | medium | secure software development / code security (e.g., detected vulnerabilities or security score of submitted solutions) |
n=159
no significant difference
0.36
|
| Programming experience significantly improved code security. Output Quality | positive | high | code security (security quality of participants' solutions) as a function of programming experience |
n=159
significant positive effect of programming experience
0.6
|
| Programming experience cannot be fully substituted by Gemini. Output Quality | mixed | medium | degree to which Gemini use offsets the effect of programming experience on code security |
n=159
programming experience effect remained despite Gemini
0.36
|
| Organizations increasingly adopt AI-powered development tools to boost productivity and reduce reliance on limited human expertise, especially in security-critical software development. Adoption Rate | positive | medium | adoption of AI-powered development tools (general trend; not measured in this study) |
0.36
|
| AI-powered developer tools (often based on large language models) aim to automate routine tasks and make secure software development more accessible and efficient. Developer Productivity | positive | medium | intended goals of AI tools (automation of routine tasks; accessibility/efficiency in secure development) |
0.36
|
| It remains unclear how developers' general programming and security-specific experience, and the type of AI tool used (free vs. paid), affect the security of the resulting software — motivating this study. Other | null_result | high | the combined effect of developer experience and AI tool type on code security (identified as an open question prior to the study) |
0.6
|