📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent Google whitepaper reveals that in AI-assisted development, the model itself accounts for only 10% of system behavior. The focus should be on harness design and context engineering, which drive performance and cost-efficiency.

A new Google whitepaper emphasizes that in AI-assisted software development, the model accounts for only about 10% of system behavior. Instead, the harness and context engineering are where most of the value and control lie, fundamentally shifting how organizations should approach AI integration.

The paper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that the dominant factor in AI system behavior is not the AI model itself, but the surrounding scaffolding — including prompts, rules, tools, and observability. Experiments cited show that changing only the harness or prompt structure can significantly improve performance, even when using the same model.

It also introduces the concept of agentic engineering, where AI is embedded within a structured framework with verification, testing, and safeguards, as opposed to vibe coding, which involves minimal oversight. The authors argue that this shift is crucial for cost-effective, reliable AI deployment, especially as AI’s role in code generation increases, with 41% of new code now AI-generated.

Furthermore, the paper highlights that the cost of AI development hinges more on token economy and system design than on the raw model size, advocating for a disciplined approach that minimizes unnecessary context and maximizes structured, reusable components.

At a glance

reportWhen: published early 2026

The developmentGoogle’s new whitepaper on SDLC highlights that the core of AI-driven development is not the model size but the surrounding harness and context management.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.

The idea worth building your strategy around

Agent = Model + Harness

~10%

HARNESS — prompts · tools · context · hooks · sandboxes · observability

MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.

85%

of devs use AI coding agents (51% daily)

41%

of all new code is AI-generated

~90%

of agent behavior is the harness, not the model

+19%

longer on some tasks (METR) — verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

thorstenmeyerai.com

Implications for AI Development Strategies

This shift means organizations should prioritize harness design and context engineering over chasing the latest, largest models. The realization that model size is only 10% of behavior underscores the importance of configuration, tooling, and verification in achieving reliable, cost-efficient AI systems. It challenges the common focus on model upgrades and suggests that durable competitive advantage lies in system architecture.

For CTOs and developers, this redefines where to invest resources, emphasizing system scaffolding, context management, and verification processes. It also raises awareness about total cost of ownership, including ongoing maintenance and security, which are heavily influenced by how well the harness and context are engineered.

AI-Powered Web Design Mastery: Harness the Power of Framer AI to Build, Customize, and Launch Stunning Websites—A Step-by-Step Guide for Beginners and Professionals

As an affiliate, we earn on qualifying purchases.

Background on AI-Driven SDLC and System Design

The paper builds on the ongoing evolution of AI in software engineering, where as of early 2026, 85% of developers use AI coding agents regularly, and 41% of new code is AI-generated. Previous focus was on model improvements, but recent experiments and industry insights suggest that system configuration and scaffolding are more impactful than raw model capabilities.

The concept of vibe coding—accepting AI outputs with minimal review—has been widespread but is increasingly seen as inefficient compared to agentic engineering, which involves structured workflows, testing, and safeguards. This shift is driven by the need for cost-effective, reliable AI systems that can scale without exponential increases in operational expenses.

“The model you’re paying so much attention to is only 10% of the system. The rest is harness and context.”
— Addy Osmani

AI Prompt Engineering: Foundations of Communication with LLMs – Building Generative AI and Agentic AI Prompt Systems Across Development, Testing, and Deployment (AI Engineering)

As an affiliate, we earn on qualifying purchases.

Unresolved Questions About Implementation and Impact

While the paper makes a compelling case, it remains unclear how widely organizations will adopt this approach and whether the emphasis on harness and context engineering will translate into measurable competitive advantages in practice. Details on specific best practices and tooling are still emerging, and the long-term impact on AI development costs and security is yet to be fully assessed.

AI-Powered Observability: From Noise to Insight: Transforming How We Monitor, Detect, and Respond

As an affiliate, we earn on qualifying purchases.

Next Steps for Organizations and AI Developers

Organizations should evaluate their current AI workflows, focusing on harness design and context management. Investing in training and tooling for system configuration will be critical. Industry leaders are expected to publish more detailed guidelines and best practices in the coming months, and further research will clarify how these principles influence AI reliability and cost over time.

Layered Volatility Decomposition: Foundation for AI-Driven Development

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model size only 10% of the system behavior?

The whitepaper shows that most of what determines AI system behavior is the harness, prompts, tools, and verification processes, not the model itself. The model provides the core generation capability, but the surrounding system controls its output and reliability.

How does this shift affect AI development costs?

Focusing on system design and context engineering can reduce long-term costs by minimizing unnecessary token use, improving reliability, and reducing security risks. While initial setup may be higher, the ongoing operational expenses are lower compared to vibe coding approaches.

What is agentic engineering?

Agentic engineering involves embedding AI within a structured framework that includes verification, safeguards, and context management, moving away from minimal oversight to a disciplined, reliable deployment process.

Will this approach work across all AI applications?

The principles are broadly applicable but may vary depending on the specific domain and use case. The emphasis on harness and context is especially relevant for complex, high-stakes systems requiring reliability and cost control.

What should organizations do now?

Organizations should assess their current AI workflows, invest in system architecture, and develop best practices for harness and context engineering to maximize efficiency and reliability.

Source: ThorstenMeyerAI.com

This content is for general information only and is not financial, tax or legal advice. Consult a qualified professional for decisions about your money.

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Cutrova: Edit the Words, Not the Timeline

Author

Direct Sales Help Team

Share article

The model is only 10%

Implications for AI Development Strategies

AI-Powered Web Design Mastery: Harness the Power of Framer AI to Build, Customize, and Launch Stunning Websites—A Step-by-Step Guide for Beginners and Professionals

Background on AI-Driven SDLC and System Design

AI Prompt Engineering: Foundations of Communication with LLMs – Building Generative AI and Agentic AI Prompt Systems Across Development, Testing, and Deployment (AI Engineering)

Unresolved Questions About Implementation and Impact

AI-Powered Observability: From Noise to Insight: Transforming How We Monitor, Detect, and Respond

Next Steps for Organizations and AI Developers

Layered Volatility Decomposition: Foundation for AI-Driven Development

Key Questions

Why is the model size only 10% of the system behavior?

How does this shift affect AI development costs?

What is agentic engineering?

Will this approach work across all AI applications?

What should organizations do now?

Apple’s SpeechAnalyzer API: Transforming Speech Monitoring In Tech Operations

7 Best LCD Monitor Prime Day Deals for Gaming, Work, and Travel in 2026

Forezai · TradingAgents: A Trading Firm Made of Agents

The City That Watches Itself: The Living Digital Twin, and the God’s-Eye View We’re Building

Alaska Air Group Surges In Global Coverage

The Booth Setup Trick That Makes Your Portable Power Station Work Harder

The Future Of Filmmaking: AI In ‘Kanton Alpin Verkehrsbetriebe’

When The Cloud Turned Against AI: A Deep Dive Into The Hugging Face Breach

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Author

Direct Sales Help Team

Share article

The model is only 10%

Implications for AI Development Strategies

AI-Powered Web Design Mastery: Harness the Power of Framer AI to Build, Customize, and Launch Stunning Websites—A Step-by-Step Guide for Beginners and Professionals

Background on AI-Driven SDLC and System Design

AI Prompt Engineering: Foundations of Communication with LLMs – Building Generative AI and Agentic AI Prompt Systems Across Development, Testing, and Deployment (AI Engineering)

Unresolved Questions About Implementation and Impact

AI-Powered Observability: From Noise to Insight: Transforming How We Monitor, Detect, and Respond

Next Steps for Organizations and AI Developers

Layered Volatility Decomposition: Foundation for AI-Driven Development

Key Questions

Why is the model size only 10% of the system behavior?

How does this shift affect AI development costs?

What is agentic engineering?

Will this approach work across all AI applications?

What should organizations do now?

You May Also Like