📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent Google whitepaper reveals that in AI-assisted development, the model itself accounts for only 10% of system behavior. The focus should be on harness design and context engineering, which drive performance and cost-efficiency.

A new Google whitepaper emphasizes that in AI-assisted software development, the model accounts for only about 10% of system behavior. Instead, the harness and context engineering are where most of the value and control lie, fundamentally shifting how organizations should approach AI integration.

The paper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that the dominant factor in AI system behavior is not the AI model itself, but the surrounding scaffolding — including prompts, rules, tools, and observability. Experiments cited show that changing only the harness or prompt structure can significantly improve performance, even when using the same model.

It also introduces the concept of agentic engineering, where AI is embedded within a structured framework with verification, testing, and safeguards, as opposed to vibe coding, which involves minimal oversight. The authors argue that this shift is crucial for cost-effective, reliable AI deployment, especially as AI’s role in code generation increases, with 41% of new code now AI-generated.

Furthermore, the paper highlights that the cost of AI development hinges more on token economy and system design than on the raw model size, advocating for a disciplined approach that minimizes unnecessary context and maximizes structured, reusable components.

At a glance
reportWhen: published early 2026
The developmentGoogle’s new whitepaper on SDLC highlights that the core of AI-driven development is not the model size but the surrounding harness and context management.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Implications for AI Development Strategies

This shift means organizations should prioritize harness design and context engineering over chasing the latest, largest models. The realization that model size is only 10% of behavior underscores the importance of configuration, tooling, and verification in achieving reliable, cost-efficient AI systems. It challenges the common focus on model upgrades and suggests that durable competitive advantage lies in system architecture.

For CTOs and developers, this redefines where to invest resources, emphasizing system scaffolding, context management, and verification processes. It also raises awareness about total cost of ownership, including ongoing maintenance and security, which are heavily influenced by how well the harness and context are engineered.

Amazon

AI harness design tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background on AI-Driven SDLC and System Design

The paper builds on the ongoing evolution of AI in software engineering, where as of early 2026, 85% of developers use AI coding agents regularly, and 41% of new code is AI-generated. Previous focus was on model improvements, but recent experiments and industry insights suggest that system configuration and scaffolding are more impactful than raw model capabilities.

The concept of vibe coding—accepting AI outputs with minimal review—has been widespread but is increasingly seen as inefficient compared to agentic engineering, which involves structured workflows, testing, and safeguards. This shift is driven by the need for cost-effective, reliable AI systems that can scale without exponential increases in operational expenses.

“The model you’re paying so much attention to is only 10% of the system. The rest is harness and context.”

— Addy Osmani

Amazon

prompt engineering software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Unresolved Questions About Implementation and Impact

While the paper makes a compelling case, it remains unclear how widely organizations will adopt this approach and whether the emphasis on harness and context engineering will translate into measurable competitive advantages in practice. Details on specific best practices and tooling are still emerging, and the long-term impact on AI development costs and security is yet to be fully assessed.

Amazon

AI observability tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for Organizations and AI Developers

Organizations should evaluate their current AI workflows, focusing on harness design and context management. Investing in training and tooling for system configuration will be critical. Industry leaders are expected to publish more detailed guidelines and best practices in the coming months, and further research will clarify how these principles influence AI reliability and cost over time.

Amazon

structured AI development frameworks

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model size only 10% of the system behavior?

The whitepaper shows that most of what determines AI system behavior is the harness, prompts, tools, and verification processes, not the model itself. The model provides the core generation capability, but the surrounding system controls its output and reliability.

How does this shift affect AI development costs?

Focusing on system design and context engineering can reduce long-term costs by minimizing unnecessary token use, improving reliability, and reducing security risks. While initial setup may be higher, the ongoing operational expenses are lower compared to vibe coding approaches.

What is agentic engineering?

Agentic engineering involves embedding AI within a structured framework that includes verification, safeguards, and context management, moving away from minimal oversight to a disciplined, reliable deployment process.

Will this approach work across all AI applications?

The principles are broadly applicable but may vary depending on the specific domain and use case. The emphasis on harness and context is especially relevant for complex, high-stakes systems requiring reliability and cost control.

What should organizations do now?

Organizations should assess their current AI workflows, invest in system architecture, and develop best practices for harness and context engineering to maximize efficiency and reliability.

Source: ThorstenMeyerAI.com

This content is for general information only and is not financial, tax or legal advice. Consult a qualified professional for decisions about your money.
You May Also Like

Search as Code: Perplexity Is Right About the Future — Just Not First to It

Perplexity introduces Search as Code, enabling AI agents to build custom retrieval pipelines, promising higher accuracy and efficiency, but with questions remain.

7 Best PC Motherboards for Prime Day Deals in 2026

Discover the best PC motherboard deals for Prime Day 2026, including options for AM4 and AM5 platforms, with detailed insights on features and value.

7 Best PC Routers for Prime Day Deals in 2026

Discover the best PC router deals for Prime Day 2026, including Wi-Fi 7, Wi-Fi 6, and security-focused options, tailored for different user needs.

7 Best PC Tablets for Prime Day Deals in 2026

Discover the best PC tablets on Prime Day 2026, including deals on Samsung Galaxy Tab S9, Surface Pro 11, iPad 9th Gen, and more, with buying tips.