📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.
TL;DR
The latest Google whitepaper reveals that in AI-based software development, the model itself accounts for only 10% of system behavior. The focus should be on harness design and context engineering, which drive performance and cost efficiency.
A new Google whitepaper, „The New SDLC With Vibe Coding,“ emphasizes that the most significant shift in software engineering is moving from writing code to expressing intent and trusting AI to generate software. The paper states that the model constitutes only about 10% of what determines AI behavior, shifting focus toward harness design and context engineering.
The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, underscores that 85% of professional developers now use AI coding agents regularly, with 51% doing so daily. Despite this, the authors argue that the key to effective AI systems lies in the harness and context management rather than the size or sophistication of the models themselves.
Concrete evidence cited includes experiments where changing only the harness or prompts significantly improved performance—moving a coding agent from outside the Top 30 to the Top 5 on a benchmark, and boosting scores by nearly 14 points through prompt and middleware tweaks. The whitepaper stresses that most failures in AI agents stem from configuration issues—missing tools, vague rules, or noisy context—rather than the model’s raw capabilities.
The authors advocate for a disciplined approach called agentic engineering, which involves structured verification, testing, and context management, contrasting with the more casual vibe coding. They warn that focusing solely on model upgrades is misguided and that the real competitive advantage resides in how the AI is scaffolded and guided through harness design.
The model is only 10%
A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.
The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.
Why Harness and Context Are More Critical Than Model Size
This shift in focus from model size to harness design and context engineering has profound implications for AI development and deployment. It suggests that organizations can achieve better results and cost savings by investing in configuration, tooling, and structured workflows rather than constantly chasing larger or newer models.
For technical leaders, this means re-evaluating priorities: instead of spending heavily on model upgrades, they should focus on building robust harnesses, developing effective context management strategies, and creating reusable schemas. This approach can reduce operational costs, improve reliability, and accelerate development cycles, making AI integration more sustainable and scalable.

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Background of the Shift Toward Intent Expression in AI Development
The whitepaper builds on recent trends where AI models have become central to software development, with over 85% of developers using AI agents regularly. Previously, emphasis was placed on adopting the latest models or frameworks. However, as AI systems matured, experts observed diminishing returns from model size increases alone, prompting a reevaluation of what truly influences AI performance.
This evolution aligns with earlier concepts like vibe coding, which prioritized quick prompts and minimal oversight, but has now matured into a more disciplined practice termed agentic engineering. The authors highlight that the real challenge is not the AI’s raw capabilities but how developers structure, verify, and control its behavior through scaffolding and context management.
„The model constitutes only about 10% of what determines AI behavior; the rest is harness and context.“
— Addy Osmani

Designing Large Language Model Applications: A Holistic Approach to LLMs
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
What Aspects of Harness and Context Engineering Are Still Unclear
While the whitepaper provides strong evidence that harness and context are critical, it does not specify exact best practices or standardized frameworks for implementing these strategies across different AI applications. The precise methods for scaling context management and automating schema development remain under development, and industry adoption of these practices is still evolving.

AI Context Engineering: Architecting Intelligence Through Prompt Structures, Tools, and Memory
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Next Steps for AI Development and Organizational Adoption
Organizations should prioritize investing in harness and context management tools, developing best practices for schema design, and training teams in structured AI workflows. Further research and industry collaboration are expected to refine these approaches, with upcoming updates likely providing more detailed frameworks for effective implementation. Monitoring how these strategies impact cost and performance will shape future AI development standards.

Supply Chain Software Security: AI, IoT, and Application Security
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Key Questions
Why is the model size less important than the harness?
According to the whitepaper, the behavior of AI systems depends more on how they are configured, guided, and scaffolded than on the raw size of the models. The harness and context management determine correctness, reliability, and cost-effectiveness.
What is agentic engineering?
Agentic engineering is a disciplined approach that involves structured verification, testing, and context management to ensure AI systems behave correctly and efficiently, moving beyond vibe coding.
How can organizations improve their AI systems based on this insight?
Organizations should focus on building robust harnesses, managing context effectively, and developing reusable schemas, rather than solely upgrading to larger models.
Does this mean model upgrades are unnecessary?
The whitepaper suggests that, while model improvements can help, the primary gains come from better harness design and context engineering, which are more cost-effective and controllable.
What are the risks of focusing too much on harness and context?
The main risk is that improper configuration or poor schema design can still lead to failures or vulnerabilities. Ongoing testing and verification are essential to mitigate these risks.
Source: ThorstenMeyerAI.com