The AI Agent Platform Shift

Similar to how Web 2.0 refactored the internet, new AI agents are refactoring what AI can do for us and how to use AI. It’s the AI era’s first platform shift since the ChatGPT moment.

Aug 01, 2025

A painting of a ferris wheel

AI-generated content may be incorrect. — Figure 1. AI art – a Ferris wheel in modern art style.

Introduction

The AI revolution is moving so fast, it’s hard to keep up with where we are, let alone predict where we might be a year or two down the road. We are so attuned to AI advancing and doing new things, we may not notice that the rise of highly capable AI agents is creating a platform shift - from AI as chatbots to AI as agents.

This is important for AI users because how to best use and leverage AI agents is distinctly different from how to interact with AI chatbots.

Here’s the TL;DR:

1. AI agents are now reliable, capable, smart, and robust enough for real-world use.

2. This is leading to a platform shift in AI, from AI chatbot to AI agent.

3. The platform shift dramatically changes what AI can do for you, how best to interact with AI, and how to integrate AI into your workflows and daily life.

4. The prompt, whether by voice or text, is still the interface, the main way to direct AI. But what and how you direct AI will shift. Prompt engineering is out, context engineering is in.

5. AI coding agents are the leading use case and tip of the spear for agentic AI, but general AI agents are following the pattern of AI coding agents, in utility, capability, and how they work.

6. Thanks to Model Context Protocol (MCP), capabilities such as web search, coding, research and other skills can now be exposed as tools, making AI agents far more generally capable.

7. AI agents are serving many different vertical applications and are being embedded to make “AI-first” applications. Your browser, IDE, word processor, and even your spreadsheet will soon become agentic applications.

The New AI Agent

The term “Agent” is quite generic and has been used before to describe applications far simpler than today’s leading AI Agents. Simple reflex agents operate on condition–action rules and respond to your environment, so even a smoke detector can be called an agent.

Agents take autonomous actions, so when you combine agents with AI, you get (hopefully) intelligent autonomous action. We expressed this AI Agent definition in Manus and the New AI Agents:

AI agents are autonomous AI systems designed to perceive their environment, make decisions, and take actions to complete specific tasks.

We also described the challenge of AI agents, that until recently they have been “unreliable and fragile.” AI Agents have broken those barriers this year, thanks to several key advances:

Improved AI reasoning: AI models can generate thoughts and plans that make sense and adhere to user intentions.
Tool use integration with Model Context Protocol (MCP) and SDKs.
Better use and management of context, memory and data, including memory features for personalization. AI agents can manage semantic, episodic and procedural memory, so the AI agent has access to needed knowledge bases, can ‘remember’ past interactions, and can personalize AI responses.

As we explained in March in “Design Patterns for Effective AI Agents”:

AI agents are constructed by organizing AI models and ecosystem components, connecting them with AI agent frameworks to execute workflows. The latest AI agents are more powerful because their underlying AI models and AI components are more reliable and capable.

In particular, we cannot underestimate the benefits of tool use standardization with MCP. MCP has given all AI agents tool-using powers, and the standardization has made it easier for AI models to get trained to be good at accessing and calling the right tools.

The AI Agent Platform Shift

More capable AI agents are leading to an explosion in AI agent use and applications. AI agent usage is ramping up rapidly for good reason; they are incredibly useful and productive.

This new development is a platform shift because AI agents dramatically expand what AI can do for you. From an interface perspective, this is a shift from an AI chatbot like ChatGPT to an AI agent, which can be a stand-alone AI agent application, like Claude Code or Manus, or may be embedded in other interfaces or applications. For example, Google search has Agent mode, Perplexity and the latest ChatGPT has an Agent feature.

The “what” in AI agents is that don’t just generate text, they do tasks. The main task for AI coding agents is to create code or other software artifacts, and the main task for AI research agents is to create reports. For these AI agents, it’s an evolution from AI model generation output. However, for AI browsing agents and AI shopping agents, the tasks are not to generate artifacts but complete actions.

The prompt, whether by voice or text, is still the interface, the main way to direct AI. But what and how you direct AI will shift. Prompt engineering is out, context engineering is in. It’s not enough to have a good prompt, you need the full context of prompt instructions, data in context (such as a corporate knowledge base or code base), and memory.

The expansion of prompting to full context to support the AI agent includes several aspects. If you want the AI agent to properly generate an artifact you guide it with a specification. For code, the specification would look like a software requirements specification and design specification. The proper way to AI coding becomes pec-driven AI coding. For research, you specify the task with a research plan and a report outline.

AI agents will follow a Plan-then-Execute pattern for efficient operation. To avoid an AI agent executing incorrect tasks, the user will specify tasks needed or the AI agent will do it in a planning step. The AI agent can define a plan and present it for feedback, then execute it. For example, Gemini Deep Research creates a plan for user review before doing its web research and report generation. Most AI coding assistants now have a planning, for example RooCode has Architect mode.

The platform shift to AI agents is a shift to AI that can take action.

A table with text and words

AI-generated content may be incorrect. — Figure 2. The Platform shift to AI agents is expanding possible use cases, changing the requirements for inputs and how we prompt the AI, and enabling AI to take action.

The AI Agent Platform Shift - The Web 2.0 Analogy

The early internet in the 1990s was both amazing and frustrating at the same time. You could access data and even do shopping, but interactions were ‘clunky’ and slow. About a decade after the initial web, a feature called AJAX (Asynchronous JavaScript and XML) was developed, which let you reconstruct a part of a web page without a full HTTP response, avoiding long reload times.

AJAX was a simple but powerful shift in how web applications were built, paving the way for the dynamic, responsive web we use today. Called Web 2.0 at the time, that shift enabled new classes of SaaS (software-as-a-service) applications.

Improved user experience dramatically changed how the internet was used. In the same way, we have experienced major step-changes in AI, first with ChatGPT and now with AI agents. The original Alexa pre-LLM chatbot is quite different from experiencing ChatGPT with GPT-4 in 2023, which in turn pales in comparison to today’s AI agent experience you can have now.

A screenshot of a computer

AI-generated content may be incorrect. — Figure 3. Meet the new ChatGPT. You still direct it via a prompt, but how and what you ask is expanded and different.

The AI Agent Landscape

AI code assistants have been on the leading-edge of AI adoption. Copilot was the first LLM-based application, and AI coding agents have been the leading use case for agentic AI.

MCP has made it much easier to connect AI models to supporting tools, and the latest AI models have been trained to become much better at selecting and using tools. Many skills and capabilities, including web browsing, coding, and research skills, can be exposed as tools to an AI agent. As a result, any AI agent with MCP support has access to a “Swiss Army knife” of dozens of tools on tap, making AI agents generally capable.

Embedded AI Agents and AI-First Applications

AI agents are also being embedded in applications to make “AI-first” applications.

The Perplexity Comet browser is an example of how the browser is being reconfigured with agentic AI. As we have noted, this is reinventing the browser experience. Comet rebuilt the browser with an AI agent integrated into it, combing AI search and AI assistance within the browser. Microsoft just introduced Copilot Mode in Edge, that integrates chat, search and web navigation.

The IDE (interactive development environment) is evolving in this direction. Many AI coding assistants have been built as extensions of VSCode. As they get more powerful, users will find there is a full AI agent under the hood. Tools like Cursor will end up looking a lot like Cognition’s Devin, while the Devin AI software agent absorbs the capabilities of the Windsurf AI coding assistant.

Over time, most applications will become “AI-first” applications, with an AI agent merged as a feature into the application, ready to exercise that application agentically.

Figure 4. From OpenAI’s Roadmap to AGI from 2024. We are now firmly at Level 3, with AI agents that take actions on our behalf.

The Road Ahead

We are witnessing an agentic AI revolution, with increased AI agent capabilities leading to releases of many capable AI agents, and an explosion in their popularity and use. This is driving a platform shift for AI.

As we go forward, we can expect more improvements in AI models for agentic use and continued waves of AI agent product releases. We predicted in March “A flood of new AI agents is coming soon.” That flood is here and now.

AI users can expect a lot more from these AI agents than before. Raise your expectations and creatively explore how far you can take agentic AI for your tasks.

This platform shift means AI users need to rethink how to best use AI agents. This means using Context Engineering for best results. You won’t get the benefits of AI by using AI agents like you used ChatGPT in 2023.

Finally, AI progress is on an accelerated time-line. Single AI agents will evolve into hierarchical AI agents, multi-agent AI swarms, embedded AI agents, and ambient AI agents. Each will have different twists on the AI interface and what it can do, and this may supersede today’s AI agents as a platform, but that’s a story for another day.

AI Changes Everything

Discussion about this post