guide9 min read

The Protocol Layer for AI Agents: Why MCP Is the TCP/IP of Agentic Infrastructure

A universal protocol that lets any agent talk to any tool

MCP (Model Context Protocol) is an open standard from Anthropic that lets any AI agent talk to any tool through a unified interface — the way TCP/IP lets any computer talk to any network. Instead of building one-off integrations per tool, agents and tools speak MCP, making interoperability the default.

Before TCP/IP, every network was a silo. Mainframes talked to their own terminals. Proprietary protocols locked users into vendor ecosystems. The internet happened when a single protocol layer made interoperability the default. MCP is doing the same thing for AI agents. Tool builders implement MCP once, agent builders consume MCP once, and the cross-product of integrations explodes — without the N×M glue code that has historically slowed every ecosystem.

This is not hyperbole. Before MCP, every agent framework built its own integration layer. LangChain had its tools. AutoGPT had its plugins. Each implementation was incompatible with the others. If you built a Snowflake connector for LangChain, it did not work in CrewAI. Data Workers is built MCP-native because we saw this fragmentation problem early: 15 agents that need to connect to 85+ tools cannot afford per-framework integration overhead.

The Integration Tax Before MCP

Before MCP, connecting an AI agent to a tool required building a bespoke integration. Every combination of agent framework and tool needed its own adapter: serialization logic, authentication handling, error mapping, schema translation. For N agent frameworks and M tools, you needed N times M integrations.

The math was brutal. A data platform with 20 tools (warehouse, orchestrator, quality monitors, catalogs, BI tools, version control, CI/CD) and 3 agent frameworks needed 60 custom integrations. Each integration had to be maintained, updated when APIs changed, and tested independently. This is why most early agent platforms supported a handful of tools and nothing more.

The integration tax was also a trust tax. Every custom integration was a security surface. Every tool adapter had its own approach to authentication, authorization, and audit logging. There was no standard way to control what an agent could access or to audit what it did.

What MCP Actually Is (And Is Not)

MCP is an open protocol, released by Anthropic under the MIT license, that standardizes how AI agents communicate with external tools and data sources. At its core, MCP defines three primitives.

  • Tools. Executable functions that agents can call -- query a database, run a dbt model, create a Jira ticket. Each tool has a typed schema that tells the agent what inputs it needs and what outputs to expect.
  • Resources. Data that agents can read -- documentation, schema definitions, configuration files. Resources provide context without requiring the agent to execute anything.
  • Prompts. Pre-built interaction patterns that guide agent behavior for specific tasks -- like a template for investigating a data quality issue or a procedure for executing a schema migration.

What MCP is not: it is not an agent framework. It does not tell agents how to think, plan, or decide. It is purely the communication layer -- the protocol that lets any agent (Claude, GPT, Gemini, open-source models) talk to any tool (Snowflake, dbt, Airflow, GitHub) through a standard interface.

The TCP/IP Analogy: Why Protocol Standards Win

The comparison to TCP/IP is instructive because the dynamics are identical. In the 1970s, computer networks existed but were incompatible. ARPANET used NCP. IBM had SNA. DECnet connected DEC machines. Each was technically capable but isolated.

TCP/IP won not because it was the best protocol for any single use case, but because it was good enough for all use cases and it was open. It reduced the integration problem from N times M to N plus M. Every computer just needed to speak TCP/IP, and it could communicate with every other computer that spoke TCP/IP. The network effect was unstoppable.

MCP creates the same dynamic for agents. Instead of N agent frameworks times M tools requiring N times M integrations, you need N MCP clients (agent frameworks) plus M MCP servers (tool connectors). Build one MCP server for Snowflake, and every MCP-compatible agent can use it. Build one MCP client into your agent, and it can use every MCP server. The integration problem becomes linear instead of quadratic.

MCP Architecture: Clients, Servers, and the Transport Layer

MCP uses a client-server architecture with a clean separation between the agent (client) and the tool (server).

ComponentRoleExamples
MCP ClientThe agent or host application that consumes toolsClaude Code, Cursor, VS Code Copilot, custom agents
MCP ServerExposes tools, resources, and prompts via the MCP protocolData Workers MCP servers, dbt MCP, Snowflake MCP
TransportCommunication channel between client and serverstdio (local), HTTP+SSE (remote), WebSocket

The transport layer is worth noting. MCP supports both local (stdio) and remote (HTTP+SSE) transports. Local transport is ideal for development -- the MCP server runs as a subprocess of the client. Remote transport enables production deployments where MCP servers run as services that multiple agents can connect to simultaneously.

Data Workers provides MCP servers that work in both modes. In Claude Code and Cursor, they run locally via stdio for fast iteration. In production deployments, they run as remote services that the entire agent swarm connects to, with centralized authentication and audit logging.

Why MCP Matters for Data Engineering

Data engineering is one of the best use cases for MCP because the tool landscape is so fragmented. A typical data stack includes a warehouse (Snowflake, BigQuery, Databricks), an orchestrator (Airflow, Dagster, Prefect), a transformation layer (dbt), quality monitoring (Great Expectations, Elementary), a catalog (Atlan, DataHub), BI tools (Looker, Tableau), and version control (GitHub, GitLab).

Before MCP, an AI agent that needed to investigate a data quality issue had to integrate with each of these tools individually. With MCP, each tool exposes an MCP server, and any agent can connect to all of them through the same protocol. The agent asks the warehouse for query history, the orchestrator for run logs, dbt for model lineage, and the quality tool for test results -- all through MCP.

Data Workers leverages this by providing 85+ integrations through MCP servers. Each integration is an MCP server that any MCP-compatible client can use -- not just our agents. If you are using Claude Code, Cursor, or any other MCP client, you can connect to Data Workers' MCP servers and get the same tool access that powers our 15-agent swarm.

The Network Effect: MCP's Growing Ecosystem

The TCP/IP analogy predicts what happens next: a network effect that accelerates adoption. As more tools build MCP servers, agents become more capable. As agents become more capable, more tools invest in MCP servers. The ecosystem compounds.

This is already happening. Major data platforms -- Snowflake, dbt, Databricks, Postgres, MongoDB -- all have MCP servers. AI platforms -- Claude Code, Cursor, VS Code, Windsurf -- all support MCP clients. Open-source MCP servers are proliferating on GitHub. The protocol is at the inflection point where adoption becomes self-reinforcing.

For data teams, the implication is clear: invest in MCP-native tooling now. Tools that speak MCP will integrate with everything. Tools that do not will become the IBM SNA of the agent era -- capable but isolated. Data Workers' MCP-native architecture, released under Apache 2.0 license, ensures your agent infrastructure is compatible with the emerging standard.

Data Workers is built MCP-native from the ground up. Our 15 agents connect to 85+ data tools through the MCP protocol -- the same standard supported by Claude Code, Cursor, and the growing agent ecosystem. Book a demo or explore the docs to see universal agent connectivity in action.

See Data Workers in action

15 autonomous AI agents working across your entire data stack. MCP-native, open-source, deployed in minutes.

Book a Demo

Related Resources

Explore Topic Clusters