Chapter 13 of 24

Integration Spaghetti: Point-to-Point vs. Platform

When systems connect directly to each other without a coherent integration architecture, the result is spaghetti: fragile, expensive to maintain, and nearly impossible to understand. Here's why it happens and what to do about it.

4 min read

Overview

At some point in its growth, almost every enterprise technology organization creates an architecture diagram of its systems and the connections between them. The first time this diagram is drawn honestly, it frequently looks like a plate of spaghetti. Lines cross in every direction. Systems connect to other systems without apparent logic. Integrations overlap and branch in ways that nobody planned and nobody can fully explain.

Diagram

This is integration spaghetti. It is one of the most common and most costly problems in enterprise IT. It is almost always the result of the same pattern: integration decisions made one at a time, to solve immediate needs, without a coherent architecture to govern how systems should connect.

How Point-to-Point Integration Happens

Imagine an organization with five core systems: an ERP, a CRM, an HR system, a billing platform, and a customer portal.

The CRM needs customer financial data from the ERP. An integration is built: a nightly batch job extracts financial data from the ERP and loads it into the CRM. Done.

A year later, the billing platform needs the same customer data from the ERP, but also needs deal information from the CRM. Two more integrations are built.

Another year passes. The customer portal needs to display account information from the billing platform and support history from the CRM. Two more integrations.

The HR system needs to sync employee data to the ERP for payroll purposes. Another integration.

Each of these integrations solved a real problem. But look at what has been built: a web of direct connections between systems, each built independently, each with its own logic, its own error handling, its own schedule. If the ERP's data format changes, multiple integrations may break simultaneously. If the CRM's API is unavailable, everything that reads from it fails. There is no central place to monitor what is running and what is failing. Tracing whether the customer portal shows current data requires following a chain through multiple integrations, each with different latency and error characteristics.

This is point-to-point integration. When it exists between five systems, it is manageable. When it exists between fifty systems — as in many large enterprises — it is the spaghetti problem.

The Geometric Problem

The reason spaghetti gets so bad so quickly is mathematical. With n systems that all need to talk to each other directly, the maximum number of connections is n times (n minus 1) divided by two. With five systems, that is ten potential connections. With ten systems, it is forty-five. With fifty systems, it is 1,225.

Not every system needs to talk to every other system. But in a large enterprise, the number of actual integrations can easily reach into the hundreds or thousands. Each integration is a potential point of failure. Each requires monitoring, documentation, and maintenance. The aggregate cost is enormous.

Platform-Based Integration

The alternative to point-to-point is hub-and-spoke. Instead of systems connecting directly to each other, they all connect through a central integration platform.

The integration platform sits in the middle. It receives data from source systems, applies transformation rules to convert data from one system's format to another, and routes the data to the destination system. Instead of n-squared connections, you have n connections, one from each system to the platform.

The advantages are significant. Monitoring is centralized: all integrations are visible in one place, and failures are immediately apparent. Changes to one system's data format only need to be addressed in the platform, not in every downstream integration. New integrations can often be built faster because the connections to existing systems are already established.

The integration platform comes in several flavors. An Enterprise Service Bus (ESB) is the traditional enterprise approach: a heavyweight, on-premises integration layer that handles message routing, transformation, and orchestration. More modern approaches include Integration Platform as a Service (iPaaS), cloud-based platforms like MuleSoft, Dell Boomi, or Azure Logic Apps that provide similar capabilities with less infrastructure overhead. Event streaming platforms like Apache Kafka handle high-volume, real-time data flows.

The Trade-Offs

Platform-based integration is better than spaghetti. It introduces its own challenges, however.

The integration platform becomes a single point of failure. If the platform is unavailable, all integrations stop. This requires the platform itself to be highly available and carefully managed.

The platform adds latency. Data routed through a platform takes longer to arrive than data transferred directly. For real-time use cases that need data in milliseconds, a platform may introduce unacceptable delay.

The platform requires investment. Building and maintaining an integration platform is not cheap. The initial setup, the training for the team that manages it, and the ongoing management are significant costs. For small organizations with few integrations, the cost may exceed the benefit.

A platform does not eliminate the need for good integration design. A poorly designed integration running through a well-managed platform is still a poorly designed integration. The platform improves manageability. It does not substitute for thoughtful architecture.

What This Means for AI

Integration architecture matters for AI in a direct way: AI systems need data, and data gets to AI systems through integrations.

An AI system that needs real-time data from multiple operational systems is at the mercy of the integration architecture. If the architecture is spaghetti, the data will be inconsistent, late, or missing, and the AI system will reflect that in its outputs.

When planning an AI initiative, one of the first architectural questions should be: how will data flow from the source systems to the AI system? Through what integrations? What is the latency? What happens when one of those integrations fails? The answer to these questions shapes the design of the AI system as much as any model or algorithm decision.