Evolving to a Resilient API Future: Lessons from the Past, Tools for the Future

Published on July 23, 2025

If you would prefer a visual, narrated version of this blog post, feel free to watch my talk at WWDC25.

APIs are the digital backbone of modern software, powering everything from online banking and retail to AI-powered automation and connected vehicles. They form the foundation of digital transformation strategies, but as their ubiquity grows, so does the impact of their failure. In this post, we’ll explore the historical pitfalls of API design, the challenges posed by modern microservice architectures, and how contract testing offers a path toward resilient, scalable, and future-ready APIs. 

APIs: The critical infrastructure of the digital age 

APIs aren’t just developer tools anymore – they’re mission-critical infrastructure. Whether it’s unlocking a car via a mobile app or facilitating multi-billion dollar e-commerce transactions, APIs enable seamless digital interactions. But as reliance on APIs deepens, so do the consequences of poorly designed, insecure, or untested endpoints. 

What we’ve learned from 20+ years of API failures 

Let’s explore a few high-profile examples of API-related failures: 

1. Zombie APIs and legacy debt 

APIs are only secure and manageable when they’re visible. A legacy T-Mobile application, still using outdated encryption, was exploited to expose data on 37 million accounts – highlighting the dangers of forgotten endpoints. 

https://www.bleepingcomputer.com/news/security/t-mobile-hacked-to-steal-data-of-37-million-accounts-in-api-data-breach/

2. Insecure design 

Kia discovered the hard way that unsecured API endpoints could allow unauthorized remote access to vehicles. Poor upfront security design meant the APIs could be abused with nothing more than a license plate number. 

https://vicone.com/blog/now-patched-kia-vulnerabilities-could-have-allowed-remote-control-using-only-a-license-plate-number

3. Knowledge loss and poor documentation 

Twitter (now X) faced major internal disruptions when technical staff were let go without preserving essential knowledge. One internal change took down parts of the platform – proof that tribal knowledge is not a scalable strategy. 

https://www.theverge.com/2023/3/6/23627875/twitter-outage-how-it-happened-engineer-api-shut-down

The growing complexity of API landscapes 

Modern software development has evolved dramatically: 

  • From monoliths to microservices 
  • From waterfall to agile/DevOps 
  • From on-prem to cloud-native, distributed systems 

Each of these transitions introduces new failure points. With microservices, for example, a single application may be split across hundreds of services. This leads to “microservice sprawl,” where system complexity becomes unmanageable. 

A visual metaphor often used is the “microservices death star” – an explosion of interdependent services, each introducing potential failure points and maintenance overhead. The image below shows popular services in their relative infancy; the diagrams are around 10 years old, yet even then these were incredibly complex systems, and they will only have grown since. 

Key stats (State of Software Quality – API 2023): 

https://smartbear.com/state-of-software-quality/api

  • 61% of API growth stems from microservices 
  • 81% of companies operate in multi-protocol environments 
  • 57% use three or more protocols (REST, GraphQL, Kafka, etc.) 

According to the State of API 2023 report, there are a number of obstacles to producing APIs, the most common of which is a lack of time. The second largest is a lack of people, followed by a lack of API design skills. 
 

Design skill gaps may contribute to an over-proliferation of microservices. This is problematic: the same report notes that having too many APIs or microservices is the sixth biggest obstacle to producing APIs, feeding the “microservice sprawl” described above. 

https://smartbear.com/state-of-software-quality/api/challenges/#

The testing dilemma in a microservice world 

Testing in modern systems is harder than ever. A quote from a real-world API team illustrates this: 

“We’ve got GraphQL, REST, Kafka, EventBridge… Testing inside a highly volatile set of integrated environments is extremely challenging. We need faster feedback and more reliability.” 

So, why does traditional testing fall short? 

  • End-to-end testing is flaky, slow, and brittle 
  • Integration environments are expensive and complex to maintain 
  • Unit tests are fast but provide little confidence in integration behaviour 

The testing pyramid: Balancing speed and confidence 

To balance cost, confidence, and coverage, the testing pyramid offers guidance: 

  • Base: Unit tests – fast and cheap, but they cover little of the integration behaviour between components 
  • Middle: Integration tests – still reasonably fast, with high confidence in integrations 
  • Top: End-to-end tests – the highest confidence, but fragile and slow 

We might be tempted to shift our integration testing concerns down into unit tests, but hand-written mocks bring a false sense of confidence for developers, as the sketch below illustrates. 
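A minimal, hypothetical sketch of that false confidence (assuming a Jest-style runner and Node 18+ for the built-in fetch and Response): the unit test stubs the provider call with our own assumptions, so it keeps passing even after the real provider renames a field.

```typescript
// The consumer code under test: a hypothetical ProductService client.
type Product = { name: string };

async function getProductName(fetchFn: typeof fetch, url: string): Promise<string> {
  const response = await fetchFn(url);
  const product = (await response.json()) as Product;
  return product.name;
}

test("returns the product name (false confidence)", async () => {
  // The stub encodes what *we think* the provider returns...
  const stubFetch = (async () =>
    new Response(JSON.stringify({ name: "Widget" }))) as typeof fetch;

  // ...so this passes even if the real provider has drifted to `productName`,
  // and the breakage only surfaces in an integrated environment or production.
  expect(await getProductName(stubFetch, "https://products.example/products/10")).toBe("Widget");
});
```

Contract testing, covered later in this post, exists to close exactly this gap.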

The role of API specifications 

API specifications like OpenAPI and AsyncAPI have become the standard for documenting APIs: 

  • OpenAPI for RESTful services 
  • AsyncAPI for event-driven architectures 

These specifications serve as machine- and human-readable contracts. They enable automation in SDK generation, testing, governance, and many other aspects of the software development lifecycle. 

But they come with pitfalls: 

  • Provider drift: When the implementation diverges from the spec. 
  • Consumer drift: When changes to the spec break real-world consumers. 
  • Versioning chaos: Without insight into actual usage, it’s hard to manage versions effectively. 
  • Lack of governance: When teams build APIs independently without central standards, inconsistency follows. 

Tools like Spectral (from Stoplight, now part of SmartBear) and Vacuum help enforce governance rules through custom linting of OpenAPI/AsyncAPI specifications. 
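As an illustration, here is a minimal sketch of a custom governance rule using Spectral's programmatic API (assuming the @stoplight/spectral-core and @stoplight/spectral-functions packages; the rule name and spec fragment are invented for this example):

```typescript
import { Spectral } from "@stoplight/spectral-core";
import { truthy } from "@stoplight/spectral-functions";

async function main() {
  // One custom governance rule: every API definition must declare a contact.
  const spectral = new Spectral();
  spectral.setRuleset({
    rules: {
      "api-must-have-contact": {
        description: "APIs without contact details are hard to govern and support.",
        given: "$.info",
        severity: "error",
        then: { field: "contact", function: truthy },
      },
    },
  });

  // A deliberately non-compliant spec fragment: info.contact is missing.
  const openApiDocument = `
openapi: 3.0.3
info:
  title: Products API
  version: 1.0.0
paths: {}
`;

  const results = await spectral.run(openApiDocument);
  for (const result of results) {
    console.log(`${result.code}: ${result.message}`);
  }
}

main();
```

Run in CI against every spec in your portfolio, rules like this keep independently built APIs consistent.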

Enter contract testing: Verifying real integration needs 

Contract testing solves a central problem in API development: verifying that consumers and providers agree on how to interact – without needing to spin up full integration environments.

What is contract testing? 

Contract testing validates: 

  • The consumer’s expectations of the API 
  • The provider’s ability to meet those expectations 

Rather than relying on mocks built from guesses, contract tests are driven by the real interactions consumers actually need.
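To make that concrete, here is a minimal consumer-side sketch using Pact JS (the @pact-foundation/pact package, assuming a Jest-style runner and Node 18+ for fetch). The WebApp consumer, ProductService provider, endpoint, and fields are all hypothetical:

```typescript
import { PactV3, MatchersV3 } from "@pact-foundation/pact";

const { like } = MatchersV3;

// Hypothetical consumer and provider names, used only for illustration.
const provider = new PactV3({ consumer: "WebApp", provider: "ProductService" });

describe("ProductService contract", () => {
  it("returns a product by id", () => {
    provider
      .given("a product with id 10 exists")
      .uponReceiving("a request for product 10")
      .withRequest({ method: "GET", path: "/products/10" })
      .willRespondWith({
        status: 200,
        headers: { "Content-Type": "application/json" },
        body: like({ id: "10", name: "Widget", price: 9.99 }),
      });

    // Pact starts a mock provider generated from the expectations above,
    // runs the consumer code against it, and records the interaction as a contract.
    return provider.executeTest(async (mockServer) => {
      const response = await fetch(`${mockServer.url}/products/10`);
      const product = await response.json();
      expect(product.name).toBe("Widget");
    });
  });
});
```

When the test passes, Pact writes the interactions to a JSON pact file – the contract that then flows through the workflow below.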

The Pact workflow: 

  1. Consumers write tests → Pact generates contracts (as JSON) 
  2. Contracts are published to a broker (e.g., PactFlow) 
  3. Providers verify these contracts against real code 
  4. Safe deployments are validated via can-i-deploy tooling 
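On the provider side (step 3), verification replays every published contract against a running instance of the provider. A minimal sketch using Pact JS's Verifier, with placeholder broker and provider details:

```typescript
import { Verifier } from "@pact-foundation/pact";

// Fetches the contracts published for "ProductService" from the broker,
// replays each interaction against the locally running provider, and
// publishes the verification results back to the broker.
new Verifier({
  provider: "ProductService",                     // placeholder provider name
  providerBaseUrl: "http://localhost:8080",       // the provider under test
  pactBrokerUrl: "https://your-org.pactflow.io",  // placeholder broker / PactFlow URL
  pactBrokerToken: process.env.PACT_BROKER_TOKEN,
  publishVerificationResult: true,
  providerVersion: process.env.GIT_COMMIT,        // tie results to a specific build
})
  .verifyProvider()
  .then(() => console.log("Pact verification complete"));
```

With verification results in the broker, step 4 becomes a simple can-i-deploy check in your pipeline before each release.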

Contract testing helps shift validation left in the development lifecycle, enabling: 

  • Safer refactoring 
  • Independent deployments 
  • Fewer integration surprises 

From code to design-first: Bidirectional contract testing 

If your team uses OpenAPI or AsyncAPI specs, PactFlow now supports bidirectional contract testing. This means: 

  • You can verify the API spec against real-world usage 
  • You reduce onboarding time for teams using design-first approaches 

Pact contracts must be both concrete (behavioral) and valid subsets of your spec. This helps eliminate provider drift while maintaining human-readable documentation. 

This capability now extends into SmartBear API Hub, giving API architects and consumers the ability to edit API specifications and see which consumers would be impacted, directly in their editor. 

What’s next? AI, agentic consumers, and Arazzo 

AI-generated contract tests 

To lower the barrier to entry, AI-generated contract tests can be scaffolded from: 

  • OpenAPI specs 
  • Code annotations 
  • Patterns learned from 10+ years of contract testing best practices 

Maximising the value of end-to-end testing, whilst addressing pitfalls 

When we consider our test pyramid, we get the most confidence from the activities higher up the stack. End-to-end tests satisfy us that the core journeys in our system work as expected, but they require a large coordination effort to set up. 
 
These journeys are normally driven through a browser, which is very costly in terms of time and flakiness due to the remote nature of browser interaction. 

Subcutaneous (under the skin) testing lets us drive user-facing journeys, traditionally exercised through the browser, by calling the APIs that bind them instead – see the sketch after the list below. 

✅ More reliable tests 

✅ High-confidence levels 

❌ No leading specification to document these API-focused workflows 
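As a sketch, the same "search then book" journey a browser test would click through can be driven directly against the APIs that bind it (endpoints, payloads, and base URL below are hypothetical; assumes a Jest-style runner and Node 18+ fetch):

```typescript
// Subcutaneous test: exercise the user journey through its API calls,
// skipping the browser entirely. All endpoints and fields are illustrative.
const BASE_URL = process.env.API_BASE_URL ?? "http://localhost:8080";

test("a customer can search for a flight and book it", async () => {
  // Step 1: search, as the UI would from the results page.
  const searchResponse = await fetch(`${BASE_URL}/flights?from=LHR&to=JFK&date=2025-09-01`);
  expect(searchResponse.status).toBe(200);
  const [firstFlight] = await searchResponse.json();

  // Step 2: book the first result, as the UI would at checkout.
  const bookingResponse = await fetch(`${BASE_URL}/bookings`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ flightId: firstFlight.id, passenger: "Ada Lovelace" }),
  });
  expect(bookingResponse.status).toBe(201);

  // The core journey is verified without the cost and flakiness of a browser.
  const booking = await bookingResponse.json();
  expect(booking.flightId).toBe(firstFlight.id);
});
```

The missing piece, as noted above, is a standard way to document these API-level journeys – which is exactly where Arazzo comes in.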

Agentic workflows & Arazzo 

With AI agents poised to become major API consumers, workflow-level testing will be essential. The new Arazzo specification (by the OpenAPI Initiative) defines workflows and user journeys in deterministic, testable ways. 

Example use case: 

  • An agent books travel by composing multiple APIs (search, booking, payment) 
  • Arazzo describes this as a sequence of API calls with clear data flow 
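A rough sketch of what that travel-booking journey might look like as an Arazzo workflow, expressed here as a TypeScript object for consistency with the other examples (a real document would normally be YAML or JSON). The source description, operation IDs, and runtime expressions are illustrative and should be checked against the Arazzo 1.0 specification:

```typescript
// Illustrative Arazzo workflow: search for a flight, book it, then pay.
// All names, operations, and expressions are placeholders for this sketch.
const bookTripWorkflow = {
  arazzo: "1.0.0",
  info: { title: "Book a trip", version: "1.0.0" },
  sourceDescriptions: [
    { name: "travelApi", url: "https://example.com/openapi.yaml", type: "openapi" },
  ],
  workflows: [
    {
      workflowId: "searchAndBookFlight",
      summary: "Search for a flight, book it, then pay for it.",
      steps: [
        {
          stepId: "searchFlights",
          operationId: "searchFlights",
          parameters: [{ name: "from", in: "query", value: "LHR" }],
          successCriteria: [{ condition: "$statusCode == 200" }],
          // Hand the chosen flight's id to the next step.
          outputs: { flightId: "$response.body#/0/id" },
        },
        {
          stepId: "bookFlight",
          operationId: "createBooking",
          requestBody: { payload: { flightId: "$steps.searchFlights.outputs.flightId" } },
          successCriteria: [{ condition: "$statusCode == 201" }],
          outputs: { bookingId: "$response.body#/id" },
        },
        {
          stepId: "payForBooking",
          operationId: "createPayment",
          requestBody: { payload: { bookingId: "$steps.bookFlight.outputs.bookingId" } },
          successCriteria: [{ condition: "$statusCode == 201" }],
        },
      ],
    },
  ],
};

export default bookTripWorkflow;
```

Because every step, success criterion, and data hand-off is explicit, the same document can drive the documentation, testing, and automation benefits listed below.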

This future-focused tooling enables: 

  • Visual flow documentation 
  • Testable orchestrations across APIs 
  • Safe automation in AI-driven environments 

Final thoughts: Resilient APIs require intentional design and testing 

To evolve your APIs for the future, consider this roadmap: 

✅ Use OpenAPI/AsyncAPI to standardize and document 
✅ Enforce rules with tools like Spectral 
✅ Apply contract testing to decouple integration testing 
✅ Use Pact (or equivalent) to enable safe deployments 
✅ Prepare for AI consumers and document your E2E journeys with Arazzo workflows 

With thoughtful governance, scalable testing, and a commitment to consumer visibility, your APIs can evolve gracefully over time, allowing your organisation the freedom to adapt to ever-changing markets and demands.