Monitoring StatusCake… With StatusCake

How does the monitor monitor the monitor? No, it’s not a tongue twister, but a question we faced when big organisations such as the BBC, the NHS and EA, to name but a few, started to join StatusCake.

What do we do?

StatusCake now monitors over 250,000 websites and has tens of thousands of users who rely on it to let them know if their site goes down. If StatusCake itself ran into difficulties, the impact on our users would be widespread: alerts would be delayed and downtime would be missed. So it’s clear we needed a good monitoring system of our own.

StatusCake was built around the principle of easily deployable nodes that can come up and go down without impacting service quality. We have a high level of redundancy: around 50% of our node servers can go down at any one point without affecting check rates or alert quality at all. Each node is independent of the others. It takes on a share of the workload while also holding a copy of the entire system’s workload at all times. Thanks to this independent structure, a node that cannot connect to the master servers simply carries on testing the servers that have been assigned to it. To ensure tests are not duplicated, the nodes talk to each other, letting every node know which ones are having trouble and what extra workload to pick up as a result.
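To make the idea concrete, here is a minimal sketch of how independent nodes could agree on who tests what without duplicating work. It is not StatusCake's actual implementation: the function name `assign_checks`, the node and site names, and the use of rendezvous (highest-random-weight) hashing are all assumptions chosen for illustration. The only things taken from the description above are that every node holds the full workload and knows which peers are in trouble.

```python
import hashlib


def assign_checks(checks, nodes, healthy):
    """Map every check to exactly one healthy node.

    Hypothetical helper: each node is assumed to know the full list of
    checks and the set of peers currently reported healthy, so every
    node can run this locally and arrive at the same answer.
    """
    live = sorted(n for n in nodes if n in healthy)
    if not live:
        raise RuntimeError("no healthy nodes available")

    def score(node, check):
        # Rendezvous hashing: a stable pseudo-random score per (node, check).
        return hashlib.sha256(f"{node}:{check}".encode()).hexdigest()

    # The healthy node with the highest score owns the check.
    return {check: max(live, key=lambda n: score(n, check)) for check in checks}


if __name__ == "__main__":
    checks = [f"site-{i}.example" for i in range(8)]
    nodes = ["node-a", "node-b", "node-c", "node-d"]

    everyone_up = assign_checks(checks, nodes, set(nodes))
    half_down = assign_checks(checks, nodes, {"node-a", "node-c"})

    for check in checks:
        print(check, everyone_up[check], "->", half_down[check])
```

With this scheme, a check only changes owner when its current owner is marked unhealthy, so losing half the nodes moves just the orphaned checks onto the survivors and every check still has exactly one tester.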

That reduces the chance of something going wrong to an extremely low level, and it means we can use StatusCake to monitor StatusCake. We have a StatusCake account set up that monitors all our servers, even the HTTP server! If any part of StatusCake were to go down, another part would notify us almost instantly. We don’t believe there is such a thing as too much redundancy when it comes to assuring our users that their monitoring will remain in place, no matter what difficulties arise.

James Barnes

