Want to know how much website downtime costs, and the impact it can have on your business?
Find out everything you need to know in our new uptime monitoring whitepaper 2021



The StatusCake.com team are going to be at a couple of events this week. We’d love to catch-up with any of our customers, as well as meet anyone interested in learning more about StatusCake, and sharing knowledge. If you’d like to arrange to meet with us please email us.
Our dev team and Head of Partnerships will be at DTX Europe this Wednesday at ExCel London.
Our teams are looking to talk with many of our existing partners as well are share knowledge on everything from Digital Transformation, AI, DevOps, networks, cloud, as well as the exploring what makes all of this possible, our people and teams themselves and looking at some of the cultural habits needed to support an agile DevOps team.
Our dev team will be at this meet up which features two speakers, the first by Roman Khavronenko of Victoria Metrics on how they build a fast and scalable open source time-series database “TSDB”, and the second speaker Mark Ottaway of Sohonet of monitoring with OpenNMS.
Speakers: @RomanHavronenko & Mark Ottaway.
Share this
3 min read In the previous posts, we’ve looked at how alert noise emerges from design decisions, why notification lists fail to create accountability, and why alerts only work when they’re designed around a clear outcome. Taken together, these ideas point to a broader conclusion. That alerting is not just a technical system, it’s a socio-technical one. Alerting
3 min read In the first two posts of this series, we explored how alert noise emerges from design decisions, and why notification lists fail to create accountability when responsibility is unclear. There’s a deeper issue underneath both of those problems. Many alerting systems are designed without being clear about the outcome they’re meant to produce. When teams
3 min read In the previous post, we looked at how alert noise is rarely accidental. It’s usually the result of sensible decisions layered over time, until responsibility becomes diffuse and response slows. One of the most persistent assumptions behind this pattern is simple. If enough people are notified, someone will take responsibility. After more than fourteen years
3 min read In a previous post, The Incident Checklist: Reducing Cognitive Load When It Matters Most, we explored how incidents stop being purely technical problems and become human ones. These are moments where decision-making under pressure and cognitive load matter more than perfect root cause analysis. When systems don’t support people clearly in those moments, teams compensate.
4 min read In the previous post, we looked at what happens after detection; when incidents stop being purely technical problems and become human ones, with cognitive load as the real constraint. This post assumes that context. The question here is simpler and more practical. What actually helps teams think clearly and act well once things are already
3 min read In the previous post, we explored how AI accelerates delivery and compresses the time between change and user impact. As velocity increases, knowing that something has gone wrong before users do becomes a critical capability. But detection is only the beginning. Once alerts fire and dashboards light up, humans still have to interpret what’s happening,
Find out everything you need to know in our new uptime monitoring whitepaper 2021