incident management

Why proper incident management is key to proper IT management

Proper IT management requires proper incident management. Otherwise, you court Murphy’s law at your peril. In the IT world, if a server can fail, a cache overload or traffic overload the network – it will. And the consequences are significant.

Many IT organizations face database, hardware, and software downtime, lasting short periods to shutting down the business for days. According to a January 2016 article in Network Computing on the high price of IT downtime, organizations face:

“an average of five downtime events each month, with each downtime event being expensive indeed: from $1 million a year for a typical midsize company to more than $60 million for a large enterprise.”

The major cause of this downtime is equipment failures which account for almost 40% of downtime. The second most frequent cause of downtime is human error which accounts for 25% of downtime. Cybersecurity accounts for only about 10% of this downtime. Yet in each of these cases, traditional workflows use emails to alert those in charge of downed networks. The use of email alerts assumes – falsely – that an email will get the attention of a data center manager. Yet data managers are faced with 100s of other emails per day. Clearly, an email doesn’t break through the noise and get noticed in this instance.

Best practices for effective incident management during downtime

While effective use of network monitoring tools is required to minimize the impact of downtime, using emails to provide effective response means you are expecting the person responding to the incident is sitting at their computer or hovering over their iPhone. And what happens when the servers go down at 3am? One hopes even the most devoted of employees is asleep at that hour.

Furthermore, traditional pagers are inadequate as they go off and then go silent. Pagers, when used either as an alternative to email or in addition, don’t always escalate and they don’t persistently get the attention of the necessary individual. Instead, you need data security control tools coupled with proper incident management applications. This means, that when incidents do occur the appropriate individuals are alerted and the alerts don’t stop until the requisite action happens.

Impact of having solutions in place

Mitigating downtime requires good workflows, human response and – most importantly – proper alarms to alert relevant individuals when things go wrong. Proper incident notification is crucial to effect management of IT downtime. And there’s more than just the cost savings. There’s also the savings to reputation. If a company frequently experiences downtime to its IT infrastructure, then it is courting a besmirched reputation for lacking reliability. When a company has a bad reputation, business is more difficult and costly to conduct. Much of the writing on customer service notes that it is more difficult to retain customers and important stakeholders when a company’s reputation is damaged. This, in turn, makes the costs of doing business significantly higher.

Conclusion

Of great importance in this is that while you cannot avoid every incident, you can ensure proper incident management. In their attempts to provide proper alerts when trouble raises its ugly head and things go south, heads of IT need to ensure there are proper alerts that rise above the clutter.

Want to learn more about how alerts helped one IT team drive down response time? Download our whitepaper.

OnPage Corporation

Share
Published by
OnPage Corporation

Recent Posts

Top MSP Conferences of 2025

Conferences are a great way for MSPs of all sizes to establish long-term partnerships and…

2 weeks ago

Top 3 experiences we are looking forward to at HIMSS25!

The OnPage team is thrilled to announce that we will be attending HIMSS25! As we…

3 weeks ago

What’s New: Annotate Messages for additional context

We’re thrilled to introduce Personal Message Notes, a new feature designed to enhance the way…

3 weeks ago

What’s New: Supercharge workflows with Message Templates

We’re excited to introduce Message Templates, a powerful new feature designed to streamline communication and…

3 weeks ago

What’s New: OnPage Unveils Multiple Account Login

We’re thrilled to announce the launch of OnPage’s new Multiple Account Login feature. Designed to…

2 months ago

Home Call Survival Guide

Whether it's your first or hundredth home call shift, preparing yourself both physically and mentally…

2 months ago