As the big game approaches this Sunday, I’ve been thinking about the NFL’s introduction of instant replay and how it makes the league much more enjoyable! Whether you’re rooting for the Patriots led by Tom Brady … or the Rams, you can’t deny that instant replay makes every Super Bowl much more efficient and adds … Continued
ITSM incident management might seem like a lot of words and letters thrown together. However, when examined in the light of managing changes to IT functionality, you quickly realize their importance. ITSM incident management quickly become realized as a way to define how teams should organize themselves and operate their IT services. And key to … Continued
Businesses and organizations shouldn’t simply rely on monitoring tools for security management. Such tools don’t provide redundancies, time-stamped audit trails and other elements needed for incident resolution. Also, security threats are rampant and tend to go unchecked even with the most reliable monitoring service. That’s why companies require critical alerting to become aware of security … Continued
Managing alert noise from monitoring systems like SolarWinds can be tricky and failing to order the noise can cause: Alert fatigue: too many alerts waking engineers up at night will not only cause tired engineers, but also hurt your team’s effectiveness at maintaining effectiveness. Decreased MTTR: Because there are too many alerts, it will take extra … Continued
Imagine you’re the manager for the IT Operations for a multimillion-dollar retail chain. The chain not only has numerous stores throughout the U.S. but also a robust online presence. Now imagine that you need to conduct security and software updates on the company’s servers. The update will end up disrupting store services for 30 minutes … Continued
OnPage is an incident alert management platform and smartphone app that allows you to: Consolidate IT alerts onto one platform Add intelligent alerting and escalation workflows to systems and sensors that detect anomalies Connect to stakeholders and customers using real-time call routing Manage incident responders and stakeholders through: secure messaging, live ticket updates real-time reporting … Continued
OnPage Corp. just finished a survey of more than 100 ITOps professionals from across the United States. Our goal was to acquire a greater understanding of how well engineers in the industry are performing when it comes to critical alerting and alert management of their IT teams. We wanted to understand the antecedents of alert … Continued
OnPage releases new voicemail features In an era where voice mail is ubiquitous, our customers have been asking for the ability to receive voicemail attachments on their OnPage messages. You know, there are times when you need to send a critical message to your physician or IT professional with a voice mail attachment. And so … Continued
Blameless post-mortems allow us to examine mistakes in a way that focuses on the situational aspects of a failure’s mechanism and the decision-making process of individuals proximate to the failure. – The DevOps Handbook The engineers at Google describe post-mortem reporting as a “written record of an incident, its impact, the actions taken to mitigate … Continued
The following is an excerpt form the thesleuthjournal.com Having an IT team on-call is very important to ensuring your company’s end product retains its high level of quality. Without this component, it would be considerably harder for the IT team to get the information they need when issues do arise. Additionally, it would also be … Continued