Blog

Incident management insights, guides, and product updates from Rootly

Search...
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
When You Do DevSecOps, Don’t Forget the SREs

When You Do DevSecOps, Don’t Forget the SREs

It's time to break down the silos separating SREs from security engineers.

Quentin Rousseau

Quentin Rousseau

July 21, 2021
5 min read
De-Siloing Incident Management: How to Make Reliability Engineering Everyone’s Job

De-Siloing Incident Management: How to Make Reliability Engineering Everyone’s Job

4 best practices for breaking down silos and establishing a culture of shared responsibility toward reliability.

JJ Tang

JJ Tang

July 15, 2021
5 min read
Rootly Announces $3.2 Million in Seed Funding from XYZ Venture Capital, 8VC, & Y Combinator

Rootly Announces $3.2 Million in Seed Funding from XYZ Venture Capital, 8VC, & Y Combinator

Rootly is on a mission to create a world where maintaining reliability is frictionless, delightful, and accessible to anyone. Making resolving and learning from incidents every organizations superpower.

Quentin Rousseau

Quentin Rousseau

July 8, 2021
4 min read
The Incident Review: 4 Incidents in Outer Space

The Incident Review: 4 Incidents in Outer Space

From network problems to computer failures, a variety of incidents can disrupt operations for systems in outer space.

JJ Tang

JJ Tang

July 6, 2021
4 min read
7 Essential Tools for SREs

7 Essential Tools for SREs

From chaos engineering to monitoring and beyond, SREs rely on several key types of tools to do their jobs.

Quentin Rousseau

Quentin Rousseau

June 25, 2021
5 min read
Practical Guide to SRE: Incident Severity Levels

Practical Guide to SRE: Incident Severity Levels

Incident severity levels are a measurement of the impact an incident has on the business. Classifying the severity of an issue is critical to decide how quickly and efficiently problems get resolved.

Quentin Rousseau

Quentin Rousseau

June 17, 2021
4 min read
The Incident Review: 4 Times When Typos Brought Down Critical Systems

The Incident Review: 4 Times When Typos Brought Down Critical Systems

Sometimes, as these 4 incidents highlight, major failure results from a mere typo or configuration oversight.

JJ Tang

JJ Tang

June 4, 2021
5 min read
Incident Management vs. Incident Response - What's the Difference?

Incident Management vs. Incident Response - What's the Difference?

What are the differences between incident management and incident response? The answer varies widely depending on whom you ask.

Quentin Rousseau

Quentin Rousseau

May 28, 2021
4 min read
Practical Guide to SRE: Using SLOs to Increase Reliability

Practical Guide to SRE: Using SLOs to Increase Reliability

Service Level Objectives (SLOs) are a key component of any successful Site Reliability Engineering initiative. The question is, what are SLOs; and how do you determine what your SLOs should be? Once you've done that, how should you use them?

Quentin Rousseau

Quentin Rousseau

May 13, 2021
9 min read