Incident Response is at the heart of high availability of Amazon Web Services. We make customer-impacting events shorter... and less frequent by providing large-scale event and incident management. Our automated tooling quickly identifies the cause of an issue...
Define SLOs, SLIs, monitoring, alerting and incident response practices. Set the bar for observability excellence within the... About The Role 🤔: As a Senior Site Reliability Engineer at Immutable, you will have the autonomy to influence our SRE...
, and procedures (TTPs). Experience with, and a deep understanding of, digital forensics and incident response capabilities... - receive support and coaching to progress your career Preventive and supportive mental health initiatives...
and optimise third-party service integrations, particularly for critical services Incident Management & Response Establish... that integrate with our observability stack Create and maintain incident response playbooks and automated runbooks Lead post...