Position: Monitoring and Operations Engineer / ITSM Incident Commander Location: 100% Remote Duration: 12+ Months... Contract Interview: Video Role Overview: Our IT Service Management (ITSM) team composed of highly skilled Incident...
Solutions has a great opportunity for an Incident Response Manager with a history of leading major incident responses.... Job Description Our Incident Response Manager (aka Incident Commander) leads and directs the team during major incidents related to the Cloud Video...
, work location, work experience and other individualized factors Description As an Incident Manager, you will be a part... Incident Management process. You will be working with your global incident management team to assess the severity of reported...
security, incident and crisis management programs. At Datadog, we place value in our office culture - the relationships... American regions. 5+ years as a fulltime people manager is essential Datadog values people from all walks of life...
efficiencies, best-in-class tooling, analytics, and programs. We are seeking a Senior Program Manager to elevate Datadog...Datadog's Recruiting Solutions team is on a mission to scale and optimize the department through operational...
As an Engineering Manager on the Metrics Alerting team, you and your team will be responsible for the reliable... scheduling and execution of queries that power Datadog's most critical infrastructure monitoring features. Our platform performs...
As an Engineering Manager on the Metrics Platform, you and your team will be responsible for the reliable scheduling... and execution of queries that power Datadog's most critical infrastructure monitoring features. Our platform ingests...
The Manager - Global DataOps & DevOps will play a crucial role in designing, implementing, and maintaining automation... and continuous improvement of global support processes. The manager will partner closely with Data Engineering leads, Platform Owners...
IT Manager Minneapolis, MN (onsite/hybrid) Full-time/Direct-hire + Benefits Must be a US Citizen Position Overview... The IT Manager will oversee daily technology operations and lead a team of IT professionals to deliver reliable technical...
The Manager - Global DataOps & DevOps will play a crucial role in designing, implementing, and maintaining automation... and continuous improvement of global support processes. The manager will partner closely with Data Engineering leads, Platform Owners...
Designation GROUP MANAGER No. of Positions 1 Experience 7-12 Years Skill (Primary) Cloud Services...-Autonomics-Program Management - Program Manager Qualification B-Tech Expected Date of Closure 01-Jan-2025 Employee...
Reliability Engineering (SRE) Manager to lead our growing SRE team and play a critical role in driving operational excellence... with industry best practices. Enhance system observability by integrating tools like Datadog for monitoring, alerting...
and support for the team's products, and leading incident management efforts to drive timely resolutions. Further, this manager...Our Opportunity: Chewy is hiring a Software Development Manager for our Fulfillment Support Applications (FSA) team...
We are seeking an experienced and dynamic Manager of Infrastructure Platform Engineering to lead and manage a team responsible..., Datadog, New Relic) Strong knowledge of infrastructure-as-code tools and frameworks (e.g., Terraform, Ansible, CloudFormation...
with logging and monitoring tools such as ELK, DataDog or NewRelic, LogEntries, SumoLogic, etc. Experience with Incident... for employees and clients, aligned to company D&I goals. Manager of Process & Data: Demonstrates deep process knowledge...
years of experience in ad technology, operations, support or incident management. Deep understanding of Exchanges, Real... like Google Ad Manager, Freewheel or similar system Experience managing a support ticket queue in JIRA, Zendesk, or similar...
and optimizing our cloud-based environments (Kubernetes/GCP) in the US region, ensuring robust monitoring, alerting, and incident... tools. (such as Grafana, Prometheus, Thanos, Loki, DataDog, Open Telemetry). Experience with CI/CD pipelines & platforms...
and optimizing our cloud-based environments (Kubernetes/GCP) in the US region, ensuring robust monitoring, alerting, and incident... tools. (such as Grafana, Prometheus, Thanos, Loki, DataDog, Open Telemetry). Experience with CI/CD pipelines & platforms...
CloudFormation, or Azure Resource Manager. - Ensure high availability and scalability of cloud services to support critical... applications. - **Observability and Monitoring:** - Implement comprehensive observability solutions using tools like Datadog...
such as Jenkins, GitHub Actions, or Azure DevOps. Participate in on-call rotations and incident response, providing timely resolution... (IaC) tools such as Terraform, AWS CloudFormation, or Azure Resource Manager. Collaborate with cross-functional teams...