Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Senior System Reliability Engineer, Location: Santa Clara, CA

Page: 1

Senior System Reliability Engineer

efficient and reliable systems is an imperative. We are looking for a System Reliability Engineer to join NVIDIA's existing... Reliability Engineering team, involved in NVIDIA's diverse system product range specifically Graphics and High-Performance...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 05 Nov 2024

Senior Site Reliability Engineer, Omniverse Cloud Platform

offering! We are seeking a highly motivated Senior Site Reliability Engineer to join our Omniverse Infrastructure organization... for us. Are you a creative and autonomous Site Reliability Engineer, who loves challenges? Do you have a genuine passion for advancing the state...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 06 Nov 2024

Senior Site Reliability Engineer

with the necessary resources and scale to foster innovation. We are seeking a Senior Site Reliability Engineer (SRE... reliability metrics to track and improve system and service reliability. Oversee capacity and performance management...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Oct 2024

Senior Site Reliability Engineer - Internal AI Research Clusters

Compute Clusters. As a Site Reliability Engineer, you will help us with the strategic challenges we encounter including... and reliability aspects of large scale large scale distributed systems with focus on performance at scale, real time monitoring...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 25 Sep 2024

Senior Site Reliability Engineer - AI Research Clusters

to improve researchers productivity. As a Site Reliability Engineer, you will help us with the strategic challenges we encounter... and operate these clusters at high reliability, efficiency, and performance and drive foundational improvements and automation...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 12 Sep 2024

Senior Site Reliability Engineer - DGX Cloud

Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale... at NVIDIA ensures that our internal and external facing GPU cloud services run maximum reliability and uptime as promised to the...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Oct 2024

Senior Software Engineer, Reliability and Operational Excellence - DGX Cloud

DGXC SRE at NVIDIA ensures that our internal and external facing GPU cloud services run maximum reliability and uptime... as promised to the users and at the same time enabling developers to make changes to the existing system through careful...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 27 Sep 2024

Principal Site Reliability Engineer (Cloud Management)

Alto Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the Cloud... Your Experience 6+ years as an engineer in Infrastructure, Operations, DevOps, or System Engineering 4+ years building high...

Location: Santa Clara, CA
Posted Date: 08 Nov 2024
Salary: $147000 - 237500 per year

Principal Site Reliability Engineer (Cloud Management)

Alto Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the Cloud... Your Experience 6+ years as an engineer in Infrastructure, Operations, DevOps, or System Engineering 4+ years building high...

Location: Santa Clara, CA
Posted Date: 07 Nov 2024
Salary: $147000 - 237500 per year

Sr Staff Site Reliability Engineer (Cortex)

management in Site Reliability Engineering DevOps/SRE Expertise - 5+ years of experience as a DevOps/SRE engineer with a passion... into our systems’ performance and health. Your Impact As a Senior Staff SRE with the Cortex Observability team, you will: Cloud...

Location: Santa Clara, CA
Posted Date: 06 Nov 2024
Salary: $126000 - 203500 per year

Sr Staff Site Reliability Engineer (Cortex)

management in Site Reliability Engineering DevOps/SRE Expertise - 5+ years of experience as a DevOps/SRE engineer with a passion... into our systems' performance and health. Your Impact As a Senior Staff SRE with the Cortex Observability team, you will: Cloud...

Location: Santa Clara, CA
Posted Date: 06 Nov 2024
Salary: $126000 - 203500 per year

IC Reliability Engineer

and operations teams to create board and system reliability test plans for various products such as GPU, Server, Automotive, Gaming... world. Join us at the forefront of technological advancement. What you will be doing: Reliability (DfR) qualification...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 05 Nov 2024
Salary: $124000 - 195500 per year

Senior System Level Product Engineer

post-silicon Senior System Level Product Engineer who is passionate and committed to making a difference in the world.... You will cover products in all business units that Nvidia provides solutions for. Prior experience in the lab with system level post...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 03 Nov 2024

Senior System Software Engineer, Infrastructure Automation

We are now looking for a Senior System Software Engineer to work on NVIDIA is hiring software engineers for its GPU... or architecting (design patterns, reliability and scaling) of new and existing vm/container-based clusters to manage linux/windows...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 04 Oct 2024

Senior Silicon Circuits System Design Engineer

, professional, server, mobile, and automotive markets! What you will be doing: As a Silicon Circuit System design engineer... analysis. Find creative solutions to complex silicon and system-level problems and be on the frontline to lead show-stopper...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 03 Nov 2024

Senior Systems Software Engineer, Containers and Kubernetes

NVIDIA is looking for a hardworking Sr. Systems Software Engineer to work on platform software based on open-source... with System internals of Unix/Unix-like kernels such as Linux. Automation experience with hands-on skills in frameworks...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 01 Nov 2024

Senior System Software Engineer, Metropolis

Engineer to join the Metropolis Performance team! We are looking for an engineer to support and contribute to the... solutions for distributed computing environments. Optimize performance and reliability of cloud applications and services...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 07 Sep 2024

Senior DGX Cloud Software Engineer- Infrastructure Automation and Distributed Systems

with infrastructure automation and distributed systems design developing tools for running large scale private or public cloud system in...) associated systems. Experience working with or developing multi-cloud infrastructure services. Experience teaching reliability...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 26 Sep 2024

Ultra Pure Water Systems Engineer

infrastructure. The position is an Ultra Pure Water Systems Engineer supporting tool installs, new factory upgrades, and site..., projected durations, safety requirements, and resourcing requirements. Ultra Pure Water Systems Engineer Requires...

Location: Santa Clara, CA
Posted Date: 31 Oct 2024
Salary: $120000 - 165000 per year

Senior Hardware Customer Quality Engineer

We are looking for a Senior Hardware Customer Quality Engineer in an individual contributor role to support... reliability standards at the component, board, or system level Knowledge of server system hardware and design, development...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 07 Nov 2024