Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: System Software Engineer, LLM Inference and Performance Optimization, Location: Santa Clara, CA

Page: 1

System Software Engineer, LLM Inference and Performance Optimization

As a System Software Engineer (LLM Inference & Performance Optimization) you will be at the heart..., including multi-threading, data parallelism, and performance tuning. Validated expertise in LLM inference, with experience in...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 25 Oct 2024

Senior Deep Learning Algorithm Engineer - Agentic LLM Inference

systems into the NVIDIA LLM software stack. Workload Analysis and Optimization: Conduct in-depth analysis, profiling... performance. Ways to stand out from the crowd: 2+ years of experience in building large-scale LLM inference systems...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 25 Oct 2024

Senior DL Algorithms Engineer - Inference Optimizations

recipes for LLM inference as part of NVIDIA Inference Microservices (NIMs). Analyze, validate and debug performance... of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 08 Oct 2024

Senior DL Algorithms Engineer - Inference Optimizations

NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every... performance, we want to hear from you! This role offers an opportunity to directly impact the hardware and software roadmap in...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 18 Aug 2024