Find your dream job NOW!

Keywords: Software Engineer, Model Inference, Location: San Francisco, CA

Software Engineer, Networking - Inference

as accelerating research progression via model inference. About the Role We're looking for a senior engineer to design and build... them to do things that they've never been able to before. We focus on performant and efficient model inference, as well...

Company: OpenAI
Location: San Francisco, CA
Posted Date: 05 Nov 2025

Software Engineer, Load Balancing - Inference

as accelerating research progression via model inference. About the Role We're looking for a senior engineer to design and build... them to do things that they've never been able to before. We focus on performant and efficient model inference, as well...

Company: OpenAI
Location: San Francisco, CA
Posted Date: 17 Oct 2025

Software Engineer, Inference – AMD GPU Enablement

of the OpenAI inference stack on AMD hardware. Integrate internal model-serving infrastructure (e.g., vLLM, Triton... like NCCL/RCCL and understand their role in high-throughput model serving. Have worked on distributed inference systems...

Company: OpenAI
Location: San Francisco, CA
Posted Date: 09 Oct 2025

Senior Inference Platform Engineer - Data Center

, H200s, and B200s, ready to go for experimentation, full-scale model training, or inference. Our client operates high... platform, beginning with cost-efficient batch inference and expanding into low-latency, real-time inference and custom model...

Posted Date: 03 Dec 2025

Software Engineer, Scientific Models (Platform)

and implementing infrastructure for model inference that is fast, reliable, and cost-effective across a diverse set of scientific... modern software to modern science. ROLE OVERVIEW We're a team that's embedding cutting-edge scientific AI models...

Company: Benchling
Location: San Francisco, CA
Posted Date: 15 Nov 2025

Senior Machine Learning Engineer, Model Customization, Generative AI Innovation Center

closely with foundational model providers to optimize AI models for Amazon Silicon, enhancing performance and efficiency..., and Reinforcement Learning with Human Feedback (RLHF) Model Optimization on AWS Silicon: Optimize AI models for deployment on AWS...

Company: Amazon
Location: San Francisco, CA
Posted Date: 13 Dec 2025

Staff Full-Stack Software Engineer

or workflows, including validation of datasets, model predictions, and inference consistency. FD21 For positions... dedicated to delivering scalable, AI-powered software products that elevate how organizations work. We value clean architecture...

Company: ServiceNow
Location: San Francisco, CA
Posted Date: 01 Nov 2025

Sr. Software Engineer, ML Infra

. Twitch is looking for a Senior Software Engineer to join our Machine Learning Infrastructure team. You will work... with software engineers, applied scientists and product managers in our Models and Infrastructure group to build next-generation...

Company: Amazon
Location: San Francisco, CA
Posted Date: 29 Oct 2025

Software Development Engineer

is looking for a Senior Software Engineer to join our Machine Learning Infrastructure team. You will work with software engineers, applied... owning the software and data systems to develop, train and manage our real time and batch models at scale. We own the ML...

Company: Twitch
Location: San Francisco, CA
Posted Date: 29 Oct 2025
Salary: $99,500 - $200,000 per year

Software Engineer II, ML Infra

is looking for a Software Engineer II to join our Machine Learning Infrastructure team. You will work with software engineers, applied... and automate software for ML workflows Optimize cost and performance of training and inference workloads Actively mentor...

Company: Twitch
Location: San Francisco, CA
Posted Date: 24 Oct 2025

Software Engineer II, ML Infra

. Twitch is looking for a Software Engineer II to join our Machine Learning Infrastructure team. You will work with software... - Develop and automate software for ML workflows - Optimize cost and performance of training and inference workloads...

Company: Amazon
Location: San Francisco, CA
Posted Date: 22 Oct 2025

Staff Software Engineer, ML Platform

! About the Role We're seeking an accomplished Staff Software Engineer to join Attentive's Machine Learning Platform team... inference at scale, agentic capabilities, and robust model lifecycle management. What You'll Accomplish Setting Technical...

Company: Attentive
Location: San Francisco, CA
Posted Date: 15 Oct 2025

Staff Software Engineer, Fullstack (Government)

analysis, integrating with model inference pipelines. System Architecture: Architect and optimize high-throughput, service...; active clearances are a plus. We’re looking for a Full Stack Engineer with strong backend expertise and solid frontend...

Posted Date: 20 Sep 2025

Senior Software Engineer, Fullstack (Government)

analysis, integrating with model inference pipelines. System Architecture: Architect and optimize high-throughput, service...; active clearances are a plus. We’re looking for a Full Stack Engineer with strong backend expertise and solid frontend...

Posted Date: 19 Sep 2025

Senior Software Engineer - ML Infrastructure

Software Engineer on the Machine Learning Infrastructure team, you will design, build, and operate the systems that power... at the center of this transformation. We build the platforms that enable model developers to experiment, train, deploy...

Company: Plaid
Location: San Francisco, CA
Posted Date: 17 Sep 2025

Senior Software Engineer, Backend

, generation, and embedding, integrated with model inference pipelines. Architect high-throughput, service-oriented backend... video understanding and multimodal AI. About the Role As a Senior Product Backend Engineer at TwelveLabs, you’ll...

Company: TwelveLabs
Location: San Francisco, CA
Posted Date: 07 Nov 2025

Software Engineer, Data Infrastructure - Research

About the Team The Workload team is responsible for designing and running OpenAI's LLM training and inference... this foundation, the Workload team ensures that researchers can focus on advancing model capabilities while we handle the scale...

Company: OpenAI
Location: San Francisco, CA
Posted Date: 20 Sep 2025

Software Engineer, Private Computing

. About the Role We're looking for software engineers to design, build, and scale novel privacy features and infrastructure... across ChatGPT, API, and future consumer devices. This role is based in San Francisco, CA. We use a hybrid work model of 3 days...

Company: OpenAI
Location: San Francisco, CA
Posted Date: 22 Nov 2025

Lead Software Engineering - Salesforce ECommerce Agent

(such as model lifecycle, training/inference trade-offs, and evaluation principles) is essential, as your work will ensure that the... you are not duplicating efforts. Job Category Software Engineering Job Details About Salesforce Salesforce is the #1 AI CRM...

Company: Salesforce
Location: San Francisco, CA
Posted Date: 08 Nov 2025

Senior Machine Learning Engineer - Discovery (ML + Backend Engineering)

Must Have 4+ years of post qualification experience as a professional ML or software engineer, with a proven track record... looking for a Machine Learning Engineer who will design, build, and optimize ML systems that scale to millions of users. You'll work...

Company: Scribd
Location: San Francisco, CA
Posted Date: 11 Dec 2025
Salary: $146,500 per year