GPU Optimization Engineer (ML Infrastructure)

kadence
Santa Rosa, CA

Kadence Talent is partnered with an AI x Quantum start-up, looking for a GPU Optimization/ML Infrastructure Engineer to make it easy for researchers to run compute-heavy workloads at scale.


You’ll sit between ML and infrastructure, helping scientists move from local experiments to scalable GPU-backed systems across cloud environments.


What You’ll Do

  • Enable GPU access and scaling across AWS/GCP
  • Build simple systems to run and manage compute-heavy workloads
  • Improve speed, reliability, and cost-efficiency of experiments
  • Help transition workflows from Modal → native cloud
  • Support research using tools like Qiskit and PennyLane
  • Reduce friction & make infra invisible and easy to use


What We’re Looking For

  • Experience with AWS or GCP (compute, basic networking)
  • Familiarity with GPU workloads (PyTorch, CUDA, etc.)
  • Strong Python skills
  • Experience with Docker (Kubernetes is a plus)
  • Comfortable working in a hands-on, startup environment


Nice to Have

  • ML/AI infrastructure or training pipelines
  • Distributed compute (Ray, Dask, Spark, etc.)
  • Experience supporting researchers or data scientists


Why This Role

  • Work at the intersection of quantum + ML + infrastructure
  • Build systems from scratch - high ownership, high impact
  • Turn unused compute into real research output


Location: This is an on-site role in Mountain View, CA

Compensation: $250k - $280k+, plus equity

// // //