Infrastructure Engineer

Brahma Consulting Group
Fremont, CA

About the Role

We are hiring an Infrastructure & Reliability Engineer for a high-growth startup in the Bay Area. In this role, you will own the systems that help the engineering team ship secure, reliable software at speed. You’ll be a hands-on contributor who can think strategically about infrastructure while also digging into implementation details that improve developer productivity, application performance, and production stability.


Key Responsibilities

  • Own release infrastructure and support secure end-to-end deployment processes from code commit to published artifact.
  • Architect, implement, and maintain scalable production environments using Kubernetes, Helm, and Terraform.
  • Improve developer experience by removing bottlenecks such as slow builds, flaky tests, and deployment friction.
  • Define and evolve reliability standards, including SLIs/SLOs and observability practices.
  • Identify and resolve performance bottlenecks across application code, databases, and caching layers.
  • Manage infrastructure as code across cloud environments, primarily AWS and Google Cloud.
  • Participate in incident response and post-incident reviews, with a focus on durable fixes and prevention.


Qualifications

  • 4–10+ years of professional experience in DevOps, Infrastructure, or Site Reliability Engineering.
  • Strong coding skills in Python and TypeScript for automation and reliability tooling.
  • Hands-on experience with Kubernetes, Terraform, and cloud ecosystems such as AWS or GCP.
  • Familiarity with modern observability tools and service reliability practices.
  • Strong debugging skills and the ability to distinguish real regressions from flaky tests or intermittent failures.
  • Comfortable working in a fast-paced startup environment with a broad technical scope.
