Staff Site Reliability Engineer

Satine Technologies
Atlanta, GA

About the Role

You'll be doing real SRE work: building infrastructure, maintaining pipelines, debugging production issues, improving observability, and making the systems under your care more reliable over time. You'll own significant portions of the platform's implementation and day-to-day operation.

This is not a ticket-queue role. You'll have real ownership over platform components and be expected to bring your own judgment about what needs improving and how.

What You'll Do

- Build and operate Kubernetes clusters and cloud infrastructure
- Own and improve CI/CD pipelines - build reliability, deployment safety, rollback capability
- Implement and maintain observability: metrics, logging, alerting, dashboards
- Write Terraform and other IaC to provision and manage cloud resources
- Participate in on-call rotation and lead incident response for platform issues
- Identify and drive reliability improvements - SLO gaps, toil reduction, capacity issues
- Document what you build so the team can operate and extend it

What We're Looking For

Required:

- 5+ years of SRE, platform engineering, or DevOps experience
- Strong Kubernetes - you've operated clusters in production, not just deployed workloads to them
- Terraform or equivalent IaC at real scale
- Solid Linux fundamentals - you can debug a system-level issue, understand network behavior, read a flame graph
- Experience with at least one major cloud platform (AWS, Azure, or GCP)
- US citizenship or Lawful Permanent Resident status (Public Trust eligibility required)

Paths In - You Might Be a Fit If You:

- Have been doing solid SRE work at a startup, product company, or agency and want to work on systems that matter beyond the business metric
- Have been the SRE generalist on a small team - you've done everything and want to go deeper on platform reliability specifically
- Are a strong infrastructure engineer who has been growing into SRE responsibilities and wants to formalize that transition
- Have commercial cloud experience and want to bring those skills somewhere the work has real stakes

Helpful but Not Required:

- Experience with Kafka or event-driven architectures
- Observability stack experience: Prometheus, Grafana, ELK, or similar
- Familiarity with security or compliance frameworks (FedRAMP, NIST 800-53, SOC 2, or similar)
- GitOps experience with tools like ArgoCD or Flux

About Satine Technologies

Our mission is to protect the institutions that underpin free society from cyber threats. We're a small, mission-driven team that works on problems that matter - from offensive security testing for hospitals and banks to building capabilities for national security missions.

We invest in people who invest in themselves. This isn't a body shop. You'll work with a team that takes pride in technical craft and cares about developing the people who join us.

Benefits

- Health insurance with vision, dental, and HSA
- Life insurance (100% employer-funded)
- 401(k) with 4% match
- Flexible PTO

To all recruitment agencies: Satine Technologies does not accept agency resumes.
