Description
Hybrid 50% ON SITE, requires candidates local to the area within Westlake, TX / DFW greater metro
We are seeking a hands-on DevOps Engineer to support and modernize cloud infrastructure for a large, mission‑critical platform in the financial services industry. This role is strongly focused on core DevOps engineering — building, automating, securing, and operating cloud-native infrastructure at scale, including build and automate functions within cloud infrastructure for a global set of platforms. The role requires strong skills with Python automation, work with Kubernetes, Azure, Terraform, ARM templates, IAM, networking, and strong Python scripting. You will design CI/CD, manage AKS and containerized workloads, and implement observability and security best practices. You will collaborate with engineers to deliver reliable, scalable services.
This role will require deep DevOps experience and a passion to gain experience / exposure to AI or AI/ML based automation concepts as they exist in the broader environment, however - this role is not AI‑focused. Candidates with primarily configuration‑based tool experience (e.g., Azure DevOps only) will not be a fit unless they also bring strong foundational DevOps engineering skills and a desire to perform more AI driven functions over the long term.
Due to client requirements, applicants must be willing and able to work on a w2 basis. For our w2 consultants, we offer a great benefits package that includes Medical, Dental, and Vision benefits, 401k with company matching, and life insurance.
Rate: $70.00 to $77.00/hr. w2
Responsibilities
Main functions of the role:
- Kubernetes (production experience required)
- Microsoft Azure Cloud
- Infrastructure as Code: Terraform (required), ARM Templates / Azure Bicep
- Strong scripting skills, primarily Python (plus Bash)
- Identity & Access Management (IAM) and RBAC
- Networking fundamentals (cloud and on‑prem concepts)
Core Responsibilities
- Design, build, and maintain cloud-native infrastructure using Infrastructure as Code
- Deploy, manage, and scale containerized applications on Azure Kubernetes Service (AKS)
- Write and maintain Python and Bash scripts for automation, provisioning, and operational tasks
- Build and maintain CI/CD pipelines with a strong emphasis on automation and reliability
- Manage Kubernetes clusters, including:
- Deployments, scaling, upgrades, and node pools
- Helm charts and release management
- Cluster networking and RBAC
- Implement and maintain secure cloud networking, identity, and secrets management
- Support monitoring, logging, and observability for distributed systems
- Troubleshoot complex infrastructure and deployment issues across environments
- Partner closely with application engineers to ensure reliable, scalable delivery
DevOps & Platform Engineering Expertise
Linux, Scripting & Automation
- Advanced Linux experience
- Bash and Python scripting for automation and tooling
Containers & Kubernetes
- Dockerfile creation and image lifecycle management
- Container security best practices
- Kubernetes cluster operations, scaling, and networking
- Helm chart development and maintenance
CI/CD
- Designing and maintaining pipelines (e.g., Jenkins)
- Writing automation logic using Groovy, YAML, shell, and Python
- Artifact management and environment promotion
- Deployment strategies such as blue/green and rolling deployments
Infrastructure as Code
- Terraform for infrastructure provisioning
- Azure ARM Templates / Bicep
- Version control and repeatable deployments
Networking & Security
- Core networking concepts: DNS, TCP/IP, routing, load balancing
- Azure networking: VNets, subnets, NSGs, firewalls
- Identity and secrets management (Key Vault, RBAC, least privilege)
Monitoring & Observability
- Metrics, logs, and traces using tools such as Datadog, Splunk, or OpenTelemetry
- Proactive alerting and performance monitoring
Azure‑Focused Cloud Skills
- Deploying and managing AKS clusters, including autoscaling and upgrades
- Experience deploying workloads from CI/CD pipelines into AKS
- Hands‑on experience with:
- Virtual Machines, App Services, Functions
- Storage accounts and backup/DR fundamentals
- Service Bus, Redis / managed cache, Blob Storage, Key Vault
- Strong understanding of high availability and scaling patterns
- Azure governance basics: resource groups, policies, and management groups
- Azure Monitor, Log Analytics, and Application Insights
Experience Requirements
- 6+ years of experience in DevOps, Cloud Engineering, or Platform Engineering
- Proven experience supporting production systems at scale
- Strong troubleshooting and root‑cause analysis skills
- Experience working in regulated or enterprise environments preferred
Nice To Have
- Knowledge of Docker and container security; experience with Kubernetes platforms such as OpenShift or Rancher (preferred).
- Experience building DevSecOps functions and secure code execution and isolation (preferred).
- Experience with RAG systems, vector databases, and LLM integrations (preferred).
- Familiarity with languages such as Java, Go, JavaScript, and shell (preferred).
- Cloud platform certifications in AWS or Azure (preferred).
Education Requirements
- Bachelor’s degree or equivalent in a technology-related field such as Computer Science or Engineering.
- Azure certification (preferred).
- AWS certification (preferred).