Lowe’s Platform team is looking for a highly motivated Lead Operations Engineer to lead the Platform Reliability efforts of our enterprise microservices and DevOps platform. This individual will be responsible for establishing and maintaining a suite of automation, monitoring solutions, processes, and other critical support components to ensure we meet expectations in terms of our availability and performance objectives. In addition to developing great solutions, this individual will serve as a mentor for junior members of the platform reliability team and will also work closely with other site reliability engineers across Lowe’s to continuously drive improvements for both the platform and it’s users.
This individual will utilize data driven results to identify areas of improvement, either technology or processes and have the initiative and skillset to deliver those improvements. This individual will work daily with their Platform colleagues to iteratively contribute in the areas of infrastructure provisioning automation processes, autonomous CI/CD, monitoring, and incident reduction to name a few.
This role relies heavily on a thorough understanding of cloud native principles, distributed systems, and a micro services architecture. This individual should exude a confident understanding and practical approach towards problem solving in this type of environment with a heavy emphasis on automation.
Technology Stack: Kubernetes, Jenkins, Spinnaker, Terraform, Istio, Google Cloud, GoLang, Python, Consul, Vault, Rancher, etc.
The primary purpose of this role is to provide consultation and expert technical advice on Technology infrastructure planning, and engineering for assigned systems. This includes responsibility for translating business requirements and functional specifications into technical requirements and delivering integrated and sustainable designs for complex or high impact infrastructure systems.
This role is also responsible for working with Enterprise Architecture teams to develop the technical direction for infrastructure solutions within various computing environments and systems.
The individual in this role works the hours and schedule required to support the enterprise and may be on call and/or work alternate schedules to meet the needs of the department.
Key Responsibilities:
- Translates requirements and functional specifications into technical requirements that support integrated and sustainable designs for complex or high impact infrastructure systems
- Collaborates with architects and engineers to ensure functional specifications are converted into flexible, scalable, and maintainable system designs
- Works closely with enterprise solution and infrastructure architects to develop and validate system design prototypes that guide architecture design
- Writes, reviews, and validates clear technical specifications and documentation
- Develops or modifies complex infrastructure solutions within designated computing environments
- Develops and validates complex system design prototypes
- Reviews and/or leads the building of complex hardware and/or software configurations and prepares system components for installation to the infrastructure
- Mentors and advises others, sharing an in‐depth understanding of company and industry methodologies, policies, standards, and controls
- Leads effort to deliver the validation and testing of components of highly complex infrastructure systems
- Serves as a technical expert for project teams throughout the implementation and maintenance of assigned enterprise infrastructure systems; defines and oversees the documentation of detailed standards (e.g., guidelines, processes, procedures) for the introduction and maintenance of services
- Provides insight and recommendations to inform the ongoing strategy for health and care of assigned domain(s) and/or platform(s) by identifying and maintaining Technology and business‐level service attributes for associated technologies
- Evaluates new service options, identifies issues and impacts, and makes recommendations on feasibility, cost, and ROI
- Provides mentoring and guidance to more junior level engineers; may provide feedback and direction on specific engineering tasks
- Systems Administration Responsibilities
- Monitors and manages the stability, availability, and performance of enterprise systems and systems across IT domains (e.g., systems, network, databases, storage, security) by analyzing systems to identify problems, trends, and opportunities for improvement
- Responds to escalated support issues for enterprise systems; facilitates advanced diagnosis and troubleshooting when necessary
- Leads implementation of hardware and software changes into environments by performing impact analyses of system changes
- Documents and communicates relevant system performance and procedural information by maintaining documentation of application and/or system diagrams, schemata, and dictionaries
- Leads the development of system administration standard operating procedures
Minimum Qualifications:- Bachelor's Degree in Computer Science, CIS, or related field (or equivalent work experience in a related field)
- 7 years of experience in an IT support environment with technical experience in distributed technologies and systems development
- 3 years of experience in a leadership role with or without direct reports
- 2 years of infrastructure engineering experience working across multiple domains, platforms, or specialty areas
- Experience with systems (both infrastructure and applications) management, diagnostics, and support
- Familiarity with networking, server, and storage technologies
- Familiarity with n‐tier architectures
Preferred Qualifications:- Master's Degree in Computer Science, CIS, or related field
- 4 years of experience leading technical or project teams with or without direct reports in a large matrixed, retail organization
- 3 years of experience in an IT role requiring interaction with senior leadership
- 4 years of experience working in a large matrixed organization, in the Retail industry
- 4 years of experience working with third party IT vendors and/or systems solution providers
- 4 years of IT experience in the retail industry
- 4 years of experience writing technical documentation
About Lowe’s:
Lowe’s Companies, Inc. (NYSE: LOW) is a FORTUNE® 50 home improvement company serving approximately 18 million customers a week in the United States and Canada. With fiscal year 2019 sales of $72.1 billion, Lowe’s and its related businesses operate or service more than 2,200 home improvement and hardware stores and employ approximately 300,000 associates. Based in Mooresville, N.C., Lowe’s supports its hometown Charlotte region and all communities it serves through programs focused on creating safe, affordable housing and helping to develop the next generation of skilled trade experts. For more information, visit Lowes.com.
About Lowe’s in the Community:
As a FORTUNE® 50 home improvement company, Lowe’s is committed to creating safe, affordable housing and helping to develop the next generation of skilled trade experts through nonprofit partnerships. Across every community we serve, Lowe’s associates donate their time and expertise through the Lowe’s Heroes volunteer program. For the latest news, visit Newsroom.Lowes.com or follow @LowesMedia on Twitter.
Lowe’s is an equal opportunity affirmative action employer and administers all personnel practices without regard to race, color, religion, sex, age, national origin, disability, sexual orientation, gender identity or expression, marital status, veteran status, genetics or any other category protected under applicable law.