Sr. SRE-Oracle DBA

Oracle
Seattle, WA

We are constantly pushing to define and develop the end-to-end automation of complex workflows found in day-to-day management of the Recovery Service. This team will keep you learning and, on your toes, delivering mission critical services that our customers depend on.

Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. 

Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. 

Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. 

Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. 

Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop a deep understanding of services and technologies. 

This Position will require being available for Shifts and Weekends. 

Relevant skills include ability to:

Manage, maintain, and optimize mission-critical Oracle database environments. 

This role is responsible for ensuring database availability, performance, security, and recoverability across on-premises and/or cloud-based systems. 

The ideal candidate has strong expertise in Oracle technologies, RMAN backup/recovery, Linux systems, and high-availability architectures such as RAC and Data Guard, with experience in Exadata environments highly preferred.

Required Skills:

  • Must Possess U.S. CITIZENSHIP and ACTIVE TS/SCI W/POLY SECURITY CLEARANCE To Be Eligible for Consideration
  • Willing to work Shifts and Weekends
  • Strong experience with Oracle Database (11g/12c/19c or later) 
  • Expertise in RMAN backup and recovery, including restore and disaster recovery scenarios 
  • Hands-on experience with Oracle RAC and/or Data Guard 
  • Solid Linux/Unix administration skills (RHEL, Oracle Linux) 

  • Shell scripting (Bash) 
  • System performance tools (top, vmstat, iostat)
  • Strong SQL and PL/SQL skills 
  • Experience with performance tuning (AWR, ASH, execution plans) 
  • Knowledge of Oracle architecture (SGA, PGA, redo/undo, etc.) 
  • Experience with database patching and upgrades 
  • Strong troubleshooting and problem-solving skills

 

Preferred But not Required Qualifications

  • Prior SRE experience managing production cloud services.

  • Prior experience in releasing and maintaining cloud services.

  • Production experience managing systems or database environment.

  • Experience with a general-purpose systems language such as:  Python, Perl, Unix Shell and/or database language SQL, PL/SQL,Rust

// // //