Pega Engineer

Strategic Staffing Solutions
Irving, TX

Pega Engineer



Location
: Irving, TX (Hybrid

)Duration: 18 Month Contac


t
Overvie

w:We are seeking a contractor who combines Pega platform support expertise with Site Reliability Engineering (SRE) experience to support ongoing Production as well as the build-out and steady-state operations of the PFIX environment for Pega Smart Investigation platfor


m.
This role blends software engineering and operational excellence to deliver stable, scalable, and resilient services, reduce manual toil through automation, and drive “shift-left” reliability practic


es.
KEY RESPONSIBILI

  • TIESProduction Operations & Platform Reliabi
  • lityProvide hands-on production support and reliability engineering for Pega Smart Investigate and underlying PFIX environme
  • nts.Ensure high availability and stability of production systems through proactive monitoring, incident response, and operational readiness activit
  • ies.Participate in on-call support and lead technical resolution of high-priority incidents as nee


ded.
Incident, Problem, and Change Manag

  • ementLead and participate in incident response, root cause analysis, and blameless postmor
  • tems.Drive remediation efforts that eliminate recurring incidents and improve mean time to reco
  • very.Partner with change and release teams to ensure safe, well-governed production deploym


ents.
Observability & Moni

  • toringDesign, implement, and enhance monitoring, alerting, logging, and performance metrics to improve early detection and reduce noisy a
  • lerts.Improve end-to-end service visibility using enterprise-standard observability and APM


tools.
Automation & Toil Re

  • ductionIdentify manual, repetitive operational tasks and implement automation to reduce toil and improve relia
  • bility.Support self-healing and auto-remediation capabilities where appro


priate.
Reliability Engineering P

  • racticesDefine and operationalize Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets in partnership with application and platfor
  • m teams.Embed reliability considerations earlier in the software delivery lifecycle (“shift


left”).
Operational Readiness & G

  • overnanceImplement and guide Non-Functional Requirements (NFRs) to ensure performance, resiliency, and scalability expectations
  • are met.Support and enforce Permit-to-Operate and operational readiness standards prior to production


releases.
REQUIRED SKILLS &

  • EXPERIENCEExperience supporting Pega platforms in a production environment, acting as a subject matter expert for platform re
  • liability.Experience in Site Reliability Engineering, DevOps, or platform operati
  • ons roles.Proven experience in production incident management, problem management, and root cause
  • analysis.Proficiency in automation or scripting (Python and/or Java p
  • referred).Hands-on experience with monitoring, logging, and observability tools (e.g., Grafana, Prometheus, Splunk, AppDynamics, Thous
  • and Eyes).Strong communication skills and ability to partner effectively across development, platform, and operati
  • ons teams.Experience in understanding of Kubernetes (i.e.


OpenShift)
General Experi

  • ence/Skills2+ years leading, operating and performing within a
  • n SRE team.Strong soft-skills including written and verbal com
  • munication.Understanding


of AutoSys
// // //