Senior Data Engineer

Persistent Systems
Irving, TX

About Persistent

We are a trusted Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate what’s next. Our offerings and proven solutions create unique competitive advantage for our clients by giving them the power to see beyond and rise above.

We are experiencing tremendous growth, with $566 million in revenue in FY21, representing 12.9% year-over-year growth. Along with that growth, we onboarded over 3,000 new employees in the past year, bringing our total employee count to over 15,000 people located in 18 countries across the globe.

At Persistent, our values are more than a list of ideals to improve our corporate image. We’re dedicated to building an inclusive culture that reflects what’s important to our employees and is based on what they value. As a result, 95% of our employees approve of the CEO and 83% recommend working at Persistent to a friend.



About Position: Experienced Senior Data Engineer (12+ Years) to support large scale data platform modernization initiatives within a regulated banking environment.

The role focuses on designing and building enterprise-grade in-house frameworks, supporting high-volume batch and CDC-based incremental processing using Cloudera platform, and enabling ongoing Google Cloud Platform (GCP) modernization efforts


About Position

Role: Senior Data Engineer

Location: Irving, TX / Wilmington, DE (Onsite)

Hire Type : FTE/CTH


What You'll

Apache Spark (PySpark and/or Scala) in large-scale production environments

Cloudera Hadoop ecosystem (HDFS, Hive, YARN, Spark on Cloudera)

Strong SQL expertise with complex transformations, performance tuning, and reconciliation logic

Enterprise RDBMS experience with Oracle and MS SQL Server

Batch ingestion, incremental ingestion, and CDC processing patterns

CDC concepts and tooling (tool-agnostic: Golden Gate, Debezium, or equivalent)

Data merge, deduplication, watermarking, checkpointing, and SCD handling

Google Cloud Platform services including Dataproc , Composer and Dataplex

Hybrid on prem to cloud data architecture and migration patterns

Metadata-driven framework development and data quality validation techniques

CI/CD pipeline implementation using enterprise tooling (GitHub Actions, Jenkins, DevOps)

Git-based development workflows, code reviews, and automated testing practices

Experience using Copilot or similar AI-assisted development tools safely and effectively in enterprise environments

Logging, monitoring, alerting, and operational readiness practices

Secure coding, access control, and compliance-aware development

Documentation of design artifacts, runbooks, and operational procedures


Expertise You'll :

Apache Spark (PySpark and/or Scala) in large-scale production environments

Cloudera Hadoop ecosystem (HDFS, Hive, YARN, Spark on Cloudera)

Strong SQL expertise with complex transformations, performance tuning, and reconciliation logic

Enterprise RDBMS experience with Oracle and MS SQL Server

Batch ingestion, incremental ingestion, and CDC processing patterns

CDC concepts and tooling (tool-agnostic: GoldenGate, Debezium, or equivalent)

Data merge, deduplication, watermarking, checkpointing, and SCD handling

Google Cloud Platform services including Dataproc , Composer and Dataplex

Hybrid on prem to cloud data architecture and migration patterns

Metadata-driven framework development and data quality validation techniques

CI/CD pipeline implementation using enterprise tooling (GitHub Actions, Jenkins, DevOps)

Git-based development workflows, code reviews, and automated testing practices

Experience using Copilot or similar AI-assisted development tools safely and effectively in enterprise environments

Logging, monitoring, alerting, and operational readiness practices

Secure coding, access control, and compliance-aware development

Documentation of design artifacts, runbooks, and operational procedures


Benefits:

Competitive salary and benefits package Culture focused on talent development with quarterly promotion cycles and company-sponsored higher education and certifications

Opportunity to work with cutting-edge technologies

Employee engagement initiatives such as project parties, flexible work hours, and ‘Long Service awards Annual health check-ups as well as insurance

Group term life insurance Personal accident insurance

Mediclaim hospitalization insurance for self, spouse, two children, and parents


Why Persistent is an employer of choice

Technology Innovation: culture of innovation using cutting-edge technology to bring value to clients.

Growth and Career Progression: learning opportunities for growth, including quarterly promotion

cycles.

One Persistent Culture: global outlook with diversity and inclusion at it core.

Mental and Physical Wellness: employee health and mindfulness programs

// // //