GCP Data Engineer

Pacer Group
Austin, Texas Metropolitan Area

Job Title: GCP Data Engineer (Geospatial Senior Analyst)

Location: Austin, TX (Preferred) OR Remaining Client Locations / Remote

Work Arrangement: Hybrid (Austin, TX) or 100% Remote Eligible

Employment Type: Contract

Duration: 6 months (Based on July 1, 2026 – December 31, 2026 timeline)

Pay Range: $40.00/hr. to $42.00/hr. on W2 | $55.00/hr. to $56.00/hr. on C2C

Domain: Healthcare / Health Tech Analytics

Application Deadline: June 20, 2026


SKILLS REQUIRED:

Primary (Must-Have):

�6 to 10 years of data engineering experience with a heavy specialization in geospatial processing

�Advanced hands-on proficiency in GCP Dataflow, designing both streaming and batch pipelines for spatial and temporal datasets

�Deep expertise in GCP BigQuery, including data modeling, geospatial functions (GIS), and high-scale query optimization (partitioning/clustering)

�Proven knowledge of Apache Kafka architectures, including topic design, consumer group configurations, and real-time event streaming

�Strong Python 3 programming capabilities to build modular, testable, and reusable spatial automation scripts

�Practical experience using PySpark to clean, enrich, and transform large-scale distributed datasets


Secondary (Good to Have):

�Solid working knowledge of core geospatial mechanics: coordinate systems, spatial joins, and map projections

�Experience with cloud optimization, resource allocation monitoring, and cost-efficient storage architecture design

�Background building automated data quality gates, validation checks, and reproducible pipeline unit tests

�Clear technical communication skills with a track record of collaborating with cross-functional product owners and data consumers


POSITION OVERVIEW

We are seeking a Senior GCP Data Engineer / Geospatial Analyst with 6 to 10 years of specialized experience to design, build, and optimize high-throughput geospatial data pipelines. Operating in a flexible remote or hybrid capacity out of Austin, TX, this role focuses on supporting map-based analytics, geospatial positioning systems, and location-aware digital products. The ideal candidate will combine rigorous cloud data engineering practices (GCP, Kafka, PySpark) with advanced spatial analysis techniques to process complex real-time location events, sensor feeds, and temporal datasets into highly actionable business intelligence.


ROLES & RESPONSIBILITIES

�Design, deploy, and maintain robust geospatial data ingestion pipelines using GCP Dataflow to reliably process high-volume, real-time spatial records.

�Develop and optimize complex analytical queries in GCP BigQuery, converting raw geographic telemetry into highly curated tables, analytical models, and dashboards.

�Architect and manage Apache Kafka event streams to capture real-time location feeds, tracking systems, and sensor movements with minimal processing lag.

�Implement highly scalable PySpark data transformation matrices to clean, isolate anomalies from, and aggregate massive spatial datasets.

�Author enterprise-grade Python 3 scripts to automate complex data workflows, orchestrate cron-based data jobs, and deliver repeatable analytical insights to stakeholders.

�Tune and structure BigQuery data lakes by defining optimized schema clustering, geometric data indexing, and temporal partitioning strategies for multi-terabyte datasets.

�Support incident mitigation workflows, performing deep-dive root-cause investigations on pipeline performance drops or spatial data degradation, and engineering permanent programmatic fixes.


BENEFITS

Medical | Dental | Vision | 401(k)


EEOC Compliance

We are an equal opportunity employer, and all qualified applicants will receive consideration for employment.


DISCLAIMER

AI Usage Policy: Pacer Group uses AI to assist in screening applications. Final hiring decisions are made by human recruiters based on qualifications and experience.

// // //