Role: Lead Engineer – Java
Location: Chicago, IL
Employment Type: Contract
Mandatory Skills – Java, Spark, Data Engineering, and Cloud (AWS / Azure / GCP; GCP preferred)
Job Description
- 8–12 years of experience in production-grade software engineering and data engineering, with a strong foundation in Java-based application development.
- Demonstrated progression from hands-on Java development roles into data engineering and platform-level responsibilities.
- Extensive experience designing, building, and operating Spark-based batch data processing systems using Java in cloud or distributed environments.
- Proven experience working on shared data platforms that support multiple downstream analytics use cases, reporting systems, and business functions.
- Strong exposure to enterprise data processing workloads, including large-scale structured and semi-structured data handling with performance and reliability considerations.
Key Expertise
1. Technical Skills
- Deep hands-on experience with Java as the primary programming language, including building scalable and maintainable applications for data processing and backend systems.
- Strong working knowledge of Apache Spark using the Java API, with the ability to design and implement robust batch processing pipelines.
- Experience working with cloud-based data platforms (GCP preferred), including services such as BigQuery and Cloud Storage, or equivalent services in other cloud environments.
- Strong understanding of data storage formats and access patterns, including Parquet, Avro, and JSON, with a focus on optimizing data layout for analytical workloads.
- Experience implementing CI/CD practices for data engineering solutions, including source control strategies, automated deployments, and environment promotion across development, testing, and production.
- Solid understanding of data security fundamentals, including secure data access patterns, credential management, and compliance-aware data handling.
2. Architecture & Design
- Ownership of solution and platform-level architecture for batch data processing systems built on Java and Spark.
- Strong foundation in data modeling principles, including normalization, denormalization, and analytics-oriented schema design based on consumption patterns.
- Proven experience designing and enforcing layered data architectures, including clear separation of raw, processed, and curated data layers.
- Ability to define and document architecture standards, design guidelines, and reusable frameworks for ingestion, transformation, and consumption layers.
- Experience reviewing technical designs across teams to ensure alignment with scalability, performance, and maintainability requirements.
- Strong understanding of integration patterns across upstream source systems and downstream consumers such as BI tools and reporting platforms.
3. Big Data & Analytics
- Deep understanding of OLTP and OLAP concepts, and the implications of analytical workloads on storage layout, compute sizing, and query performance.
- Proven experience designing and optimizing ETL / ELT frameworks capable of handling large volumes of structured and semi-structured data with predictable performance and reliability.
- Strong expertise in Spark performance tuning techniques, including partitioning strategies, join optimizations, caching decisions, and query execution analysis.
- Experience supporting enterprise analytics use cases by delivering high-quality, well-modeled datasets suitable for consumption by BI and reporting tools.
- Ability to diagnose and resolve complex data issues related to:
  - Latency
  - Data correctness
  - Schema drift
  - Pipeline failures in production environments
4. GenAI Adoption & Automation
- Practical experience evaluating and adopting AI-assisted development tools to improve developer productivity, code quality, and delivery velocity within data engineering teams.
- Understanding of how AI-driven techniques can be applied to data engineering use cases, such as anomaly detection, data quality monitoring, and operational insights.
- Ability to assess emerging GenAI capabilities pragmatically and integrate them into the platform in a controlled, value-driven manner without compromising stability or governance.
5. Observability & Performance Optimization (Good to Have)
- Experience defining observability practices for data platforms, including monitoring of pipeline health, job execution metrics, and operational alerts.
- Strong hands-on ability to troubleshoot distributed Spark workloads, identify performance bottlenecks, and drive corrective optimizations.
- Exposure to data lineage, metadata management, or operational dashboards to improve platform transparency and operational maturity.
Responsibilities
- Own and evolve the solution architecture for Java- and Spark-based batch data platforms supporting multiple enterprise use cases.
- Act as a technical authority for data engineering design decisions, ensuring consistency, scalability, and long-term maintainability of the platform.
- Guide Technical Leads and Senior Engineers on architecture, design patterns, and implementation best practices through design reviews and hands-on collaboration.
- Ensure platform implementations meet defined non-functional requirements, including performance, reliability, security, and cost efficiency.
- Collaborate closely with enterprise architecture, cloud, and security teams to align platform design with organizational standards and constraints.
- Support delivery planning, technical estimation, and risk assessment for complex data engineering initiatives.
- Continuously assess platform gaps and drive improvements in architecture, tooling, and engineering practices.
Skills & Competencies
- Strong architectural judgment with the ability to balance immediate delivery needs against long-term platform sustainability.
- Excellent communication skills to articulate complex technical concepts to both engineering teams and senior stakeholders.
- Ability to operate effectively in ambiguous environments and make well-reasoned technical decisions.
- Proven capability to mentor and elevate the technical maturity of data engineering teams.