This position focuses on designing and guiding the implementation of advanced containerized platforms tailored for compute-intensive workloads such as large-scale simulations, data processing, and machine learning. You will serve as a technical advisor, helping stakeholders adopt and optimize GPU-enabled orchestration environments.
The role spans the full solution lifecycle, from initial requirements gathering and system design through validation, deployment, and ongoing performance optimization. You will also collaborate with internal engineering and product teams to incorporate user feedback into future platform improvements.
Key Responsibilities:
Act as the primary technical advisor for clients implementing container orchestration platforms for high-performance and AI-driven workloads
Translate business and technical requirements into scalable architecture designs and deployment strategies
Design, deploy, and manage container clusters optimized for hardware acceleration and high-throughput processing
Configure and tune GPU utilization in multi-user environments using techniques such as partitioning, sharing, and workload scheduling
Develop automation tools and extensions (e.g., custom controllers or operators) to streamline infrastructure management
Establish secure, multi-tenant environments with appropriate access controls, policy enforcement, and resource isolation
Lead testing, validation, and benchmarking initiatives to ensure performance, scalability, and reliability
Define integration strategies across compute, storage, and networking layers, including distributed storage and advanced networking configurations
Implement monitoring and observability frameworks to track system performance, utilization, and health metrics
Support infrastructure automation and deployment pipelines using modern configuration and release management tools
Collaborate with cross-functional teams (engineering, data science, IT operations) to ensure successful system integration
Provide technical leadership during onboarding and deployment phases
Stay informed on emerging trends in hardware acceleration, networking, and orchestration technologies, and translate these into actionable recommendations
Represent the organization in technical discussions, workshops, and industry engagements
Basic Qualifications:
Bachelor’s degree or equivalent practical experience in a technical field
Strong experience with container orchestration platforms and cluster management in high-performance environments
Deep understanding of GPU-accelerated computing ecosystems and resource management
Familiarity with orchestration internals, including scheduling, access control, and extensibility mechanisms
Experience integrating distributed storage and high-speed networking into containerized systems
Proficiency in at least one programming language (e.g., Go or Python) for automation or platform extensions
Demonstrated ability to design scalable, secure, and resilient systems
Experience with performance tuning, benchmarking, and workload optimization
Strong communication skills, including the ability to explain complex technical concepts clearly and turn them into practical solutions
Preferred Qualifications:
Experience delivering end-to-end technical solutions from design through deployment and adoption
Exposure to containerized environments for scientific or compute-heavy workloads
Familiarity with infrastructure automation and declarative deployment methodologies
Contributions to open-source ecosystems or collaborative technical projects
Knowledge of emerging trends in compute acceleration, networking, and distributed systems
Advanced degree in a related technical discipline
Relevant certifications in container orchestration or cloud architecture
Additional Information:
Responsibilities may evolve based on organizational priorities and project needs
Candidates must be authorized to work in the applicable region without requiring sponsorship
Compensation & Benefits:
Comprehensive health, dental, and vision coverage options
Retirement savings plan with employer contributions
Paid time off and company-recognized holidays
Parental leave and wellness support programs
Additional optional benefits and employee support resources