Cloud Architect with expertise in Generative AI

GyanSys Inc.
Santa Clara, CA

We are seeking a highly skilled Cloud Architect with expertise in Generative AI, Copilot Studio, and multi‑cloud platforms spanning Azure (including Azure AI Foundry), AWS, and Google Cloud.


Core Responsibilities:


  • Architect end‑to‑end Generative AI solutions, including model serving (vLLM, TGI), API integration, and user interaction layers.
  • Design and implement RAG architecture using vector stores, embeddings, hybrid search, and re‑ranking to embed enterprise knowledge into LLMs.
  • Create agentic systems, enabling multi‑agent collaboration for complex, stateful workflows and reasoning‑driven automation.
  • Develop and govern Copilots in Copilot Studio, including connectors, actions, plugins, DLP rules, environment strategy, and integration with Microsoft 365 and enterprise systems.
  • Leverage Azure AI Foundry (prompt flow, evaluators, safety, model orchestration) to operationalize LLM applications at scale.
  • Evaluate and optimize AI system performance, balancing quality, latency, throughput, cost efficiency, and safety compliance.
  • Implement Responsible AI, security, and HITL (HumanintheLoop) controls, ensuring compliance in regulated environments.‑in‑the‑Loop) controls, ensuring compliance in regulated environments.
  • Produce clear, maintainable documentation for architecture, patterns, and operational processes.


Required Qualifications:

  • 10+ years of experience in cloud architecture or enterprise software engineering.
  • 3+ years of hands‑on experience designing or delivering Generative AI or LLM applications.
  • Proven experience with Azure AI Foundry, Azure OpenAI, and Copilot Studio (actions, connectors, governance, M365 integration).
  • Experience deploying AI solutions on AWS (Bedrock, SageMaker) and/or GCP (Vertex AI).
  • Hands‑on experience with RAG, vector databases (Azure AI Search, Pinecone, OpenSearch, Vertex Matching Engine), embeddings, and hybrid search.
  • Deep understanding of cloud security (IAM/RBAC, Key Vault/KMS, VPC/PrivateLink, token safety).
  • Experience with Kubernetes (AKS/EKS/GKE), containerization, API frameworks (FastAPI, Node.js, .NET), Python, TypeScript, or C#/.NET.
  • Working knowledge of transformer architectures and model adaptation techniques (fine‑tuning, LoRA, prompt engineering).
  • Familiarity with AI Ops / MLOps tools such as Prompt Flow, MLflow, SageMaker Pipelines, or Vertex Pipelines.
  • Bachelor’s/ Masters in Computer Science, Engineering, Information Systems, Data Science, or related field (required).



About GyanSys


GyanSys is a leading global system integrator company supporting enterprise customers worldwide. We specialize in solutions implementations, managed services, and data analytics spanning SAP, Salesforce, Microsoft, and other prime enterprise platforms. Using a mature blended delivery model with over 3,000 consultants, we support over 350 enterprise customers across the Americas, Europe, and APAC.

// // //