Title: Lead Cloud Engineer
Location: 100% remote
Duration: 12+ months contract
Interview: MS Teams
Job Description:
We are seeking a Lead Cloud Engineer, AI Enablement, to join our AI Platform
Acceleration team. The AI Platform Acceleration team will enable AI democratization through
apps & services. You will collaborate with cross-functional teams of data scientists, research
scientists, software engineers, and product leads to understand business requirements,
identify opportunities for AI integration, and ensure our platforms enable development of
scalable and robust AI systems. This team will also engage with third-party vendors to
enable speed, scale, and efficiency.
This role is heavily focused on platform operations. General areas of focus are deployment &
infrastructure, CI/CD pipelines, observability, SLAs, incident management, user onboarding
and other related operational components.
RESPONSIBILITIES
- Manage and enhance deployment of infrastructure using Terraform
- Manage and enhance CI/CD pipelines using GitHub Actions
- Interact daily with Azure and Azure services
- Troubleshoot issues related to API deployments in AKS
- Ensure accurate observability of solutions deployed in AKS using Datadog or other related observability tools
- Define and implement SLAs and incident management procedures
- Independently define, prioritize, and execute project tasks and plans to deliver cloud-related infrastructure and solutions
- Document work and solutions appropriately
- Engage with other technology and infrastructure teams as necessary to complete tasks
- Participate routinely in team on-call rotation during business hours
QUALIFICATIONS, SKILLS, AND EXPERIENCE
- 4-year degree in a technology-related discipline or equivalent work experience
- 5+ years of experience with public cloud technologies (Azure or Google Cloud Platform preferred), including demonstrated networking and security focus
- Google Cloud Platform Associate Cloud Engineer and/or Microsoft Azure Administrator Associate certifications preferred
- 5+ years of experience with container technologies (Docker, Kubernetes, Helm)
- 5+ years of experience with cloud automation tools (Terraform)
- 5+ years of experience with SDLC and working with Agile development teams
- Experience with AI-related concepts a plus (RAG, fine-tuning, agent frameworks, LLMs, etc.)
- Ability to manage small to medium-sized IT-related projects, solving related problems and working to tight deadlines while under pressure
- Strong interpersonal and communication skills with demonstrated experience leveraging these skills with technical teams and non-technical business units
- Desire to learn new technology and grow across different areas of technology
- Demonstrated ability to prioritize own workload with multiple responsibilities and adaptability to changes in those priorities