Overview

We're looking for a Senior DevOps Engineer skilled in AI and Copilot to join our team in London, United Kingdom in a hybrid working mode (3 days onsite).

In this role, you will build and maintain a cloud-based index platform, manage secure and cost-effective AWS environments, and drive observability and resilience through chaos and disaster recovery testing. You will implement FinOps strategies to optimize cloud spending and leverage your expertise in CI/CD practices to design and optimize a best-in-class AWS microservices architecture. This is a hands-on role requiring deep experience in DevOps tools and practices, including GitLab, Terraform, AWS, Kubernetes/EKS and Ansible.

Responsibilities

  • Build and maintain secure, resilient and cost-optimized AWS cloud infrastructure using DevOps best practices
  • Design and manage CI/CD pipelines for Java and Python microservices architectures
  • Implement infrastructure as code using Terraform with modular and reusable patterns
  • Manage Kubernetes (EKS) clusters at scale for workload orchestration and container lifecycle
  • Integrate security controls and continuous compliance into the DevOps lifecycle
  • Drive observability using tools like Datadog to monitor infrastructure and application performance
  • Implement chaos engineering and disaster recovery strategies to ensure resilience and business continuity
  • Optimize cloud cost management in line with FinOps principles

Requirements

  • Strong hands-on experience with AWS services including EC2, S3, RDS, Lambda, VPC, IAM, CloudWatch and EKS
  • Expertise in infrastructure as code using Terraform
  • Proven experience with CI/CD tools such as GitLab CI or Jenkins for microservices architectures
  • Strong knowledge of Kubernetes (EKS) and containerization with Docker
  • Solid understanding of cloud security practices and compliance frameworks
  • Experience with observability platforms such as Datadog or Prometheus
  • Familiarity with cloud financial management and FinOps principles
  • Track record of designing and executing disaster recovery strategies

Nice to have

  • Experience with AI-assisted tools such as GitHub Copilot, GitLab AI or Terraform AI
  • Knowledge of scripting languages (Python, Bash or similar) for automation
  • Familiarity with Ansible for configuration management
  • Exposure to emerging DevOps trends and cloud-native technologies

UK

  • EPAM Employee Stock Purchase Plan (ESPP)
  • Protection benefits including life assurance, income protection and critical illness cover
  • Private medical insurance and dental care
  • Employee Assistance Program
  • Competitive group pension plan
  • Cyclescheme, Techscheme and season ticket loans
  • Various perks such as free Wednesday lunch in-office, on-site massages and regular social events
  • Learning and development opportunities including in-house training and coaching, professional certifications, over 22,000 courses on LinkedIn Learning Solutions and much more
  • If otherwise eligible, participation in the discretionary annual bonus program
  • If otherwise eligible and hired into a qualifying level, participation in the discretionary Long-Term Incentive (LTI) Program
  • *All benefits and perks are subject to certain eligibility requirements