Overview

We are improving how we run and ship services in AWS and are hiring a Senior DevOps Engineer to advance our Terraform- and Kubernetes-based platform. You will enhance CI/CD, incident response, and observability while keeping security and reliability at the forefront. Apply today.

Responsibilities

  • Plan and manage scalable cloud environments to support business operations
  • Automate deployment workflows and configuration management tasks
  • Monitor system health and performance to ensure uptime and reliability
  • Investigate and remediate issues in distributed systems
  • Coordinate with development teams to improve and streamline CI/CD pipelines
  • Optimize infrastructure as code practices for efficiency and consistency
  • Maintain security standards and protocols across all environments
  • Provide operational support for database, messaging, and storage platforms
  • Roll out and manage observability solutions for logging, monitoring, and alerting
  • Respond to incidents during on-call rotations as required

Requirements

  • Minimum of 3 years of professional experience in DevOps or a related engineering discipline
  • Advanced capability with Amazon Web Services to administer cloud infrastructure
  • Hands-on skill in Bash scripting for automation and system operations
  • Demonstrated experience delivering and maintaining CI/CD pipelines
  • Solid background in Kubernetes for orchestration and cluster management
  • Proven ability in observability and troubleshooting distributed systems
  • Hands-on experience with Terraform to implement infrastructure as code
  • English proficiency (written and spoken) at B2+ level or higher

Nice to have

  • Experience with Amazon Aurora for relational database management
  • Familiarity with AWS Lambda for serverless application development
  • Understanding of Amazon API Gateway for API management
  • Experience with Amazon CloudFront for content delivery
  • Skills in Amazon CloudWatch for monitoring and logging
  • Hands-on work with Amazon Elastic Kubernetes Service (EKS)
  • Familiarity with Amazon Managed Grafana and Amazon Managed Service for Prometheus for observability
  • Experience with Amazon OpenSearch Service for search and analytics
  • Knowledge of Amazon RDS and Amazon S3 for database and storage management
  • Experience using Argo CD for GitOps workflows
  • Understanding of Azure DevOps for CI/CD and project coordination
  • Skills in Fluent Bit and OpenTelemetry for log and trace collection
  • Experience with PowerShell and Python for scripting and automation

Benefits

  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn