Overview
We are improving how we run and ship services in AWS and are hiring a Senior DevOps Engineer to advance our Terraform- and Kubernetes-based platform. You will enhance CI/CD, incident response, and observability while keeping security and reliability at the forefront. Apply today.
Responsibilities
- Plan and manage scalable cloud environments to support business operations
- Automate deployment workflows and configuration management tasks
- Monitor system health and performance to ensure uptime and reliability
- Investigate and remediate issues in distributed systems
- Coordinate with development teams to improve and streamline CI/CD pipelines
- Optimize infrastructure as code practices for efficiency and consistency
- Maintain security standards and protocols across all environments
- Provide operational support for database, messaging, and storage platforms
- Roll out and manage observability solutions for logging, monitoring, and alerting
- Respond to incidents during on-call rotations as required
Requirements
- Minimum of 3 years of professional experience in DevOps or a related engineering discipline
- Advanced capability with Amazon Web Services to administer cloud infrastructure
- Hands-on skill in Bash scripting for automation and system operations
- Demonstrated experience delivering and maintaining CI/CD pipelines
- Solid background in Kubernetes for orchestration and cluster management
- Proven ability in observability and in troubleshooting distributed systems
- Hands-on experience with Terraform to implement infrastructure as code
- English proficiency (written and spoken) at B2+ level or higher
Nice to have
- Experience with Amazon Aurora for relational database management
- Familiarity with AWS Lambda for serverless application development
- Understanding of Amazon API Gateway for API management
- Experience with Amazon CloudFront for content delivery
- Skills in Amazon CloudWatch for monitoring and logging
- Hands-on work with Amazon Elastic Kubernetes Service (EKS)
- Familiarity with Amazon Managed Grafana and Amazon Managed Service for Prometheus for observability
- Experience with Amazon OpenSearch Service for search and analytics
- Knowledge of Amazon RDS and Amazon S3 for database and storage management
- Experience using Argo CD for GitOps workflows
- Understanding of Azure DevOps for CI/CD and project coordination
- Skills in Fluent Bit and OpenTelemetry for log and trace collection
- Experience with PowerShell and Python for scripting and automation
Benefits
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn