Overview
We are seeking a Senior Cloud Engineer (Azure) to provide technical leadership, hands-on implementation and ongoing day-to-day support across Datacenter operations and vendor management. This role ensures effective development, maintenance, support and optimization of key functional areas, including Windows server operations, monitoring, virtualization and cloud-based technologies.
Responsibilities
- Provide support for the global data center environment, including Windows server technologies, VMware, VEEAM and general server/datacenter maintenance
- Deliver immediate 3rd level server support for problems escalated by the Datacenter Operations Team, Service Desk, App/Dev or business users
- Offer general server support and maintenance, implement patching and proactive maintenance plans and develop best practice maintenance plans
- Proactively review monitoring systems, act on alerts and revise thresholds to ensure infrastructure stability
- Perform thorough root cause analysis, troubleshooting techniques and procedures as defined by hardware/software vendors and personal experience
- Maintain, create and update associated build and standard operating procedure documentation, planning and implementing necessary updates for support relevance
- Identify, document, publish and uphold systems policies, standards, procedures, checklists, agreements, diagrams and inventory
- Plan and lead projects and tasks necessary to assess, optimize and maintain enterprise and client systems and infrastructure
- Ensure all support requests, projects and other tasks are reviewed, prioritized and completed in a timely and proficient manner
- Work evenings and weekends as required to support maintenance and project activities
- Develop, test and validate Azure Resource Manager templates/JSON files/BICEP scripts as part of IAC deployments, maintaining source control for change management
- Diagnose, troubleshoot and resolve IAC deployment errors within Azure
Requirements
- 4+ years of equivalent work experience with infrastructure support and operational excellence
- Background in VMware, Windows Server and SAN (NetApp, HP Nimble)
- Proficiency in Microsoft Operating Systems Server 2008-2019, PowerShell and Exchange 2010-2019
- Expertise in Office 365, Active Directory and VEEAM
- Knowledge of SCOM 2012-2019, Networking (LAN and WAN) and DNS/DHCP
- Familiarity with HP Server and Blade class machines
- Insurance/Reinsurance industry experience with an understanding of the terminology, business functions and processes
- Skills in PowerShell scripting to create automation tasks that increase team reliability and efficiency
- Capability to lead infrastructure projects as a technical resource, including oversight of resource planning and detailed execution plans for project and operational work
- Competency in AWS CLI and Gcloud command shells with the ability to create and run scripts
- Hands-on experience managing virtual instances in cloud environments (Azure, AWS, GCP) and configuring Azure resources such as Azure Virtual WAN, Application Gateway, Route Table and Azure Policy
- Flexibility to use PowerShell and C# to create runbooks
- English proficiency at B2 level or higher
Nice to have
- Hands-on experience with VMware vSphere 6.x to act as 3rd tier support for escalated issues; VCP-DCV 6.0/6.7 or later preferred
- Familiarity with backup tools including Veeam Backup and Replication, Cloud Endure and Cloudranger
[GTS] Benefits (generic, except India)
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn