Why Lytx:
Site Reliability Engineering team is responsible for the availability, reliability, observability and resilience of Infrastructure and related automation of the entire fleet of servers on-prem and the expanding cloud posture of the organization. This team’s responsibilities are very critical to the continuity of business of the organization. If you love crafting new solutions and building a scalable cloud and on-prem infrastructure, then this role may be an excellent match for you!
You’ll get to:
Build tools and frameworks to monitor systems and ensure highest level of uptime on production environments.
Mentor the SRE team on best practices. Develop culture of innovation.
Take lead in enhancing our 24/7 on call and incident management process. Build and maintain Run-books. Contribute to design and documentation of the cloud services and SOPs.
Influence service design by working closely with Architects, DBAs, Developers, DevOps, Data engineers to bake reliability, scalability and cost optimizations early in the development process.
Lead blameless post-mortems. Take ownership of publishing RCA documents for internal and external consumption.
Lead initiatives with Service Owners to define the SLOs and build SLIs to ensure systems are meeting the SLAs.
Research and evaluate new cloud technologies and vendor offerings to enhance product stability and manageability.
Reduce Operational Toil and maintain high degree of automation by adapting IaC first and Gitops principals.
Acquire and maintain significant understanding of Lytx production services to ensure timely resolution of production incidents.
You’ll Need:
8+ years of experience as a SRE in an AWS environment at medium to large scale organization.
6+ years of hands-on experience implementing and managing Observability tools (Prometheus, New Relic, Grafana, etc.)
High degree of proficiency in programing, preferably using Python, groovy and bash.
Hands-on experience managing database technologies (SQL and NoSQL).
5+ years of experience building Infrastructure deployment pipelines using git, Terraform, Helm, Jenkins/JenkinX/ArgoCD etc.
Proficient in designing production environments in AWS cloud using various AWS services (VPCs, EKS, IAM, AMI, EC2, CloudWatch, CloudTrail’s, Control Tower, Guard duty, MSK, S3, Glacier, Gateways, Direct Connects, Route53, RDS, ALBs, Autoscaling etc)
Extensive with Linux systems and various protocols and technologies (HTTP, REST, TCP/IP, SSL, DNS, SMTP, SSH, NTP, Load Balancing, SQL/NoSQL, Message Brokers, Nginx, Vault , ELK etc)
Hands-on experience with Kubernetes and various container and cloud native technologies.
Significant experience in participating, implementing, and managing 24-7 on call rotation for SRE team, creating run books, building support procedures and proactively monitor systems across geographical locations
Ability to work well under pressure within a technically challenging environment.
Preferred Experience:
Hands-on experience managing sophisticated networks in AWS cloud (Direct Connects, Transit gateways, VPNs, BGP, Firewalls, CDNs)
Hands-on experience managing Cloud Databases (AWS RDS, Mongo, Elastic Search, Snowflake)
Certifications: Multiple AWS Certificates, Kubernetes, Linux, Programming, CI/CD.
Benefits:
- Medical, dental and vision insurance
- Health Savings Account
- Flexible Spending Accounts
- Telehealth
- 401(k) and 401(k) match
- Life and AD&D insurance
- Short-Term and Long-Term Disability
- FTO or PTO
- Employee Well-Being program
- 11 paid holidays plus 1 inclusive holiday per year
- Volunteer Time Off
- Employee Referral program
- Education Reimbursement Program
- Employee Recognition and Appreciation program
- Additional perk and voluntary benefit programs
Salary is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. This position is also eligible for an incentive compensation plan. The expected hiring salary for this position is:
$183,500.00 - $232,500.00You’re driven to succeed and so are we. At Lytx, our mission is to protect a world in motion, and we do it by building technology and partnerships that help keep people safe on the road. The way we work is guided by our shared values: Deliver for the customer, Responsibility in every outcome, Innovate with purpose, Velocity with excellence, and Elevate each other.
If you’re looking for meaningful work, a team that challenges and supports you, and the chance to grow your career while making a real impact, we’d love to meet you.
Together, we’re helping make roadways safer and saving lives!
Lytx, Inc. is proud to be an equal opportunity employer. We’re committed to building a diverse and inclusive workforce and do not discriminate based on race, color, religion, sex, sexual orientation, gender identity or expression, gender, genetic information, uniformed service, national origin, age, veteran status, disability, pregnancy, or any other status protected by federal or state law. We are committed to providing reasonable accommodation for candidates with disabilities who need assistance during the hiring process. To request a reasonable accommodation, please email TA@lytx.com. Lytx conducts background checks on applicants who receive a conditional offer of employment in accordance with applicable local, state, federal and regional laws. Qualified applicants with arrest or conviction records will be considered. Background check results may potentially result in the withdrawal of a conditional offer of employment and will be made in accordance with all applicable local, state, federal and regional laws.