Gong is an AI-powered SaaS platform for conversation analysis and revenue management. Senior Site Reliability Engineer role based in Tel Aviv, Israel.
Responsibilities:
- Design, build, and maintain scalable, fault-tolerant systems.
- Define and enforce reliability processes, SLOs, SLIs, and SLAs.
- Lead complex incident responses, including on-call rotations and postmortems.
- Drive reliability initiatives, lead cross-team projects, and build automation, tooling, and self-service capabilities.
- Collaborate with engineering, product, and support teams and mentor engineers to promote operational excellence.
Requirements:
- 7+ years of experience in SRE, DevOps, or Production Engineering roles (SaaS environments preferred).
- Deep understanding of distributed systems, failure modes, resiliency patterns, observability, and operating large-scale production services on Kubernetes.
- Hands-on experience building and owning monitoring tools; experienced with CI/CD and infrastructure-as-code tools.
- Solid experience with cloud platforms (AWS preferred). Advantage: experience with Java. Familiarity with Python/Java.
Conditions: Flexible hybrid work model in Tel Aviv, Israel.