Gong is an AI-powered SaaS platform for conversation analysis and revenue management. Senior Site Reliability Engineer role based in Tel Aviv, Israel.

Responsibilities:

  • Design, build, and maintain scalable, fault-tolerant systems.
  • Define and enforce reliability processes, SLOs, SLIs, and SLAs.
  • Lead complex incident responses, including on-call rotations and postmortems.
  • Drive reliability initiatives, lead cross-team projects, and build automation, tooling, and self-service capabilities.
  • Collaborate with engineering, product, and support teams and mentor engineers to promote operational excellence.

Requirements:

  • 7+ years of experience in SRE, DevOps, or Production Engineering roles (SaaS environments preferred).
  • Deep understanding of distributed systems, failure modes, resiliency patterns, observability, and operating large-scale production services on Kubernetes.
  • Hands-on experience building and owning monitoring tools; experienced with CI/CD and infrastructure-as-code tools.
  • Solid experience with cloud platforms (AWS preferred). Advantage: experience with Java. Familiarity with Python/Java.

Conditions: Flexible hybrid work model in Tel Aviv, Israel.