Finom, a European fintech, is hiring a Senior SRE Engineer to drive the design, implementation and evolution of a Kubernetes-based platform in a multi-cloud environment (primarily GCP, AWS).

Responsibilities:

  • Design and operate Kubernetes ecosystem (GKE, multi-cluster) with focus on high availability and zero-downtime operations.
  • Own and evolve PaaS strategy using GitOps (ArgoCD) and CI/CD (GitLab) to enable independent deployments.
  • Define and implement observability across metrics, logs and tracing (Prometheus, VictoriaMetrics, OpenTelemetry).
  • Automate infrastructure with Terraform and standardize resources as code.
  • Establish and manage SLOs/SLAs, incident management and error budgets; participate in DR drills and design failover strategies.
  • Apply AI-driven approaches to improve operations and automate bottleneck detection.

Requirements:

  • Strong hands-on experience managing Kubernetes (GKE preferred) in high-load, multi-cluster production environments.
  • Deep experience with GCP (AWS is a plus) and Terraform for large-scale infrastructure.
  • Expertise in GitOps (ArgoCD), GitLab CI and Infrastructure-as-Code practices.
  • Proven knowledge of observability stacks (Prometheus/Grafana), tracing and logging at scale.
  • Ability to design highly available 24/7 systems with automated failover/rollback.
  • English proficiency (B2+).

Nice-to-haves & ecosystem:

  • Experience with Kafka (Confluent), RabbitMQ, Redis, PostgreSQL.
  • Familiarity with Vault, PCI DSS/GDPR/ISO27001 compliance and AI tools for operations.

Conditions & benefits:

  • Work in the EU with flexibility for remote or hybrid work across Europe.
  • Full-time employment, stock options program, professional development support.
  • Additional perks such as the "Work & Swim" program and supportive corporate culture.