Finom, a European fintech, is hiring a Senior SRE Engineer to drive the design, implementation and evolution of a Kubernetes-based platform in a multi-cloud environment (primarily GCP, AWS).
Responsibilities:
- Design and operate Kubernetes ecosystem (GKE, multi-cluster) with focus on high availability and zero-downtime operations.
- Own and evolve PaaS strategy using GitOps (ArgoCD) and CI/CD (GitLab) to enable independent deployments.
- Define and implement observability across metrics, logs and tracing (Prometheus, VictoriaMetrics, OpenTelemetry).
- Automate infrastructure with Terraform and standardize resources as code.
- Establish and manage SLOs/SLAs, incident management and error budgets; participate in DR drills and design failover strategies.
- Apply AI-driven approaches to improve operations and automate bottleneck detection.
Requirements:
- Strong hands-on experience managing Kubernetes (GKE preferred) in high-load, multi-cluster production environments.
- Deep experience with GCP (AWS is a plus) and Terraform for large-scale infrastructure.
- Expertise in GitOps (ArgoCD), GitLab CI and Infrastructure-as-Code practices.
- Proven knowledge of observability stacks (Prometheus/Grafana), tracing and logging at scale.
- Ability to design highly available 24/7 systems with automated failover/rollback.
- English proficiency (B2+).
Nice-to-haves & ecosystem:
- Experience with Kafka (Confluent), RabbitMQ, Redis, PostgreSQL.
- Familiarity with Vault, PCI DSS/GDPR/ISO27001 compliance and AI tools for operations.
Conditions & benefits:
- Work in the EU with flexibility for remote or hybrid work across Europe.
- Full-time employment, stock options program, professional development support.
- Additional perks such as the "Work & Swim" program and supportive corporate culture.