About Smart Working
At Smart Working Solutions, we believe your job should not only look right on paper but also feel right every day. This isn’t just another remote opportunity - it’s about finding where you truly belong, no matter where you are. From day one, you’re welcomed into a genuine community that values your growth and well-being.

Our mission is simple: to break down geographic barriers and connect skilled professionals with outstanding global teams and products for full-time, long-term roles. We help you discover meaningful work with teams that invest in your success, where you’re empowered to grow personally and professionally.

Join one of the highest-rated workplaces on Glassdoor and experience what it means to thrive in a truly remote-first world.

About the Role
We are looking for a Senior DevOps Engineer with strong cloud infrastructure expertise (GCP, AWS, or Azure), advanced Terraform skills, and deep experience owning MongoDB Atlas in production. You will design, operate, and scale the cloud-native infrastructure powering a large enterprise SaaS platform.

This is a high-ownership, architecture-level role, not just execution. You will be responsible for designing and running production systems end-to-end, with a particular focus on database infrastructure (MongoDB Atlas) and scalable cloud environments.
You will work in a fully remote, async-first environment, collaborating closely with engineering teams to ensure high availability, performance, and operational excellence across multiple environments.

Responsibilities

  • Design, implement, and operate cloud-native infrastructure on GCP, AWS, or Azure using Terraform.
  • Take full ownership of MongoDB Atlas in production, including:
      • Cluster architecture and scaling
      • Replication and high availability
      • Backup and disaster recovery strategies
      • Performance tuning and query optimisation
      • Security and access control
  • Architect and manage containerised and serverless workloads (e.g., Cloud Run, ECS, Kubernetes, or equivalents).
  • Design and operate event-driven systems (e.g., Pub/Sub, SQS/SNS, EventBridge, or equivalents).
  • Build and maintain CI/CD pipelines with a strong focus on automation, reliability, and scalability.
  • Develop reusable Infrastructure as Code (Terraform) modules and manage multi-environment setups.
  • Collaborate with engineering teams on system architecture, scalability, and performance optimisation.
  • Implement robust monitoring, alerting, and observability across distributed systems.
  • Lead incident response and root cause analysis, driving long-term improvements.
  • Own infrastructure decisions end-to-end, including architecture, cost optimisation, and performance.
  • Document systems, create runbooks, and establish best practices.
  • Mentor engineers and promote DevOps best practices across the organisation.

Requirements

  • 6+ years of DevOps / Infrastructure Engineering experience in production environments.
  • Strong hands-on experience with at least one major cloud provider (GCP, AWS, or Azure), provisioned with Terraform.
  • Advanced experience with Terraform (modularisation, remote state, multi-environment setups).
  • Proven experience designing and operating scalable cloud infrastructure.
  • Mandatory: Deep MongoDB Atlas experience in production, including:
      • Cluster configuration and scaling
      • Replication and failover
      • Backup and recovery strategies
      • Performance tuning and indexing
      • Security and access management
  • Experience with containerised environments (Docker, Kubernetes, or equivalents).
  • Experience building and maintaining CI/CD pipelines.
  • Solid understanding of event-driven architectures.
  • Strong knowledge of monitoring, logging, and observability in distributed systems.
  • Ability to operate at an architect/owner level, not just execute tasks.
  • Strong communication skills and ability to work in a remote, async-first team.

Nice to Have

  • Experience working across multiple cloud providers.
  • Experience implementing GitOps practices.
  • Familiarity with advanced observability tools (Datadog, APM, tracing).
  • Experience supporting high-scale SaaS platforms.
  • Interest in platform engineering and developer experience.

At Smart Working, you’ll never be just another remote hire.

Be a Smart Worker - valued, empowered, and part of a culture that celebrates integrity, excellence, and ambition.