This job has been archived and is no longer active.

Monitoring Engineer (DevOps, SRE)

WorldwideRemote

We are forming a new Observability team responsible for the design, evolution, and operation of our company-wide monitoring, logging, and tracing platforms.

This role is a key part of our strategic shift from Datadog to a self-hosted observability stack fully compliant with local regulatory requirements.

Today, we operate multiple observability systems — our legacy stack based on Datadog (logs, metrics, APM, RUM, dashboards, and alerts), and our new Elastic Enterprise platform used for storing audit, access, application, and database logs required by regulators.

On top of Elastic, we've built SOC2 and SIEM integrations with external vendors. You'll be part of a small, highly technical team tasked with defining the future of observability at scale - designing, migrating, and maintaining the platforms that power visibility, compliance, and reliability across the organization.

Key Responsibilities

  • Operate and maintain Elastic Enterprise clusters (index lifecycle, scaling, retention, access control, backups)

  • Design and manage log pipelines (Fluentd / Fluent Bit / Logstash / Beats) for applications, databases, and infrastructure

  • Support and gradually decommission Datadog integrations (logs, metrics, APM, RUM, dashboards, monitors)

  • Collaborate with SRE, DevOps, and Security teams to ensure observability compliance and integration with SOC2 / SIEM

  • Define and maintain SLIs, SLOs, and error budgets, improving service reliability and visibility

  • Implement and optimize metrics and APM solutions (Prometheus, VictoriaMetrics, Mimir, etc.)

  • Automate observability infrastructure with Terraform, Helm, and GitOps tools (FluxCD or ArgoCD)

  • Document architecture, data flows, and best practices

Skills, Knowledge and Expertise

Must-Have

  • 4+ years in DevOps / SRE / Observability roles

  • Solid experience with Elastic Stack (Elasticsearch, Logstash, Kibana, Beats, log shippers)

  • Hands-on experience building or running APM / Metrics platforms (Prometheus, VictoriaMetrics, Mimir, etc.)

  • Exposure to security monitoring / SIEM integration

  • Strong experience in Grafana Labs tools like Loki, Tempo, etc


Nice to Have:

  • Familiarity with Kafka, ClickHouse, or Grafana Loki

  • Knowledge of Kubernetes and cloud platforms (GCP, OCI, or AWS)

Benefits

  • Design and build the next-generation observability platform from the ground up

  • Work on large-scale, multi-cloud environments with a strong focus on compliance, performance, and reliability

  • Collaborate with experienced SRE, Platform, and Security teams

  • Competitive compensation, hybrid work model, and strong career growth opportunities

What we offer:

  • Full-time B2B contract

  • Fully remote setup, work from anywhere in Europe

  • Up to 20% tax allowance

  • 22 paid leave days annually

  • Stock options (ESOP) in a fast-scaling, pre-IPO company

  • Flexi benefits you can use for wellness, travel, or learning

  • Work alongside a high-performing, international engineering team in a global fintech unicorn


Relocation support is available to our hubs in Armenia, Georgia, Serbia, and Spain, including flights, temporary accommodation, and legal setup.

Published on: 2/21/2026

Tabby

Tabbyverified company badge

Tabby is a UAE-based buy now, pay later method that enables customers to purchase products online or in store and split the payment over 4 monthly installments.

Website

See all 13 jobs at Tabby

Similar jobs