Platform Engineer

Cyprus | Relocation | Hybrid | Senior

We are looking for a Platform Engineer to fully own and operate our Kafka platform on on-premise Kubernetes. You will be responsible for the end-to-end lifecycle of our clusters - designing, scaling, and troubleshooting - while also enabling internal teams to use Kafka effectively. Alongside Kafka, you will support our broader ecosystem of Kafka Connect, Flink, and Temporal.io.

The role is based in our Limassol office in Cyprus. If you need to relocate, we offer full relocation support for you and your family to make your move smooth and worry-free.

What you'll actually do

  • Design and deploy new Kafka clusters based on business, SLA, durability, and availability requirements.

  • Operate and maintain high-throughput Kafka and Kafka Connect clusters on on-prem Kubernetes.

  • Maintain adjacent streaming platform components such as Flink and support their reliability and operability.

  • Plan and execute safe upgrades, broker replacements, storage migrations, and cluster rebalancing activities.

  • Manage cross-cluster replication and disaster recovery patterns using MirrorMaker 2.

  • Improve monitoring, alerting, and operational visibility using Grafana, Alertmanager, and PagerDuty.

  • Investigate production issues such as under-replicated partitions, disk pressure, producer latency spikes, consumer lag, partition skew, and client connectivity failures.

  • Work directly with internal users to troubleshoot producer, consumer, schema, retention, and topic configuration issues.

  • Help design resilient multi-cluster and multi-data-center setups to reduce the impact of infrastructure failures.

  • Support the internal Temporal.io platform provided to engineering teams and help keep it reliable, scalable, and easy to consume.

  • Automate routine operations and reduce manual toil using Python and infrastructure tooling.

  • Maintain infrastructure definitions with Terraform and manage delivery workflows through GitLab CI/CD and ArgoCD using GitOps practices.

  • Contribute to platform security, access control, and operational standards across the streaming stack.

Who we’re looking for

  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related technical field.

  • 3+ years of professional experience in DevOps, Platform Engineering, or Site Reliability Engineering.

  • Strong hands-on production experience operating Kafka clusters.

  • Good understanding of partitions, replication factor, ISR, leader elections, retention, compaction, topic configuration, and broker-level behavior.

  • Experience with Kafka capacity planning, scaling, balancing, upgrades, and migrations.

  • Proven troubleshooting experience with distributed systems in production.

  • Strong Kubernetes troubleshooting skills, ideally in on-prem environments.

  • Experience building monitoring and alerting for critical systems and using that data during incidents.

  • Experience with Infrastructure as Code and GitOps workflows.

  • Ability to work directly with product development teams and investigate user-reported issues.

  • Clear communication and strong technical judgment.

  • Strong critical thinking and attention to detail.

  • Advanced English for day-to-day work and business communication.

  • Decision-making skills and the ability to adapt to change.

What we offer along the way

  • Competitive salary and annual performance bonus

  • Full relocation support for you and your family — flights, housing, visas, and legal assistance included

  • Top-tier health insurance with full family coverage — medical, dental, vision, mental health — plus life insurance for peace of mind

  • Unlimited learning opportunities: external courses, English lessons, career and leadership development

  • Education allowance covering school and kindergarten fees

Published on: 5/8/2026

Exness

At Exness, we are not just a leading trading broker—we’ve reimagined what it takes to be a leader. With 40M+ trades a day and 2,000+ people across 13 countries, we combine scale, care, and real tech to make trading better for 1M+ clients worldwide. 
