Platform Engineer (Cloud, DevOps, SRE)
We’re looking for a Platform Engineer to help us build and scale the infrastructure behind AIUTA’s AI-powered solutions trusted by leading retail players.
You’ll work across modern cloud infrastructure, Kubernetes platforms, CI/CD systems, observability, and GPU-powered AI workloads — partnering closely with engineering and data science teams to improve scalability, reliability, and developer velocity.
This is a highly hands-on role with strong ownership and direct impact on how our platform evolves as we continue scaling our products, infrastructure, and engineering organization.
Key Responsibilities
Architect & Automate: Design, build, and maintain highly scalable, secure, and resilient cloud infrastructure using Infrastructure as Code (IaC) with Terraform.
Streamline Delivery: Build and optimize blazing-fast CI/CD pipelines (GitLab CI, GitHub Actions) to elevate developer experience and deploy containerized services.
Scale AI Workloads: Partner with our AI/ML teams to support high-performance computing infrastructure, optimizing containerized environments (Kubernetes) for GPU/accelerated workloads.
Boost Resilience: Implement modern observability, monitoring, and proactive logging systems (Prometheus, Grafana, OpenSearch/ELK) to maintain high service availability.
Control Costs & Performance: Proactively identify performance bottlenecks and execute cloud cost-optimization strategies without compromising on speed or reliability.
Drive Engineering Culture: Reduce environment drift, advocate for GitOps/DevSecOps best practices, participate in chaos/resiliency testing, and mentor team members as we scale the engineering organization.
What We Are Looking For
Experience: 4+ years of hands-on experience in DevOps, Platform Engineering, or Infrastructure roles, ideally within a fast-growing startup or high-load product environment.
Cloud & Containers: Strong expertise in cloud platforms (AWS or GCP preferred) and deep hands-on experience orchestrating production-grade Kubernetes clusters.
Infrastructure as Code: Proven track record of managing complex environments using Terraform.
Systems & Networking: Solid fundamentals in Linux systems administration, networking, security, and modern web application architectures.
Troubleshooting Mindset: Excellent analytical and debugging skills; a proactive problem-solver who takes ownership of production issues.
Collaborative Spirit: Strong communication skills in English, with the ability to collaborate cross-functionally and document infrastructure patterns clearly.
Nice to Haves
Experience scaling GPU/accelerated hardware (Nvidia/AMD) for AI/ML model inference.
Familiarity with MLOps frameworks or deploying high-performance AI serving stacks.
Experience with configuration management tools (e.g., Ansible) and incident management processes (PagerDuty, disaster recovery planning).
Prior experience working in a fast-paced, Series-A stage startup.
What We Offer
An opportunity to work with cutting-edge AI technology alongside a highly experienced team from leading global tech companies,
Competitive salary,
A collaborative and inclusive team culture where every voice is heard,
Remote-friendly environment,
Flexible working hours.
Published on: 5/26/2026

AIUTA
AIUTA provides cutting-edge, white-label virtual try-on solutions for fashion brands and retailers.
Please let AIUTA know you found this job on Wantapply.com. It helps us to get more jobs on our site. Thanks!




