All roles

MLOps Engineer (Relocation to Serbia)

Remote · USA Full-time New today

This engagement is focused on building an internal AI platform that enables developers to ship AI-powered services efficiently. Scope includes model connectivity, prompt testing and evaluation, monitoring/observability, and the underlying AI infrastructure layer.

The objective is to improve DevEx and reduce time-to-market for AI features.

Location: Serbia (relocation support available), Croatia, Poland, Portugal

Tasks

  • Build and operate the AI platform infrastructure enabling developers to ship LLM-based services faster.
  • Implement and maintain Kubernetes-based runtime environments (incl. AKS) for AI workloads.
  • Manage infrastructure as code with Terraform (modules, environments, CI/CD automation).
  • Support LLM workflows: RAG, agents, prompt experimentation, evaluations, and deployment patterns.
  • Integrate and operate tooling such as Azure AI Foundry, LiteLLM, Langfuse, MLflow.
  • Orchestrate pipelines using Kubeflow Pipelines and/or Argo Workflows (build, deploy, evaluate).
  • Improve platform reliability and observability (monitoring, logging, tracing, cost/perf signals).
  • Collaborate closely with developers to streamline DX (APIs, templates, docs, golden paths, automation).

Requirements

  • Strong hands-on experience with Kubernetes in production (preferably AKS).
  • Solid Terraform expertise (IaC best practices, multi-env setups).
  • Practical experience supporting ML/LLM workloads in a platform or DevOps/MLOps context.
  • Proficiency in Python for automation, scripting, and supporting APIs/evaluation tooling.
  • Understanding of CI/CD, release processes, and production-grade operations.
  • Ability to work under tight timelines and deliver pragmatically.

Nice to Have

  • Experience building internal developer platforms or “paved roads” for engineering teams.
  • Familiarity with LLM evaluation frameworks, prompt testing workflows, and LLM observability.
  • Exposure to RAG architectures, vector databases, and agentic patterns.
  • Experience with Kubeflow, Argo, and ML lifecycle tooling.

Engagement Type

  • Long-term B2B contract.

Team

  • You will join a team of 5, with 3 AI Platform Engineers being added.

Location / Timezone

  • Remote work from Croatia, Poland, Portugal, and Serbia.
  • European working hours.
  • Occasionally available for meetings up to 10:00 AM PST (US overlap).
Apply To This Job

Related roles

MLOps Engineer (Relocation to Serbia)

Remote · USA Full-time

Technischer Vertriebsingenieur (m/w/d) Metall-Vakuumimprägnierung

Remote · USA Full-time

Sales Operations Specialist (w/m/d)

Remote · USA Full-time

Remote LPN/ LVN Clinical Care Coordinator

Remote · USA Full-time

Remote CMA/ RMA Patient Care Coordinator

Remote · USA Full-time

VP, Product Marketing and GTM

Remote · USA Full-time

Licensed Psychologist to conduct part-time Assessments

Remote · USA Full-time

Polygraph Examiner

Remote · USA Full-time

Lead Polygraph Examiner

Remote · USA Full-time

Director of Communications

Remote · USA Full-time

Technology Project Manager - Remote Opportunity with Southwest Airlines in Denver - Competitive Salary $26/Hour

Remote · USA Full-time

Business Development Representative

Remote · USA Full-time

HR Business Partner, Performance & Culture- Wharton Human Resources & People Operations

Remote · USA Full-time

Sr. Immigration Paralegal (VAWA & T Visa) – Bilingual (English/Spanish) – 100% Remote

Remote · USA Full-time

South Carolina Licensed Telemedicine Physician

Remote · USA Full-time

Experienced Inbound Energy Customer Care Specialist – Remote Customer Service Representative for arenaflex

Remote · USA Full-time

Work From Home

Remote · USA Full-time

Experienced Entry-Level Remote Chat Agent – Global Customer Support and Auction Services

Remote · USA Full-time

Experienced Customer Service Representative - Amazon Remote Work Opportunity

Remote · USA Full-time

Experienced Customer Service Representative (Part-Time) - arenaflex Remote Careers

Remote · USA Full-time