Principal Observability Engineer

inDrive

Limassol, Cyprus
Posted 6+ months ago

Principal Site Reliability Engineer, Limassol

We are looking for a Principal Site Reliability Engineer.

Responsibilities

  • Improving and supporting observability tools
  • Improving the incident management process
  • Maintaining a 99.99% SLA for the product
  • Introducing SRE practices to development teams

Qualifications

Must have:
  • Experience with observability tools: Prometheus-like TSDB, EFK/Loki, Jaeger
  • Experience adopting observability tools across a company
  • Experience troubleshooting problems in production
  • Solid experience with Kubernetes (including various operators)
  • Experience with any incident management tool (PagerDuty, Opsgenie, etc.)

Nice to have:
  • Experience working with AWS
  • Experience building an SRE practice within a company
  • Development experience: Python/Go

Conditions & Benefits

  • Relocation to company offices in Cyprus or Kazakhstan
  • Modern MacBook Pro and other equipment necessary for work
  • Unlimited opportunities for professional and career growth, regular external and internal training from our partners
  • Personal growth programs in which we set goals and move towards them together
  • The chance to join an international team of professionals, and simply good people, who together are creating one of the coolest success stories in the global IT industry