Observability Engineer
inDrive
Other Engineering
Limassol, Cyprus
Posted on Monday, June 10, 2024
Principal Site Reliability Engineer, Limassol
We are looking for a Principal Site Reliability Engineer.
Responsibilities
- Improving and supporting observability tools
- Improving the incident management process
- Maintaining a 99.99% SLA for the product
- Introducing SRE practices to development teams
Qualifications
Must have:
- Experience with observability tools: Prometheus-like TSDB, ELK/EFK/Loki, Jaeger
- Experience adopting observability tools within a company
- Experience in troubleshooting problems in production
- Solid experience with Kubernetes (including working with various operators)
- Experience with any incident management tool (PagerDuty, Opsgenie, etc.)
Nice to have:
- Experience working with AWS
- Experience building an SRE function within a company
- Development experience: Python/Go
Conditions & Benefits
- Relocation to company offices in Cyprus or Kazakhstan
- Modern MacBook Pro and other equipment necessary for work
- Unlimited opportunities for professional and career growth, regular external and internal training from our partners
- Personal growth programs in which we set goals and move towards them together
- The chance to join an international team of professionals, and simply good people, who together are creating one of the coolest success stories in the global IT industry