site reliability engineer
CRED
This job is no longer accepting applications
See open jobs at CRED.See open jobs similar to "site reliability engineer" General Catalyst.Software Engineering
Bengaluru, Karnataka, India
Posted 6+ months ago
what is CRED?
CRED is an exclusive community for India’s most trustworthy and creditworthy individuals, where the members are rewarded for good financial behavior. CRED was born out of a need to bring back the focus on a long lost virtue, one of trust, the idea being to create a community centered around this virtue. a community that constantly strives to become more virtuous in this regard till they finally scale their behavior to create a utopia where being trustworthy is the norm and not the exception. to build a community like this requires a community of its own; a community special in its own way, working towards making this vision come true.
here’s a thought experiment: what do you get when you put a group of incredibly passionate and driven people and entrust them with the complete freedom to chase down their goals in a completely uninhibited manner? answer: you get something close to what we have at CRED; CRED just has it better.
here’s what will be in store for you at CRED once you join as a site reliability engineer.
what you will do?
- work with large-scale data engineering infrastructures and data native technologies such as Spark/EMR, Flink, Apache Pinot, Kafka, Airflow, Tableau, NiFi, Metabase, and Databricks
- work with Observability tools like Loki, Victoriametrics and Datadog
- showcase understanding of best practices in running and managing self managed platforms on kubernetes, ensuring complete observability, HA, and self-served CI/CD system
- foster cross-team collaboration, building and maintaining relationships with customer teams, architects, and engineering teams to jointly achieve key deliverables ensuring production scalability and stability
- demonstrate strong troubleshooting and debugging skills, including conducting post-incident reviews, root cause analysis, and triaging product or system issues to analyze sources, impacts, and resolve them for service operations and quality
you should apply if you:
- have experience in SRE/DevOps, with a focus on distributed cloud native systems design, observability, container orchestration, maintenance, and troubleshooting
- experience with public cloud platforms, preferably AWS
- have hands-on experience in Kubernetes/EKS, building and operating large-scale production systems with stringent SLOs & SLAs
- are proficient in modern DevOps programming and scripting languages: Shell, Python, GoLang
- demonstrate experience in Linux Infrastructure management and systems administration with Linux
- have experience with Infrastructure as code & Configuration management using tools like Terraform, Helm, Ansible
- have expertise in Continuous Integration and Deployment (CI/CD) and release orchestration using Jenkins, ArgoCD, GitHub Actions, etc.
- have expertise in big data systems like Spark/EMR, Flink, Airflow etc.
- have expertise in pubsub solutions like Kafka.
- are familiar with system observability tools such as ELK/EFK, Prometheus, Grafana, alert manager, Sysdig, Datadog, Victoria Metrics, etc.
- have exceptional interpersonal, verbal, and written communication skills
how is life at CRED?
working at CRED would instantly make you realize one thing: you are working with the best talent around you. not just in the role you occupy, but everywhere you go. talk to someone around you; most likely you will be talking to a singer, standup comic, artist, writer, an athlete, maybe a magician. at CRED people always have talent up their sleeves. with the right company, even conversations can be rejuvenating. at CRED, we guarantee a good company.
hard truths: pushing oneself comes with the role. and we realise pushing oneself is hard work. which is why CRED is in the continuous process of building an environment that helps the team rejuvenate oneself: included but not limited to a stacked, in-house pantry, with lunch and dinner provided for all the team members, paid sick leaves and a comprehensive health insurance.
to make things smoother and to make sure you spend time and energy only on the most important things, CRED strives to make every process transparent: there are no work timings because we do not believe in archaic methods of calculating productivity, your work should speak for you. there are no job designations because you will be expected to hold down roles that cannot be described in one word. since trust is a major virtue in the community we have built, we make it a point to highlight it in the community behind CRED: all our employees get their salaries before their joining date. a show of trust that speaks volumes because of the skin in the game.
there are many more such eccentricities that make CRED what it is but that’s for one to discover. if you feel at home reading this, get in touch
This job is no longer accepting applications
See open jobs at CRED.See open jobs similar to "site reliability engineer" General Catalyst.