Sr. SRE, Practices team (Gitaly Embedded)
The GitLab DevSecOps platform empowers 100,000+ organizations to deliver software faster and more efficiently. We are one of the world’s largest all-remote companies with 2,000+ team members and values that foster a culture where people embrace the belief that everyone can contribute. Learn more about Life at GitLab.
An overview of this role
As a Site Reliability Engineer at GitLab, you are responsible for keeping all user-facing services and other GitLab production systems running smoothly. Our SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our operating environments and the GitLab codebase.
As an SRE in the Practices Team, you will collaborate on creating an operating standard (practice) across SRE while being embedded in the Gitaly Team to support them with building and maintaining systems to ensure that the Git data storage tier of GitLab instances, and GitLab.com in particular, is reliable, secure and fast. As an SRE you will support this team with your cluster provisioning expertise and have a significant impact on their success.
What you’ll do
- Work embedded in the Gitaly team. You will report to the Practices team’s manager but will participate in the Gitaly team’s processes and meetings to make sure you are successful.
- Effectively manage and maintain the infrastructure for remote development
- Manage application deployments using Helm charts and the Kubernetes ecosystem.
- Optimize the infrastructure on Google Cloud Platform. Utilizing GKE, Terraform, Ansible, and other tools.
- Track the health, performance, and availability of the remote development infrastructure using observability and monitoring tools
- Participate in an on-call rotation
What you’ll bring
- Strong programming skills - preferably experience in Go.
- A solid understanding of, and experience with, implementing and working with SLI/SLO
- SRE experience in running and operating distributed storage systems
- Google Cloud Platform expertise, specifically around networking, GKE configuration, and scaling
- Experience with Terraform, Ansible, and other tools like Chef.
- Experience with Observability and Monitoring tools
- Production experience with the Kubernetes ecosystem is a bonus
About the team
The Practices Team is a subgroup of the Reliability Team.
Our mission is to ensure the reliability, performance, and availability of GitLab.com by partnering with Stage Groups to ensure that features and services are designed and implemented with reliability in mind. The team collaborates with Stage groups to build, maintain, and improve services and ensure the services' SLO is met as per GitLab.com's availability and performance goals.
How GitLab will support you
Home office support
Please note that we welcome interest from candidates with varying levels of experience; many successful candidates do not meet every single requirement. Additionally, studies have shown that people from underrepresented groups are less likely to apply to a job unless they meet every single qualification. If you're excited about this role, please apply and allow our recruiters to assess your application.
Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote, however some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process.
GitLab is proud to be an equal opportunity workplace and is an affirmative action employer. GitLab’s policies and practices relating to recruitment, employment, career development and advancement, promotion, and retirement are based solely on merit, regardless of race, color, religion, ancestry, sex (including pregnancy, lactation, sexual orientation, gender identity, or gender expression), national origin, age, citizenship, marital status, mental or physical disability, genetic information (including family medical history), discharge status from the military, protected veteran status (which includes disabled veterans, recently separated veterans, active duty wartime or campaign badge veterans, and Armed Forces service medal veterans), or any other basis protected by law. GitLab will not tolerate discrimination or harassment based on any of these characteristics. See also GitLab’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know during the recruiting process.