Site Reliability / Infrastructure Engineer
Other Engineering
New York, NY, USA
USD 150k-275k / year + Equity
About Medal
Millions of gamers capture, share, and discover new games on Medal, the largest platform for gaming clips. Our mission is to design products that make sharing and connecting around gaming seamless and fun, and build a place where brands and game publishers can reach high-quality gamers to grow their products. Medal's data powers General Intuition, the frontier research lab that recently raised $320M at a $2.3B valuation led by Khosla Ventures with participation from General Catalyst, Eric Schmidt, and Jeff Bezos.
The Role
Medal's infrastructure handles billions of clips, video ingestion pipelines, and social features at a scale most engineers never get to touch. The work centers on reliability, incident response, scaling, and making sure our infrastructure keeps up with our growth. You'll own the on-call rotation, drive postmortems, and work directly with engineering teams to meet their infrastructure needs. The right person probably came through startups and scale-ups, has been in the room when things broke at 2am, has scaled databases under pressure, and knows the difference between a durable fix and a patch that buys you a week.
What We're Looking For
Infrastructure-as-code: Strong fluency in Terraform, with real experience owning infrastructure-as-code at scale
Elasticsearch depth: Hands-on experience running ES for user-facing features, not just as a log sink
GCP depth: You know it maybe a little too well, from Kubernetes, VPC, IAM, and Cloud Logging to the managed services ecosystem
Database scaling: Deep, hands-on experience scaling and sharding relational databases (MySQL, Postgres) in production
Incident response instincts: You can work a P0 calmly, communicate clearly under pressure, and run a postmortem that prevents recurrence
CI/CD: You've worked with GitHub Actions in a production environment
Communication (crucial!): You flag issues clearly and rapidly during incidents and lead/write actionable postmortems
Experience at startups: You are comfortable in an environment of rapid growth where scaling up is a priority
Great judgment: You know the difference between a durable, sustainable fix and a patch that buys you a week
Our Stack
Electron, React, Redux, Styled Components & other modern web-based technologies
C# and C++ for native Windows recording & more
Swift for iOS, Kotlin for Android
Java, Redis, RabbitMQ, and Kubernetes for the backend
Terraform, Salt, GitHub Actions, and CircleCI for IaC and CI/CD