Staff Engineer, Datalake Platform

Stripe

Dublin, Ireland

EUR 132k-198k / year + Equity

Posted on Jun 30, 2026

Who We Are

About Stripe

Stripe is a financial infrastructure platform for businesses. Millions of companies — from the world's largest enterprises to the most ambitious startups — use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone's reach while doing the most important work of your career.

About the Team

The Datalake team builds and maintains Stripe's foundational data access and governance infrastructure — the paved path for safe, fast, and compliant access to Stripe's critical big data assets. We serve developers, data engineers, analysts, ML and AI teams, security teams, and business users across the company. The team is in the middle of a significant architectural transition as Stripe grows. We are making Stripe's data lake a first-class citizen of the modern data ecosystem to support our growing scale and diverse workloads.

What Makes This Role Compelling

  • Foundational infrastructure with broad reach: The Datalake team's systems sit in the critical path of nearly every data workload at Stripe. Decisions affect petabytes of data, hundreds of production pipelines, and every engineering team that builds on Stripe's data lake.
  • Active, high-stakes, OSS-aligned architectural transformation: You will lead a multi-year migration to modern, open-source solutions like the Apache Iceberg REST Catalog. This is a technically deep project with real architectural choices at each step, including API design, compute engine integration, the authorization model, and per-table credential vending. Your opinions and technical influence will directly shape how the platform engages with the broader data infrastructure ecosystem.
  • Storage platform ownership with room to define the approach: The team owns the object storage abstraction layer — access control, IAM policy design, lifecycle management, and compliance architecture — but the how is still being written. You'll shape how hundreds of engineering teams interact with petabytes of data, and the decisions you make will stick.
  • At Stripe, you’ll have the scale of a large company and the agency to influence technical strategy and the roadmap.

Responsibilities

  • Architect the unified Iceberg platform: Lead the technical design of a metastore service as it becomes the single source of truth for Iceberg table management across all compute engines — Spark, Trino, Flink, and PyIceberg. Define the API contracts, authorization model, per-table credential vending, and integration patterns that every data pipeline at the company will depend on.
  • Own the metastore migration strategy: Drive the sequencing, backward compatibility story, rollback approach, and cross-team coordination for migrating all remaining Hive Metastore-backed workloads to the new platform. This means coordinating with dozens of consuming teams while keeping production data infrastructure operational at all times.
  • Shape the object storage abstraction: Define the storage abstraction layer — including bucket provisioning, access control policy design, and the developer-facing client libraries that make object storage ergonomic and secure by default. The goal is an abstraction layer that consuming teams can adopt without needing to become storage infrastructure experts themselves.
  • Lead compliance architecture: Partner with security and compliance teams to translate regulatory requirements into durable preventative technical controls — audit logging, access review infrastructure, data segregation, and lifecycle enforcement — built into the platform rather than bolted on.
  • Drive cost and efficiency at petabyte scale: Identify systemic inefficiencies in storage layout, snapshot retention, and data lifecycle, and design automated, self-service tooling that scales without ongoing manual intervention from the team.
  • Set the technical bar: Own critical design reviews, establish standards for reliability, security, and developer experience, and mentor senior engineers through high-stakes architectural decisions. Provide the technical judgment that keeps the platform moving fast without accumulating structural debt.

Who You Are

Minimum requirements

  • 10+ years of professional software engineering experience.
  • Demonstrated track record of designing, building, and operating large-scale distributed storage or data infrastructure systems.
  • Deep experience with object storage (S3, Azure Blob, or equivalent) — including IAM, access control policy design, lifecycle management, and operational practices at petabyte scale.
  • Proven ability to lead complex, multi-quarter infrastructure projects end-to-end, including cross-team dependency management and coordinating migrations across many consuming teams.
  • Strong background in authorization and access control design for distributed data systems.

Preferred requirements

  • Deep expertise in Apache Iceberg — table format internals, the REST Catalog specification, snapshot lifecycle management, compaction, and compute engine integration (Spark, Trino, Flink, PyIceberg).
  • Background in compliance-sensitive infrastructure — SOX, ICFR, or equivalent regulatory frameworks — with an understanding of how audit and access review requirements translate into preventative technical controls.
  • Experience safely executing large-scale data migrations with a strong instinct for sequencing, blast radius reduction, rollback, and data integrity validation.
  • A strong developer experience sensibility: the ability to build abstractions that are ergonomic, well-documented, and actively reduce toil for the engineering teams that depend on your platform.

Office-assigned Stripes in most of our locations are currently expected to spend at least 50% of the time in a given month in their local office or with users. This expectation may vary depending on role, team and location. For example, Stripes in Stripe Delivery Center roles in Mexico City, Mexico, Bengaluru, India, and Dublin, Ireland work 100% from the office. Also, some teams have greater in-office attendance requirements, to appropriately support our users and workflows, which the hiring manager will discuss. This approach helps strike a balance between bringing people together for in-person collaboration and learning from each other, while supporting flexibility when possible.

The annual salary range for this role in the primary location is €132,000 - €198,000. This range may change if you are hired in another location. For sales roles, the range provided is the role’s On Target Earnings (“OTE”) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. This salary range may be inclusive of several career levels at Stripe and will be narrowed during the interview process based on a number of factors, including the candidate’s experience, qualifications, and specific location. Applicants interested in this role and who are not located in the primary location may request the annual salary range for their location during the interview process.

Specific benefits and details about what compensation is included in the salary range listed above will vary depending on the applicant’s location and can be discussed in more detail during the interview process. Benefits/additional compensation for this role may include: equity, company bonus or sales commissions/bonuses; retirement plans; health benefits; and wellness stipends.

Office locations

Dublin HQ

Team

Infrastructure & Corporate Tech

Job type

Full time