Infrastructure Quality Engineer, Infrastructure Reliability & Quality (IRQ)
AWS Elemental
Infrastructure Quality Engineer, Infrastructure Reliability & Quality (IRQ)
Job ID: 2969678 | Amazon Asia-Pacific Resources Private Limited (Singapore)
DESCRIPTION
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Our AWS Infrastructure Reliability & Quality (IRQ) engineering team provides engineering support for our data center infrastructure equipment (Air Handling Unit, Switchgear, Breaker, Panel Board, UPS, Transformer, Generator, ATS etc.). As a member of this team you will be proactively driving quality and reliability risk identification, assessment and mitigation for data center equipment. You will also be responsible for root cause analysis of critical equipment failures, supplier process breakdown and drive continuous improvements to improve datacenter availability for AWS customers. You will work closely with both internal and external partners including suppliers to define product specifications, risk identification plans and mitigations. Internally you will collaborate with AWS Engineering, Procurement, Construction, Commissioning, Operations and Field Engineering teams. Externally you will manage supplier qualification, quality and reliability monitoring, supplier issue resolution and supplier development and continuous improvement initiatives that span the product lifecycle. You must have can-do attitude, be ownership minded, independent, action- and results-oriented to succeed in our open collaborative environment.
Key job responsibilities
- Develop, implement and maintain equipment quality and reliability roadmaps by collaborating with engineering, operations, and procurement teams.
- Define, monitor and achieve the correct quality/reliability performance targets for each equipment.
- Verify AWS quality standards are met at suppliers through in-person and remote audits.
- Establish and monitor end-of-line and incoming inspection/first article inspection plans.
- Support supplier and equipment qualification and assessment processes in support of procurement teams including issue resolution.
- Collaborate globally with suppliers to resolve field issues through Root Cause Analysis and corrective actions. Escalate complex failure investigations to AWS Senior/Principal Engineers.
- Develop and support suppliers with product improvement initiatives and Key Performance Indicators (KPI). Provide a feedback mechanism from suppliers to internal teams to resolve joint quality issues.
- Support internal AWS teams in New Product Development (NPD) initiatives including Failure Mode and Effect Analysis (FMEA) of design and manufacturing processes.
- Ensure AWS products meet or exceed industry standards for initial quality and long-term reliability performance.
- Analyze product design assumptions and AWS operational requirements to identify and mitigate equipment performance risks.
- Drive Continuous Process Improvement strategy through identification of new qualification criteria, test requirements, preventative maintenance checkpoints or specification to improve overall equipment resilience
- Successfully handle concurrent projects, sometimes in multiple geographical regions.
- Travel required, both international and domestic, approximately 30-50%
About the team
About AWS
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
AWS values curiosity and connection. Our employee-led and company-sponsored affinity groups promote inclusion and empower our people to take pride in what makes us unique. Our inclusion events foster stronger, more collaborative teams. Our continual innovation is fueled by the bold ideas, fresh perspectives, and passionate voices our teams bring to everything we do.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
BASIC QUALIFICATIONS
- Bachelor's Degree in Electrical, Mechanical, Manufacturing Engineering or similar related field.
- 6+ years of industry experience in quality/reliability engineering
- 4+ years of direct interaction with suppliers including technical Failure Analysis and Root Cause Analysis.
PREFERRED QUALIFICATIONS
- MS or PhD in Electrical, Mechanical, or Manufacturing Engineering or similar related field.
- 5+ years of work experience in quality/reliability risk identification and assessment from component to system level applying analytical, experimental and statistical approaches to evaluate product design and manufacturing quality/reliability levels.
- Experience with managing proactive, effective, and frugal quality/reliability strategies throughout product design, manufacture and deployment stages. Experience with data center operations and infrastructure equipment (Air Handling Unit, Switchgear, Breaker, Panel Board, UPS, Transformer, Generator, ATS etc.).
- Experience with modern manufacturing processes, ISO-9000, quality control plans, and problem-solving methodologies. Experience with accelerated life testing, stress analysis and finite element analysis.
- Proficiency in the development of data, dashboards, and reports, including data cleansing and analysis.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Job details
- SGP, Singapore