Senior Product Manger - Tech, Infrastructure Reliability
Amazon's Fulfillment Technologies & Robotics (FTR) team invites you to spearhead the product vision for a platform ensuring Amazon's fulfillment network never stops — even as we move toward fully self‑governing, zero‑touch operations.
You will own the roadmap for an AI‑powered infrastructure reliability platform preventing, detecting and resolving incidents across thousands of fulfillment sites globally. This is a rare opportunity for a technically deep product leader who can write code, deliver proof‑of‑concepts, and engage as a peer with data scientists and engineers.
Key Job Responsibilities
* Own and drive the multi‑year product roadmap for the Infrastructure Reliability AI‑Ops platform, spanning three strategic programs: zero‑touch incident resolution, associate‑directed work tooling, and predictive failure prevention.
* Define the vision, strategy and success metrics for AI‑powered progressive detection, incident consolidation, self‑growing remediation orchestration and cross‑domain observability capabilities serving thousands of fulfillment sites globally.
* Write code and deliver working proofs‑of‑concept that validate technical hypotheses before committing engineering resources.
* Produce prototypes of multi‑agent reasoning pipelines, novel anomaly‑detection approaches, or stress‑test LLM prompt chains against real incident data.
* Apply deep machine‑learning fundamentals to shape platform detection, consolidation and failure‑reasoning capabilities.
* Engage data scientists on model architecture selections, feature‑engineering trade‑offs and evaluation frameworks, ensuring high‑trust models for production.
* Use AI‑reasoning techniques such as chain‑of‑thought prompting, retrieval‑augmented generation, confidence calibration and evidence accumulation to define progressive confidence about incident severity and failure origin.
* Define the multi‑agent architecture that orchestrates detection, investigation, consolidation, diagnosis and remediation as a coordinated system.
* Translate complex operational and technical requirements into a prioritised backlog, balancing feature depth, platform scalability and autonomous site readiness milestones.
* Track the business case across all three programs, secure ongoing investment and measure performance metrics such as auto‑detection rate, false‑positive rate, consolidation accuracy and remediation success rate.
* Drive cross‑functional alignment and lead executive‑level reviews of program progress, risks, and investment cases.
A Day in the Life
You spend most of your time at the intersection of product strategy and hands‑on technical work. A typical day might start by pulling incident data into a notebook to test a new detection signal, then jumping into a whiteboard session with engineers debating multi‑agent handoff reasoning. You might prototype a diagnostic flow in the afternoon to prove a concept is worth building, and occasionally you will find yourself in the operations center watching real operators resolve network failures.
Benefits
* Medical, Dental and Vision Coverage
* Maternity and Parental Leave Options
* Paid Time Off (PTO)
* 401(k) Plan
Benefits can vary by location, operating hours, employment length and job status such as seasonal or temporary employment.
Compensation by Location
* USA, MA, North Reading: $151,200.00 – $204,600.00 USD annually
* USA, TN, Nashville: $143,700.00 – $194,300.00 USD annually
* USA, TX, Austin: $151,200.00 – $204,600.00 USD annually
* USA, VA, Arlington: $151,200.00 – $204,600.00 USD annually
About the Team
The Infrastructure Reliability team sits within Amazon's Robotics organization, operating as the cross‑domain orchestration layer for a fulfillment network that processes customer orders continuously across thousands of sites. Our mission is simple and purposeful: operations never stop, no matter what breaks. We build the platform that sees across all domains, identifying failures that cascade across team boundaries and coordinating capabilities that domain teams have built to resolve those failures faster than any single team could alone.
Basic Qualifications
* Bachelor's degree
* Experience owning or driving roadmap strategy and definition
* Experience with feature delivery and trade‑offs for a product
* Experience contributing to engineering discussions around technology decisions and product strategy
* Experience managing technical products or online services
* Experience representing and advocating for a variety of critical customers and stakeholders during executive‑level prioritization and planning
Preferred Qualifications
* Experience using analytical tools such as Tableau, Qlikview, QuickSight
* Experience building and driving adoption of new tools
Amazon is an equal‑opportunity employer and does not discriminate on the basis of protected veteran status, disability or other legally protected status.
Posted: May 14, 2026
#J-18808-Ljbffr