Home
/
Comprehensive
/
Sr SDE, EC2 HealthAnalytics, dbrown Team
Sr SDE, EC2 HealthAnalytics, dbrown Team-May 2024
Seattle
May 15, 2026
ABOUT AMAZON
Our mission is to be the world’s most customer-centric company.
10,000+ employees
Technology
VIEW COMPANY PROFILE >>
About Sr SDE, EC2 HealthAnalytics, dbrown Team

  Description

  EC2 Health Analytics Team is responsible for Classification, Measurement and Analysis of failure events across the EC2 fleet to improve AWS fleet reliability and improve customer experience.

  As part of the EC2 HA team, you will work on highly scalable tools and software services to measure fleet health, identify failure patterns and generate automated health reports. You will work with partner teams to improve existing failure classifications and create new failure classifications. You will use data science techniques to identity spikes in failures across the fleet. You will work to ensure that the failures patterns are root caused and fixed to ensure a healthy AWS fleet. You will drive innovation and development of new tools and services to cover new operational and health metrics.

  Key job responsibilities

  Designing and developing cutting edge highly reliable and scalable distributed systems.

  Delivering quality features on-time and on-budget and execution against project plans and delivery commitments.

  Working with team members to manage the day-to-day development activities, participate in designs, design review, code review, and implementation.

  Engaging and working with customers and dependencies to ensure a quality delivery.

  Mentoring other engineers

  Maintaining current technical knowledge to support rapidly changing technology, always on a look out for new technologies and work with the team in bringing in new technologies.

  A day in the life

  You will use data analytics and various large data sets to efficiently detect and root cause EC2 server and instance failures

  You will exercise the highest bar for security in both code and operations.

  Our customers rely on timely availability of the quality and reliability data. You will generate and provide reliability reports to our customers in a timely manner and incorporate customer feedback in improving our reports and dashboards

  About the team

  We are looking for top engineers to join a talented, innovative team to help us monitor and drive improvements of one of the largest server fleets in the world. The team is focused on intelligent monitoring, forecasting and machine learning models to improve EC2 reliability, availability and flexibility

  We are open to hiring candidates to work out of one of the following locations:

  Seattle, WA, USA

  Basic Qualifications

  5+ years of non-internship professional software development experience

  5+ years of programming with at least one software programming language experience

  5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience

  Experience as a mentor, tech lead or leading an engineering team

  Bachelor's degree

  Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby.

  Proficiency in Computer Science fundamentals such as object-oriented design, data structures, algorithm design, problem solving, and complexity analysis

  Preferred Qualifications

  Experience with PowerShell (preferred), Python, Ruby, or Java

  Experience working in an Agile environment using the Scrum methodology

  Master in Computer Science or related field preferred

  Experience working with Linux operating systems

  Experience with architecting high scale systems

  Experience with developing cloud technologies

  Candidate must be able to work with a minimum of technical supervision and supplemental engineering support, while responding efficiently to multiple program priorities.

  Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

  Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $134,500/year in our lowest geographic market up to $261,500/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. Applicants should apply via our internal or external career site.

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Vice President, Program and Project Management I
Reference #: 48023Project/Program Management - IC3Participates in or leads a wide array of activities associated with project planning and management to ensure that projects are completed on time, wi
Security Compliance Specialist, Amazon Stores Security
Description Are you interested in driving exceptional security for customers? Do you see information security as a business enabler? Amazon’s Stores Security organization is seeking an experienced Se
Sr SDE, EC2 HealthAnalytics, dbrown Team
Description EC2 Health Analytics Team is responsible for Classification, Measurement and Analysis of failure events across the EC2 fleet to improve AWS fleet reliability and improve customer experien
Software Engineer II, AWS Platform Cloud Operations
Description The Amazon Web Services (AWS) Change Management is a core Systems Manager feature. Our team simplifies the way you request, approve, implement, and report on operational changes to your a
After Hours RN (5pm- 8am)- Sign on Bonus $5000
Description Position at Lifespark Lifespark is a complete senior health company headquartered in St. Louis Park, Minnesota. Since 2004, we've been helping seniors stay healthy, navigate their health
Team Member - Food Champion
Work today, get paid today? Yes!! Apply and learn how! Hospitality Restaurant Group (Taco Bell) is looking for Food Champions who love serving customers and want to further their professional careers
Structural Engineer - Seismic SME
DescriptionLJB Inc. is a national engineering firm that provides civil and structural engineering, as well as geospatial, safety and environmental services. Our diverse expertise, client base, and ge
Engineering Technician VI (EPFSS)
AI Signal Research, Inc. (ASRI) is recruiting for the Engineering Prototype Fabrication Support Services (EPFSS) Task Order at NSWC Dahlgren Division. Education: High School Diploma or GED Months/Yea
Certified Nursing Assistant
$1000 Sign On Bonus for Part time $20-$24 per hour based on experience + Shift and Weekend Differentials where applicable Part-Time C.N.A. Skilled Care: All shifts Days TBD Are you dedicated to provi
LPN Shared Services/PRN Physician Services
Southern Tennessee Regional Health System has an opportunity for you to join our team. A Joint Commission accredited hospital, Sothern Tennessee Regional Health System serves the patients of south-ce
Copyright 2023-2026 - www.zdrecruit.com All Rights Reserved