Senior site reliability engineer-June 2024
Remote
Jun 21, 2026
About Senior site reliability engineer

  Job Description Summary

  The Site Reliability Engineering team is responsible for the reliability and performance of tools worldwide. We obsess over availability by building tools and engineering new systems to automate our platform. We are software engineers with full visibility and influence across the entire stack.

  We create tooling and deliver and operate customer environments, both on-prem and in the cloud, using cloud-native technologies.

  Job Description

  Roles and Responsibilities

  In this role, you will:

  • Develop automated solutions to predict and address potential problems before they result in a service interruption

  • Oversee and adapt monitoring and alerting systems

  • Collaborate with all GE business units worldwide, providing a bastion of technical expertise

  • Identify potential process improvements across the entire engineering organization

  • Define and drive architectural enhancements into systems to mitigate potential failure points

  • Provide impact assessments and mitigation plans for changes going into the production environment

  • Investigate the root causes of severe and systemic outages and identify corrective actions

  • Establish performance baselines and capacity thresholds, correlate events, and define monitoring/alerting criteria

  • Provide technical coaching and direction to more junior teammates

  Education Qualification

  Bachelor's Degree in Computer Science or “STEM” Majors (Science, Technology, Engineering and Math) with advanced experience.

  Desired Characteristics

  Technical Expertise:

  • Excellent knowledge of Linux system internals

  • Excellent knowledge of Kubernetes for cluster management of containers

  • Strong analytical and problem-solving skills

  • Experience with all stages of an agile software development lifecycle (CI/CD)

  • Familiar with large-cluster deployment tools (Helm, Kustomize)

  • Demonstrated ability to script around repeatable tasks (Go, Ruby, Python, Bash)

  • Experience with developing cloud-native applications (High Availability)

  • Able to dive into any level of a modern internet service (schedulers, containers, Linux kernel, caching, object storage, distributed filesystems, RDBMS, NoSQL, etc.)

  • Comfortable with network troubleshooting (tcpdump, routing, proxies, firewalls, load balancers, etc.)

  • Able to troubleshoot and debug applications (C, Java, Go)

  • Proficient in configuration management systems (Chef, Terraform, Ansible, Puppet, Salt)

  • Experience with configuring, customizing, and extending monitoring tools (Sensu, Grafana, Prometheus, Graphite, Splunk, etc.)

  • Experience deploying and managing infrastructure on public clouds (AWS, GCP, or Azure)

  • Comfortable using Git on the command line

  Leadership:

  • Influences through others; builds direct and "behind the scenes" support for ideas. Preemptively sees downstream consequences and effectively tailors influencing strategy to support a positive outcome.

  • Able to verbalize what is behind decisions and downstream implications. Continuously reflects on successes and failures to improve performance and decision-making. Understands and encourages change when needed.

  • Proactively identifies and removes project obstacles or barriers on behalf of the team.

  • Able to navigate accountability in a matrixed organization.

  • Self-starter; communicates and demonstrates a shared sense of purpose. Learns from failure.

  Personal Attributes:

  • Critical thinker; able to quickly adapt to changing environments

  • A hacker or tinkerer at heart

  • Risk taker, not afraid to think outside the box or challenge the status quo

  • Emotionally intelligent; able to influence up and out and to work independently

  • Must be a team player with a strong desire to win

  • Passionate about continuously learning

  • Highly organized and efficient; able to balance competing priorities and execute accordingly

  • Strong oral and written communication skills.

  Additional Information

  Relocation Assistance Provided: No

  #LI-Remote - This is a remote position

Copyright 2023-2026 - www.zdrecruit.com All Rights Reserved