We are hiring a Senior Site Reliability Engineer to join our team remotely!
Responsibilities:
Defining reliability and resilience standards for infrastructure and application components.
Proactively optimizing redundancy, monitoring, and alerting practices and patterns.
Developing resilient and highly available distributed systems.
Infrastructure as Code development for building cloud tools.
Monitoring systems and services, and providing incident and emergency response to triage and resolve system or client issues.
Managing the application ecosystem and improving platform infrastructure and applications for high reliability, resiliency, performance, and quality.
Maintaining supporting documentation, knowledge articles, and runbooks.
Designing, building, and implementing SRE patterns that adhere to our client's security guidelines and policies.
Requirements:
Bachelor's degree or, in lieu of a degree, three years of specialized training and/or progressively responsible work experience in technology for each missing year of college, in addition to the minimum years of experience required for the role.
Advanced Kubernetes – Must have strong skills in Kubernetes at scale using one of AKS, EKS or GKE. Experience with kubectl and Helm. Experience working with Lens or Rancher.
Observability – Experience setting up tools like Datadog and Splunk to give actionable intel on a microservice environment, including but not limited to synthetics, application performance monitoring, logging, and alerting (PagerDuty/Opsgenie integrations).
Good CI/CD expertise – Experience using Azure DevOps and GitHub Actions.
SCM – Working with tools like GitHub for source code management, as well as experience with branching strategies like GitFlow and trunk-based development.
Strong troubleshooting skills – Be able to move all the way down to code level to give development teams a head start on application issues. Be able to contribute effectively to root cause analysis exercises after problem resolution.
Good communication skills – Active listening, verbal and non-verbal communication, clarity and concision, confidence, open-mindedness, respect.
Good documentation skills – Be able to effectively document automation and technical efforts so that a solution is easy to adapt.
Good collaboration skills – Must be able to work effectively with Scrum/Dev teams with a push/pull (push back and prioritize work pulled in) philosophy in order to manage expectations and contribute to the stability and improvement of the platform.
IaC – Terraform, Pulumi. Preferably has developed modules in the past rather than just used them.
Security – Worked with encryption-at-rest and encryption-in-transit patterns. Experience with tools like Azure Key Vault, HashiCorp Vault, Google Cloud KMS.
Containers – Experience deploying Java (Spring Boot) microservices in Dockerized environments.
Automation – Must be able to identify toil and opportunities to reduce it within the team.
Authentication/Authorization – Familiarity with AuthN/AuthZ schemes like OpenID, OAuth 2.0, SAML.
Scripting and Programming – Experience with Python, PowerShell, Java or Node.js.
Event Driven/Event Sourcing Patterns – Familiarity with distributed event streaming platforms like Kafka, Event Hubs, RabbitMQ and patterns like CQRS.
Datadog – Highly preferred; comparable tools include Splunk, AppDynamics, New Relic, etc.
Any experience setting up monitoring or API monitoring.
About TEKsystems:
We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company.
The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.
If you are an applicant with a disability who requires an accommodation or assistance applying for a position, please call 888-472-3411 or email [email protected]. This is a dedicated line designed exclusively to assist job seekers whose disability prevents them from being able to apply online. Messages left for other purposes will not receive a response.