LOCATION: Agency located in DC
On/Off site: Offsite 100%
CLEARANCE LEVEL: Public Trust (HSPD-12)
Who YOU are:
As a Senior Cloud Engineer at Plus3 IT Systems, you:
Are passionate about working on cutting-edge, high-profile projects and are motivated by delivering solutions on an aggressive scheduleArent satisfied with status quo, and regularly look for creative ways to solve problems and help your team meet commitmentsAre insatiably curious - you ask why, you explore, and youre not afraid to blurt out your crazy ideaAre a strong self-performer that also flourishes in a team setting; and love the ability to work on multiple clients/projects simultaneouslyLove learning new technologies and sharing them with your teamHave a keen interest in using any and all appropriate tools, especially Cloud-based and Open Source, to solve the problem at handHave strong verbal and written communication skills, due to the dynamic nature of collaborations with customers, vendors, and other engineering teams to solve complex business problems togetherUse your experience and leadership skills to motivate your teammates to deliver high quality results in a fast-paced work environmentAre obsessed with automation, simplicity, and smooth-running systemsWho We Are:
A 2023 Top Work Places recipient ()A company committed to your training, technical experience growth, and well beingUniquely positioned and ready to expand, with your help, into more complex and technically challenging environmentsBuilt upon subject matter expertise supporting the Federal Government with a focus on Cloud Adoption, Cloud Security, Cloud Enabled Data Analytics, Cloud Native Application Development, and DevSecOpsA small business with big partners such as Amazon Web Services, Microsoft (Azure), and Google (Cloud Platform) and other technology partners;ImmutaDatabricksGitLabRedHatMultiple Prime contract holder (GSA, SITE III, JAIC DRAID, and NDE)Always a committed partner with our customers and laser-focused on their missionRESPONSIBILITIES:
As a Site Reliability Engineer on our team, youll lead the team and work with the customer on the development of more robust systems by building a resilient infrastructure. Youll guide the team to build in redundancy, implement monitoring tools, and automate wherever possible. Youll reduce mundane, redundant efforts by scripting routine tasks and automating self-repair. This is your chance to leverage your significant experience with Kubernetes, Ansible, AWS Cloud Migration, Cloudera, and IaC, while overseeing junior engineers and broadening your knowledge base to integrate engineering activities. You will be accountable for maintaining the uptime and performance of the solution in accordance with the agreed-upon service level agreement (SLA), service level objectives (SLOs) and key performance parameters (KPPs).
Understand and leverage cloud automation tools such as CloudFormation and Terraform to provision and manage approved cloud baselinesWork with Infrastructure as Code managed in Git-based repositories and CI/CD toolingProvision cloud accounts for new customers using cloud automation toolsProvide customer support for cloud application ownersHelp cloud customers navigate the cloud governance processAdvise cloud customers on their cloud architectures to ensure they plan properly and are aware of agency-specific constraintsHelp estimate customer cloud costsProvide deep technical troubleshooting for escalated issues that involve the most technically complex or large-scale components and the affected usersDevelop and/or use troubleshooting, monitoring, and reporting tools to analyze the root cause of serious and impactful technical issues and building stable and sustainable solutions and improve entsBe the Technical Lead and seek solutions for customers and drive tasks toward completion - Driving and improving the whole lifecycle of operational readiness - from inception and design through deployment, operation, and refinementEnsure that technical solutions are scalable is paramountDevelop tools, operational improvements, and automated solutions that enable self- service configuration changes, speed deployments, and improve monitoring in support of mission-critical customer-facing applications and environmentsAssisting the software engineering team to ensure accurate monitoring and metrics are built into the applications before deployment to production - Participating in an on-call rotationTroubleshoot issues within the cloud layerDesign, authorize and implement custom permissionsEvaluate, document, and test new cloud services as they become availableExperience with designing, developing, deploying, or delivering using automation techniques or tools, including Ansible or GitKnowledge of cloud and virtualization-based technologies, including Docker, Azure, Amazon Web Services (AWS), VMware, or OpenShiftExperience with troubleshooting Linux, including RHEL, UNIX, networking, scripting and automation, systems administration functions, or application troubleshootingEDUCATION AND EXPERIENCE:
Bachelors degree (Computer Science, MIS, Mathematics, or other scientific degree) with 12 years of directly related hands-on experienceRelevant cloud certification(s)Desired Qualifications:
Strong organizational, time-management, analytical and troubleshooting skills are a mustHave significant hands-on experience working with tools and technologies in, at a minimum, two of the following areas: software development, configuration control, testing, security, automation, containerization, orchestration, cloud services, open-source technologiesExcellent verbal and written communicationExperience with creating automation of deployment and/or configuration tasks with IaC tools including Terraform, Ansible, Jenkins, or GitLabAmazon Web Services (AWS), Azure, or GCP CertificationExperience with designing, developing, deploying, or delivering using automation techniques or tools, including Ansible or GitKnowledge of cloud and virtualization-based technologies, including Docker, Azure, Amazon Web Services (AWS), VMware, or OpenShiftExperience with troubleshooting Linux, including RHEL, UNIX, networking, scripting and automation, systems administration functions, or application troubleshootingOther:
Plus3 IT Systems is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action Employer, making decisions without regard to race, color, religion, creed, sex, sexual orientation, gender identity, marital status, national origin, age, veteran status, disability, or any other protected class. We are committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities in employment, its services, programs, and activities. To request reasonable accommodation, contact [email protected] [include name and/or department, telephone, and e-mail address].
The health and safety of our employees and their families is a top priority. With the continuing impacts of COVID-19 around the world, we are taking action to protect t