Site Reliability Engineer SME - San Antonio, United States - Rackner
Description
Title:
Site Reliability Engineer SME
Location:
San Antonio, TX (Top Secret required personnel may be remote but may be required to be onsite to access the SCIF facilities in San Antonio up to one week, 2 times per month)
Clearance:
Top Secret (SCI eligible)
About this role:
Rackner is seeking a Site Reliability Engineer SME to work with the DSO Platform Product Line Manager and Infrastructure as Code SME, as well as other team members to ensure that the overall platform design and implementation is able to meet or exceed the service level objectives and agreements relating to uptime, availability, and mean-time-to-recover.
We are seeking professionals with:
BS; Engineering, Computer Science, or technical degree or industry experience equivalent
10+ years; Contributing on a technical team for a software or IT project.
4+ years; use of DevSecOps in support of system integrity, availability, and security
3+ years; Development containerized applications and delivery to a containerized platform
3+ years; Providing technical guidance to more junior team members
DoD 8570/8140 IASAE Level II within 30 days of start:
CASP+ CE
CISSP (or Associate)
CSSLP
CISSP-ISSAP
CISSP-ISSEP
CCSP
What will make you successful:
Manage and maintain availability and reliability of critical platform services and applications, ensuring they meet requirements of internal and external users
Collaborate with business leaders in building and running sustainable production systems, which can evolve and adapt to changes in a global business environment
Evaluates performance results and recommends major changes affecting short-term project growth and successRun infrastructure with Chef, Ansible, Terraform, GitLab, CI/CD and Kubernetes
Build monitoring that alerts on symptoms rather than on outages
Document every action so your findings turn into repeatable actions and then into automation
Use the GitLab product to run as a first resort and improve the product as much as possible
Improve operational processes
Design build and maintain core infrastructure that enables GitLab scaling to support hundreds of thousands of concurrent users
Debug production issues across services and levels of the stack
Nice to have:
M.S.; Computer Science or Engineering field
10+ years; Contributing on a technical team
5+ years; providing technical guidance to morejunior team members
DoD 8570/8140 IASAE Level II at start
Who We Are:
Rackner is a software consultancy that builds cloud-native solutions for startups, enterprises, and the public sector.
We are an energetic, growing consultancy with a passion for solving big problems for both startups and enterprises.
Each of us enable digital transformation for large organizations through the newest in distributed technologies as we are laser focused on end-to-end application development, DevSecOps, AI/ML and systems architecture and our methodology focuses on cloud-first and cost-effective innovation.
Benefits/Additional Info:
Rackner embraces and promotes employee development and training and covers the cost of certifications relevant to a position and the technologies/services provided .
401K with 100% matching up to 6%
Highly competitive PTO
Great health insurance with large network of providers
Medical/Dental/Vision
Life Insurance, and short & long term disability
Industry-Leading Weekly Pay Schedule
Home office & equipment plan
#J-18808-Ljbffr