Jobs
>
Iselin

    Site Reliability Engineer - Iselin, United States - CLS Group.

    CLS Group.
    CLS Group. Iselin, United States

    1 week ago

    Default job background
    Description

    Job Purpose

    The role is primarily responsible for ensuring that SRE methodologies are applied to the cloud hosted environment. In addition, the role will act as a central point of expertise for SRE automation across the Platform Operations team.

    Essential Job Functions

    • Responsible for implementing SRE methodologies within the cloud hosted environment, including automation of TOIL and definition / implementation of SLOs and SLAs.
    • Establish SRE as a practice within the Cloud team and working closely with Infrastructure Engineering enhance observability and telemetry to ensure the Cloud hosted services have appropriate Service metrics and monitoring.
    • Build out GitOps practices for use in the cloud hosted environment using tooling such as Terraform and Ansible. Provide a liaison between Engineering and Cloud Operations to fully embed Infrastructure as Code for all new cloud hosted deployments.
    • Provide support escalation for Cloud and Automation related issues ensuring that Production stability is always the primary requirement.
    • Ensure risks and stability issues in the cloud hosted environment are understood and addressed where possible through SRE best practice as part of any incident postmortems.

    Minimum Education Required

    • Bachelors degree educated or equivalent
    • Industry standard IT certification desired e.g. AWS / Microsoft / VMware / Redhat Linux

    Minimum job-related Experience Required

    • Must have strong technical operational support experience within an infrastructure services team preferably supporting either on-premise compute or cloud hosted environments.
    • Strong understanding of automation technologies ideally including Terraform and Ansible and ability to use these tools to drive Infrastructure as Code through GitOps methodologies.
    • Knowledge of at least 1 scripting language, preferably either Python or Powershell.
    • Minimum of 2 years experience applying SRE methodologies within a support team and an understanding of Service Level metrics associated with this.
    • Experience of APM tools (e.g. Grafana / Datadog / Dynatrace).
    • Experience of working in a regulated financial services / banking organization.

    Special Skills/Knowledge

    • Able to understand and use at least one cloud hosted services including either AWS or Azure.
    • Possesses a strong service-orientated mindset, can consistently deliver a high level of service to the business.
    • Able to make and influence decisions with peers, stakeholders and management.
    • Able to communicate effectively with both business and technical staff at all levels. This includes communicating complex technical issues to different levels of management.
    • Able to work proactively, own complex deliveries and provide regular updates to management and stakeholders.

    Expected full-time salary range between $140,000 - $170,000 + variable compensation + 401(k) match + benefits.

    *Note: Disclosure as required by NY Pay Transparency Law of the expected salary compensation range for this role



  • Integrity Consulting New York, United States

    Uma das maiores empresas multinacionais do mundo, no seu Centro de Inovação Digital com o objetivo de cada vez mais se tornar uma empres digital com tecnologias de ponta e um time global.Como objetivo de serem sempre abertos a novos caminhos, transparentes no que fazem e receptiv ...


  • Salling Group A/S New York, United States

    Jesteśmy oddziałem największej duńskiej firmy z branży handlu detalicznego. Działamy jako centrum usług biznesowych, w którym obsługujemy procesy zachodzące w naszych europejskich spółkach, takich jak: Bilka, Fotex, Salling czy dobrze znanej w Polsce — sieci dyskontowej Netto. · ...


  • Sodexo RAHWAY, United States Full time

    Unit Description: Sodexo is currently seeking a Reliability Engineer to provide support for our Life Science Client located in Rahway, New Jersey. The Reliability Engineer will play a crucial role in managing and maintaining a substantial backlog of initiatives and strategic chan ...

  • Sodexo

    Reliability Engineer

    23 hours ago


    Sodexo RAHWAY, United States Full time

    Unit Description: Sodexo is currently seeking a Reliability Engineer to provide support for our Life Science Client located in Rahway, New Jersey. The Reliability Engineer will play a crucial role in managing and maintaining a substantial backlog of initiatives and strategic chan ...


  • CLS Group Iselin, United States

    Job Purpose · The role is primarily responsible for ensuring that SRE methodologies are applied to the cloud hosted environment. In addition, the role will act as a central point of expertise for SRE automation across the Platform Operations team. · Essential Job Functions · Resp ...


  • Lawrence Harvey Iselin, United States

    Lawrence Harvey is partnered with a specialty financial institution that plays a critical role in the foreign exchange market. Their global settlement infrastructure reduces systemic risk at large and is a trusted party at the center of the global ecosystem. · They're in the pro ...


  • Lawrence Harvey Iselin, United States

    Lawrence Harvey is partnered with a specialty financial institution that plays a critical role in the foreign exchange market. Their global settlement infrastructure reduces systemic risk at large and is a trusted party at the center of the global ecosystem. · Theyre in the proc ...


  • Mizuho Bank Iselin, United States Full time

    Join the Mizuho team as a Lead Site Reliability Engineer (SRE) · The successful candidate will bring the following: · • 10+ years of Software Engineering, and Architecture experience with at least 5+ years on SRE focused experience in Production Support, Application Support and D ...


  • CLS Group. Woodbridge, United States

    Job Purpose · The role is primarily responsible for ensuring that SRE methodologies are applied to the cloud hosted environment. In addition, the role will act as a central point of expertise for SRE automation across the Platform Operations team. · Essential Job Functions · Resp ...


  • MSD Malaysia Rahway, United States

    Senior Electrical Reliability Engineer page is loaded · Senior Electrical Reliability Engineer · Apply · remote type · Not Applicable · locations · USA - Pennsylvania - West Point · time type · Full time · posted on · Posted 3 Days Ago · job requisition id · R294317 ...


  • LyondellBasell Industries Edison, United States

    LyondellBasell · Basic Function · As a member of our dynamic team, you will provide the site engineering participation in the identification and resolution of mechanical related problems and define improvement opportunities that will positively impact quality, capacity and mecha ...


  • Lstechinc Plainfield, United States

    [vc_row el_id="sreEngineer"][vc_column][vc_column_text] · Essential Duties and Responsibilities: · Design, build, and maintain scalable and highly available infrastructure using automation and configuration management tools (e.g., Terraform, Ansible, Kubernetes). · Monitor syst ...


  • LyondellBasell Edison, United States

    LyondellBasell · Basic Function · As a member of our dynamic team, you will provide the site engineering participation in the identification and resolution of mechanical related problems and define improvement opportunities that will positively impact quality, capacity and mech ...


  • Merck & Company, Inc. Rahway, United States

    As a Site Reliability Engineer (SRE) Architecture Lead, you will drive the technology strategy, architecture, and operational excellence of critical tech domains. Your role encompasses establishing technology roadmaps, defining end-to-end architectur Reliability Engineer, IT, Lia ...


  • Insight Global Kenilworth, United States

    A large pharma company is seeking an SRE Engineer that really understands site reliability engineering on a technical level + understands all SLO's as well as why someone would want to do reliability. This person should be confident in teaching best practices and principles of SR ...


  • Airswift New York, United States

    We are seeking a Reliability Engineer to work with one of our major clients, a global chemicals business. · This professional will provide expertise to improve reliability of equipment, maintainability of systems and develop processes to increase mean time between failures in or ...


  • The Spear Group New York, United States

    The position requires technical expertise and leadership to implement and execute asset reliability strategies across operating site. Roles and responsibilities for this position are listed below. · Key Responsibilities: · Leads site team of Operations, Maintenance and Reliabil ...


  • CTS - Complete Technical Services New York, United States

    Job Summary · The Fixed Equipment Engineer for Corpus Christi location provides technical expertise and recommendations for fixed equipment mechanical integrity, repairs, and reliability improvements. It also provides fixed equipment technical support to refinery operations and ...


  • Walker Lovell New York, United States

    Seeking a · Reliability Engineer · to play a vital role in ensuring the safety, efficiency, and reliability of our plant operations. You'll develop and implement maintenance strategies, conduct root cause analysis, and collaborate with cross-functional teams to enhance equipmen ...

  • Executive Alliance

    Reliability Engineer

    2 weeks ago


    Executive Alliance Pine Brook, United States

    Brief Description: · Our client is seeking a Reliability Engineer to join our team. In this position, you will work in a dynamic people-focused environment where you will work directly with experienced engineers and scientists to assist with reliability analysis. You will also b ...