Jobs
>
Dallas

    Senior Site Reliability Engineer - Dallas, United States - Saxon Global

    Default job background
    Description

    Job Summary:

    We are looking for a Site Reliability Engineer (SRE) who will be responsible for ensuring the reliability, availability, and performance of our production systems.

    As an SRE, you will work closely with cross development and engineering teams to design and implement tools and processes to automate deployment, observability, and troubleshooting of our applications and infrastructure supporting the deployment of new Android tablets to the stores.

    This individual must be skilled and have professional experience with the core functions of Site Reliability Engineering including deployments, observability, monitoring, telemetry, and automation.

    Please be sure to call out your experience in these areas and how your technical experience matches the requirements below in your resume.



    Responsibilities:
    Ensure the reliability, availability, and performance of our production systems as we scale

    Develop and maintain monitoring and alerting systems to detect and respond to incidents in a timely manner

    There is no on-call rotation but occasionally support planned deployment roll outs that may require working off-hours during store closure

    Work with cross-functional teams to plan and execute scaling initiatives

    Develop and maintain documentation of processes, procedures, and technical configurations


    Requirements:
    Strong written and verbal communication skills with peers, technical leads, project managers and product owners

    Must be able to collaborate with customers and cross-functional teams to design, test and validate deliverable which meet or exceed expectations

    Self-starter and highly motivated individual that is well-organized

    Bachelor's degree in Computer Science or related field

    5+ years of experience as a Site Reliability Engineer

    Strong experience with automation tools and experience with automation scripting in Python

    Experience with containerization technologies such as Docker and Kubernetes

    Experience with cloud platforms such as Azure or AWS

    Experience with monitoring and logging tools such as Datadog, Prometheus, Grafana or Splunk

    Strong understanding of networking, security, and systems administration

    Excellent problem-solving skills and attention to detail

    Must be available to work core hours PST.


    Preferred qualifications:
    Experience with distributed systems and supporting a large retail business

    Experience with infrastructure as code tools such as Terraform or CloudFormation

    Experience with CI/CD tools such as Jenkins

    Experience with incident ticketing systems such as ServiceNow and Jira for tracking stories

    Familiarity with Agile/Scrum methodologies and DevOps principles

    If you are passionate about ensuring the reliability and availability of systems in our stores and enjoy collaborating with cross-functional teams to solve complex problems, we encourage you to apply for this exciting opportunity as an SRE.



  • Epsilon Grand Prairie, United States

    Sonova · Radolfzell am Bodensee, BW 78315 · posted 04/27/2024 · More... · front runner · Buyer/Planner Fertigungsdisponent:in (d/w/m) · Danaher · Bodman-Ludwigshafen, Baden-Württemberg 78351 · posted 04/27/2024 · More... · front runner · Manager: in Software Team (d/f/m) ...


  • Mass Staffing Projects Dallas, United States

    Job Description · Looking for something reliable? One of our top mining clients is looking for a skilled and experienced Reliability Engineer to join their team in Free State. · Requirements Include: · Senior Certificate · Degree in Mechanical / Electrical Engineering · GCC ...


  • Mass Staffing Projects Dallas, United States

    Job Description Looking for something reliable?One of our top mining clients is looking for a skilled and experienced Reliability Engineer to join their team in Free State.Requirements Include: · •Senior Certificate · •Degree in Mechanical / Electrical Engineering · •GCC Mines & ...


  • AllSTEM Connections Plano, United States

    SITE RELIABILTY ENGINEER · ON W2 · PLANO,TX/HOUSTON,TX/DELAWARE · HYBRID REPORTING: 3DAYS ONSITE · SKILLSET NEEDED: · AWS · BIG DATA · SPARK · PYTHON · SCRIPTING · SHELL · PERL · CONTROL-M · AUTOSYS · GRAFANA · ...


  • WestRock Dallas, United States

    The Electrical Reliability Engineer is considered the local expert on the equipment within their area of assignment and must either have demonstrated capability unique to the equipment or be capable of quickly assimilating new information to become e Reliability Engineer, Electri ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliability En ...


  • Highbrow Dallas, United States

    System Reliability Engineer (SRE) 2 · —> 5 to 7 years experience · Location :- Kansas City, Mi or Atlanta, GA or Dallas, Texas · Job Description: · We are seeking an experienced System Reliability Engineer (SRE) 2 to join our team. The ideal candidate will have 5 to 7 years of ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliabilit ...


  • Avetta Dallas, United States

    Join Avetta as a Site Reliability Engineer · Site Reliability Engineers are pioneers of the production systems, we believe in proactive discovery and analysis of our entire stack, continually optimizing, tuning, and scaling the system for maximal end-user experience on a globally ...


  • Tecktiva Dallas, United States

    Job Title: Site Reliability Engineer · Location: Phoenix, AZ / dallas, TX · Duration: 6 Months+ · Responsibilities: · Expert in Observability & SRE principles, SLI, SLO and SLA definition and management · Experienced in Grafana stack and other Application performance managemen ...


  • Global Mobility Services Dallas, United States

    ** We are not looking for C2C profiles at the moment** · Site Reliability Engineer · 7 months contract to start (potential for CTH) · Looking for hybrid resources local to Denver or Dallas. · Summary · Our client is actively seeking a skilled and experienced Site Reliability Eng ...


  • Goldman Sachs Dallas, United States

    MORE ABOUT THIS JOB: · Who we are: · At Goldman Sachs, our culture is one of teamwork, innovation and meritocracy. We often say our people are our greatest asset and we take pride in supporting each colleague both professionally and personally. From collaborative work spaces an ...


  • Avature Dallas, United States

    WestRock (NYSE :WRK) is a global leader in sustainable paper and packaging solutions. We are materials scientists, packaging designers, mechanical engineers and manufacturing experts with a shared purpose: Innovate Boldly. Package Sustainably. Guided by our values of integrity, r ...


  • Avature Dallas, United States

    WestRock – · Electrical Reliability Engineer, Dallas, TX · WestRock is a company where each of us genuinely belongs, is respected and valued, can do our best work, and where diversity, inclusion and equity are competitive advantages · The Opportunity: · The Electrical Reliabilit ...


  • Diverse Lynx Dallas, United States

    Job Title: Site Reliability Engineer · Location: Dallas, TX//Onsite · Duration: Full Time-Only · JOB DESCRIPTION: · Responsible for ensuring the reliability of systems, minimizing downtime, and maintaining service-level objectives (SLOs). · •Developing, automation and imple ...


  • PMG, Inc. Dallas, United States

    PMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to del ...


  • Avetta Dallas, United States

    Join Avetta as a Site Reliability Engineer · Site Reliability Engineers are pioneers of the production systems, we believe in proactive discovery and analysis of our entire stack, continually optimizing, tuning, and scaling the system for maximal end-user experience on a globall ...


  • Suncaptech Dallas, United States

    Role: Site Reliability Engineer · Location: Dallas, TX (Onsite) · 2 Positions available · Implement SRE practices · Identify, craft, and maintain SLIs and SLOs for teams, as well as metrics such as MTTR, Lead time for change, Deployment Frequency and Change Failure Rate · Work ...


  • Avetta Dallas, United States

    Join Avetta as a Site Reliability Engineer · Site Reliability Engineers are pioneers of the production systems, we believe in proactive discovery and analysis of our entire stack, continually optimizing, tuning, and scaling the system for maximal end-user experience on a globally ...


  • Juniper Networks Dallas, United States

    Job Description · Site Reliability Engineer 4 · Location: Anywhere in the United States · Juniper is changing what's possible in networking. We're going beyond building the networks customers expect - we're building the networks customers deserve. And the world is taking note. B ...