Jobs
>
Oklahoma City

    Lead Site Reliability Engineer - Oklahoma City, United States - BJ's Wholesale Club

    BJ's Wholesale Club background
    Description
    Lead Site Reliability Engineer page is loaded

    Lead Site Reliability Engineer

    Apply

    locations

    BJ's Club Support Center Marlborough, MA #5997

    time type

    Full time

    posted on

    Posted 2 Days Ago

    job requisition id

    R147855

    Join our team of more than 34,000 team members, supporting our members and communities in our Club Support Center,

    235+

    clubs and eight distribution centers.

    BJ's Wholesale Club offers a collaborative and inclusive environment where all team members can learn, grow and be their authentic selves.

    Together, we're committed to providing outstanding service and convenience to our members, helping them save on the products and services they need for their families and homes.

    The Benefits of working at BJ's


    BJ's pays weekly


    Generous time off programs to support busy lifestyles*o Vacation, Personal, Holiday, Sick, Bereavement Leave, Jury Duty


    Benefit plans for your changing needs*o Three medical plans**, Health Reimbursement Account (HRA), Health Savings Account (HSA), two dental plans, flexible spending
    *eligibility requirements vary by position
    **medical plans vary by location

    As a Lead Site Reliability Engineer, you will be responsible for designing, building, monitoring, and continuously improving our ecommerce platform's infrastructure and processes.

    Leveraging your expertise in observability tools such as New Relic, Scalyr/Splunk, bash scripts, and Python scripts, you will play a pivotal role in ensuring the reliability and performance of our Java microservices-based architecture.


    Key Responsibilities :


    Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles.

    Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs).

    Proactively identify and address potential issues before they impact operations, utilizing observability tools like New Relic, Scalyr/Splunk, bash scripts, and Python scripts.

    Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices.

    Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence.

    Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions.

    Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices.

    Design and maintain robust production monitoring systems to ensure timely detection and resolution of issues, following SRE guidelines for effective monitoring and alerting.

    Utilize a diverse array of tools to troubleshoot performance and stability issues effectively, employing SRE methodologies to identify and mitigate bottlenecks.

    Evaluate and enhance application and environment security measures, integrating SRE-driven security practices into the development and deployment pipelines.
    Provide support for globally distributed, multi-cloud (public and/or private) environments, implementing SRE strategies for resilience and fault tolerance.

    Automate repetitive tasks at scale to streamline operational workflows and enhance efficiency, focusing on the implementation of SRE-driven automation solutions.

    Adhere to change management processes during implementations and utilize version control for application infrastructure, following SRE principles for reliable and auditable change management.

    Foster a SRE mindset throughout the organization, promoting collaboration and shared responsibility for reliability and performance

    Qualifications :
    Bachelor's Degree in Computer Science or related field, or foreign equivalent.
    Demonstrated curiosity and self-drive to tackle complex challenges and drive change in a diverse organizational landscape.
    Excellent written and verbal communication skills, with the ability to effectively communicate with engineering management, developers, and leadership.
    Proven ability to adapt to new technologies and learn quickly.
    Minimum of 5 years of experience in Site Reliability Engineering (SRE) or related roles.

    Job Conditions :
    Collaborate within a diverse and global team environment.
    Participate in cross-training with other team members across different regions.
    Rotate in an on-call schedule as required to ensure 24/7 availability and support for critical systems.

    In accordance with the Pay Transparency requirements, the following represents a good faith estimate of the compensation range for this position.

    At BJ's Wholesale Club, we carefully consider a wide range of non-discriminatory factors when determining salary. Actual salaries will vary depending on factors including but not limited to location, education, experience, and qualifications. The pay range for this position is starting from $109,000.00.
    About Us

    At BJ's Wholesale Club, we're focused on delivering unbeatable value and outstanding service to our members and communities.

    Headquartered in Marlborough, Massachusetts, BJ's Wholesale Club is a leading operator of membership warehouse clubs in the Eastern United States.

    Currently operating

    more than 235

    clubs,

    over 165


    BJ's Gas locations and eight distribution centers, we were the first retailer to introduce the warehouse club concept in the northeastern United States.

    Providing a curated assortment of grocery, general merchandise, gasoline and ancillary services, BJ's offers a differentiated shopping experience that is further enhanced by our omnichannel capabilities.

    #J-18808-Ljbffr


  • The Dahill Group Oklahoma City, United States

    Job Description · Job Description · RELIABILITY ENGINEER · Our client sets the standard for superior service. They truly believe people are essential to their ability to innovate, improve, and lead.They provide a company culture where all employees feel valued and supported. F ...


  • The Dahill Group Oklahoma City, United States

    Job Description · Job DescriptionRELIABILITY ENGINEER · Our client sets the standard for superior service. They truly believe people are essential to their ability to innovate, improve, and lead.They provide a company culture where all employees feel valued and supported. From co ...


  • Paycom Payroll Llc Oklahoma City, United States Full time

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionalit ...


  • Ajsearch Oklahoma City, United States

    Safety & Reliability Engineer · About Us · Our client is a transatlantic cutting-edge aerospace company developing solar powered aircraft solutions capable of achieving perpetual flight with heavy, and powerful payload capacity. Utilizing technology based upon the longest conti ...


  • Shopmonkey Oklahoma City, United States

    As a Site Reliability Engineer (SRE) at Shopmonkey, you'll be instrumental in ensuring the reliability, scalability, and performance of our systems and services for both internal and external stakeholders. We're seeking a seasoned professional who demonstrates mastery in computer ...


  • VitalAire Canada Inc. Oklahoma City, United States

    Senior Reliability Engineer page is loaded · Senior Reliability Engineer · Apply · locations · Bayport, TX · time type · Full time · posted on · Posted 2 Days Ago · job requisition id · R · Air Liquide Large Industries provides our customers with industrial gas and ene ...


  • TONI GROUP LLC Oklahoma City, United States

    Our client is a transatlantic cutting-edge aerospace company developing solar powered aircraft solutions capable of achieving perpetual flight with heavy, and powerful payload capacity. Utilizing technology based upon the longest continuous renewably powered flight program in his ...


  • International Staff Consulting Oklahoma City, United States

    Safety & Reliability Engineer · About Us · Our client is a transatlantic cutting-edge aerospace company developing solar powered aircraft solutions capable of achieving perpetual flight with heavy, and powerful payload capacity. Utilizing technology based upon the longest contin ...


  • Cameron Craig Group Oklahoma City, United States

    Our client is a transatlantic cutting-edge aerospace company developing solar powered aircraft solutions capable of achieving perpetual flight with heavy, and powerful payload capacity. Utilizing technology based upon the longest continuous renewably powered flight program in his ...


  • Shell Oklahoma City, United States

    Make a difference · Do you like helping people? Do you like to make people smile? Are you passionate about putting people in a better place? Then you may have what it takes to join our amazing team? · Where you fit in · The Reliability Engineer (RE) provides leadership within ...


  • Corps Partners Oklahoma City, United States

    Our client is a transatlantic cutting-edge aerospace company developing solar powered aircraft solutions capable of achieving perpetual flight with heavy, and powerful payload capacity. Utilizing technology based upon the longest continuous renewably powered flight program in his ...


  • PAYCOM PAYROLL LLC Oklahoma City, United States

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionalit ...


  • PAYCOM PAYROLL LLC Oklahoma City, United States

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionalit ...


  • PAYCOM PAYROLL LLC Oklahoma City, United States

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionalit ...


  • Bridgeway Professionals Inc Oklahoma City, United States

    Our client is a transatlantic cutting-edge aerospace company developing solar powered aircraft solutions capable of achieving perpetual flight with heavy, and powerful payload capacity. Utilizing technology based upon the longest continuous renewably powered flight program in his ...


  • PAYCOM PAYROLL LLC Oklahoma City, United States

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionalit ...


  • PAYCOM PAYROLL LLC Oklahoma City, United States

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionalit ...


  • Paycom Oklahoma City, United States

    Job Details · Level · Experienced · Job Location · Oklahoma City Office - Oklahoma City, OK · Position Type · Full Time · Education Level · Bachelor's Degree · Travel Percentage · None · Job Category · Development · Description · Site reliability engineers will be de ...


  • Saint-Gobain Group Oklahoma City, United States

    Senior Reliability Engineer II - Solar Solutions · Position Description · Job Summary · CertainTeed is one of the leaders in the development of building integrated photovoltaics (BIPV). Developing robust and safe solar products, that are cost-competitive for homeowners, is key ...


  • Saint-Gobain Oklahoma City, United States

    Senior Reliability Engineer II - Solar Solutions · CertainTeed, a subsidiarity of Saint-Gobain, develops innovative solar roofing that combines aesthetics, high energy efficiency, and durability. Our R&D team designs and validates electrical and mechanical assemblies which toget ...