Jobs
>
New York City

    Site Reliability Engineer - New York, United States - Hale Recruiting

    Hale Recruiting
    Hale Recruiting New York, United States

    3 days ago

    Default job background
    Description
    Summary - Site Reliablity Engineer (For one of the Big 4 Sports &Entertainment League)

    Our client is enhancing the landscape of the live sports and entertainment
    industry. They are striving to deliver innovative, cutting-edge technologies to enable safe,
    unforgettable fan experiences across the globe. They are assembling a world-class technology team to build and support platforms and products that anticipate these emerging opportunities.

    The Data(base) Reliability Engineer will join the infrastructure team while also working
    alongside league team members and be responsible for the following areas:

    Uptime, High Availability and Disaster recovery planning
    Incident response
    Optimization of data stores
    Identify SLIs and define SLOs
    Observability tooling
    Debugging running systems and providing tools to assist runtime debugging
    Optimizations for cost control
    Ability to interface with all levels of employees
    Ability to work both independently with little supervision and in a team environment
    Ensures availability, security, integrity, and recovery of data, pipelines and data stores.
    Define and configure relevant database metrics to ensure observability
    Create and maintain dashboards and reports to visualize database performance and health
    Create monitoring and alerting to trigger on error conditions, degradation symptoms and defined
    SLOs, as well as outages
    Develops and implements data store maintenance plans, including performing integrity checks,
    Updating statistics and monitoring security and hardware resource utilization
    Work with peers to roll out changes to production environments and help mitigate and prevent
    Data-related production incidents
    Work on automation of data store infrastructure and help engineering succeed by providing
    self-service tools
    Resolves performance, capacity, replication, and other distributed data, pipeline and data store issues
    Support and debug data production issues across services and levels of the stack
    Provide timely incident response and participate in on-call rotations
    Continuously identify opportunities for process improvement and automation to enhance
    database performance, reliability, and efficiency
    Prioritize unblocking your teammates, collaboration and knowledge sharing


    Qualifications:


    To perform this job successfully, an individual must be able to perform the Duties and Responsibilities (Duties) above satisfactorily and meet the requirements below.

    The requirements listed below are representative of the minimum knowledge, skill, and/or ability required. Reasonable accommodations will be made to enable individuals with disabilities to perform the essential functions of the job.


    Education and/or Experience:

    Required:


    Minimum of a bachelor's degree in Computer Science, MIS or related degree and five (5) years of relevant experience including software or reliability engineering, database administration, datastore programming experience or combination of education, training and experience.

    Ability to communicate clearly and effectively strong opinions on how to use technologies such as cloud, microframeworks, DevOps, automation, and observability tools
    Demonstrable experience engineering automation of triggers, alerts, and remediation
    Have written code in a compiled language that runs in production somewhere
    Experience in Oracle 19c, Postgres, Mongo, Change Data Capture, data and data store monitoring, management and support
    Experience with OLTP, OLAP as well as PL/SQL code development and tuning
    Experience in Linux OS and shell scripting
    Extensive experience in performance tuning and analysis
    Strong ITIL principles are a plus
    Capacity planning for all aspects of a data store system (storage, compute, memory, etc.)
    Understanding of networking and connectivity and how it relates to a data store environment
    Excellent problem solving and troubleshooting skills
    Ability to work non-standard shifts including nights and/or weekend on-call responsibilities
    Dedicated to continuous improvement of yourself and our SRE/DBRE capabilities
    Key Technical Traits

    APIs and microservices:
    REST, Web, Graph
    Database Solutions – Oracle, MYSQL, MSSQL, CloudSQL, NoSQL

    Cloud Providers:
    Oracle Cloud Infrastructure, Google Cloud Platform, AWS
    Real-time log/event monitoring – DataDog, Stackdriver, Oracle Enterprise Manager, Oracle Cloud
    Monitoring, SolarWinds, Splunk, SumoLogic, OpenTelemetry

    Scripting:
    PL/SQL, Shell

    Secured Access and control – Okta SSO and MFA, MS Active Directory, DataSafe
    Software Development tools – Jira, GIT, Jenkins, ArgoCD, Terraform

    Compliance:
    PCI DSS, SSAE18/SOC 1

    #J-18808-Ljbffr


  • LHH New York, United States

    LHH Recruitment Solutions is · seeking a skilled Reliability Engineer to join our client's team and drive maintenance programs, analyze data, and implement solutions to improve equipment reliability and performance. The Reliability Engineer plays a crucial role in evaluating dat ...


  • Executive Alliance New York, United States

    Our client is seeking a · Reliability Engineer · to join our team. In this position, you will work in a dynamic people-focused environment where you will work directly with experienced engineers and scientists to assist with reliability analysis. You will also be responsible fo ...


  • GRN Hudson (Global Recruiters Network) New York, United States

    Reliability Engineer · Supporting manufacturing operations to achieve superior equipment efficiency with minimal down time. The ideal candidate will play a crucial role in ensuring the reliability and performance of our industrial equipment through continuous improvement initiati ...


  • Airswift New York, United States

    We are seeking a Reliability Engineer to work with one of our major clients, a global chemicals business. · This professional will provide expertise to improve reliability of equipment, maintainability of systems and develop processes to increase mean time between failures in or ...


  • STONE Resource Group New York, United States

    ***NOTE: This is role is unable to do C2C and our client is unable to sponsor at this time. This will also be a hybrid model in Atlanta, GA.*** · Overview · STONE Resource Group is partnered with a leading company in the HVAC Industry looking to expand their current team by addi ...


  • The Spear Group New York, United States

    The position requires technical expertise and leadership to implement and execute asset reliability strategies across operating site. Roles and responsibilities for this position are listed below. · Key Responsibilities: · Leads site team of Operations, Maintenance and Reliabil ...


  • Toyo Tires New York, United States

    "Toyo Tires: Where Teamwork Drives Success" · Toyo Tires of North America is seeking a highly skilled and experienced Reliability Engineer to join our Maintenance and Engineering department. Reporting directly to the Engineering Manager, the Reliability Engineer will play a cruci ...


  • InfiniTek Corporation New York, United States

    INFINITEK is currently hiring a Reliability Engineer for a customer location in Concord California. The Reliability Engineer will works on-site at a customer facility, directly coordinates, schedules and participates in work with the customer to comply with the service agreement. ...


  • CTS - Complete Technical Services New York, United States

    Job Summary · The Fixed Equipment Engineer for Corpus Christi location provides technical expertise and recommendations for fixed equipment mechanical integrity, repairs, and reliability improvements. It also provides fixed equipment technical support to refinery operations and ...

  • Walker Lovell

    Reliability Engineer

    22 hours ago


    Walker Lovell New York, United States

    Seeking a · Reliability Engineer · to play a vital role in ensuring the safety, efficiency, and reliability of our plant operations. You'll develop and implement maintenance strategies, conduct root cause analysis, and collaborate with cross-functional teams to enhance equipmen ...


  • Century Aluminum New York, United States

    Roles: · Provide mechanical, electrical, and instrumentation engineering support. Primary areas include developing and maintaining; reliability programs, Root Cause Problem Elimination (RCPE), preventive and predictive maintenance programs, troubleshooting and project sponsorship ...


  • Dupont Newark, United States OTHER

    En DuPont, trabajamos en cosas que importan, ya sea en proporcionar agua limpia a más de mil millones de personas en el planeta, producir materiales esenciales en los dispositivos tecnológicos cotidianos (desde smartphones hasta vehículos eléctricos) o proteger a los trabajadores ...


  • JetBlue Long Island City, United States

    Position Summary · The Engineer Reliability reports to the Manager Reliability and is responsible for aircraft, power plant and component trend/performance analysis in support of the Reliability portion of the Continuing Analysis Surveillance System (CASS). · Essential Responsi ...


  • DSJ Global New York, United States

    Title: · Reliability Engineer · Industry: · Pulp and Paper · Location: · Emporia, VA or Dudley, NC · DSJ Global is currently partnered with a Fortune 500 manufacturer who is looking to add a Reliability Engineer to their continuously growing team. The Reliability Engineer wil ...


  • DSJ Global New York, United States

    Title: · Reliability Engineer · Industry: · Pulp, Paper, and Specialty Fiber Manufacturing · Location: · Emporia, VA or Dudley, NC · DSJ Global is currently partnered with a Fortune 500 pulp, paper, and specialty fiber manufacturer who is looking to add a Reliability Engineer ...


  • Unreal Gigs New York, United States

    Job Description · Job DescriptionJob Summary · We are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and per ...


  • ATR International New York, United States

    We are seeking a Reliability Development Engineer for a very important client. · Job Overview - Principal Duties and Responsibilities · Successful candidate will be tasked for Product, Package reliability test tracking; reliability database, data analysis and summarization on a ...


  • Oakland Search New York, United States

    Software Reliability Engineer - Quant Trading · Title: Software Reliability Engineer · Location: Manhattan, New York City (3 days in) · Comp: $200,000 - $350,000 basic salary + highly competitive performance bonuses · Industry: Finance, Trading, Hedge fund, Capital Markets · Tech ...


  • Hawthorne Executive Search New York, United States

    and Requirements · Our client delivers technology solutions that enable community banks to better serve the communities in which they do business. Their technology and development culture is built on the tenants of innovation, alignment, collaboration, execution, and FUN We hope ...


  • Unreal Gigs New York, United States

    Job Summary · We are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and performance of our systems. Your ro ...