Site Reliability Engineer - Emory, United States - Morgan McKinley

    Morgan McKinley
    Morgan McKinley Emory, United States

    2 weeks ago

    Default job background
    Description

    Responsibilities:

    Create and implement solutions to automate technical operations of large systems, collaborating with teams to enhance stability throughout the software development process.

    Lead efforts to improve the stability of payment systems, including monitoring, log management, and creating diagnostic tools. Conduct regular drills and develop plans for quick service restoration during incidents, participating in on-call rotations as needed. Define metrics to evaluate system performance and runtime, improving observability. Plan system capacities to accommodate business growth and promotions. Analyze production incidents to establish best practices for a highly available payment architecture.


    Requirements:
    At least 3 years relevant work experience from a large-scale systems. Bachelor's degree in Computer Science-related technical discipline, or equivalent practical experience. Solid knowledge of Computer Science, and familiar with the principles of Operating System, Computer Storage, Computer Networking etc. Software development experience in at least one programming language, such as Java/Go/C++/C#/Python. Familiarity with IaC, CI/CD and Observability concepts & tools would be an added advantage. Experiences of Redis, rocketMQ, MySQL, Nginx, Kubernetes, Docker, etc. are a plus.

    By sending us your personal data and curriculum vitae (CV), you are deemed to consent to Morgan McKinley Pte Ltd and its affiliates to collect, use and disclose your personal data for the purposes set out in the Privacy Policy available at//morganmckinley/sg/privacy-policy .

    You acknowledge that you have read, understood, and agree with the Privacy Policy.
    Morgan McKinley Pte LtdHo Ji Keat Joey (HE ZIJIE)

    EA Licence No: 11C5502EA Registration No. R1983255Job ID JN #J-18808-Ljbffr