
    Site Reliability Engineer - Atlanta, United States - Nlb Services

    Description


    As an engineer on the Retail Site Reliability Engineering team, you will be at the forefront of Cloud and Big Data technology.

    In this role you will establish yourself as a technical leader by exposing yourself to a broad range of industry leading technologies that will help to drive acceleration.

    The ideal candidate will have expert design and development capabilities and be positioned to contribute to a growing set of services and features for the ecosystem.

    This role will be supporting highly available, business critical applications.

    This role will serve as the escalation point for complex, hard-to-define issues in both on-premises and AWS environments.

    We are seeking talented engineers who are well versed in DevOps technologies, automation, infrastructure orchestration, configuration management, continuous integration, and troubleshooting of complex issues, and who are not constrained by how things are usually done.


    Quals
    This position is 60% SRE and 40% SDE. Also open to candidates joining MTH along with the ATL GO team. Must agree to work onsite at either of these two locations based on the team's hybrid schedule.

    Required Skillset
    Manage and optimize data streaming and API components in on-premises OpenShift and AWS.

    Proactively review the application's APIs and processes to identify opportunities to optimize response times for various application components.

    Automate various types of testing, including data quality checks; automate delivery to system integration and production; and automate production deployment.
    Develop integrations between the application (on-premises and in AWS) and our third-party tools (ServiceNow, VersionOne, Sumo).
    Work with teams to create SLIs/SLOs.

    Actively monitor and lead troubleshooting of degraded performance and hard-to-define issues for the platform applications, develop the solution, and document artifacts from root cause analysis in the backlog.

    Evolve the cloud infrastructure ecosystem for our application suite by experimenting with emerging technologies and completing prototypes to understand their benefit.

    Design and develop CI/CD pipelines in AWS and OpenShift to deploy various application artifacts, including APIs and Data Process Jobs.

    Analyze, design, and develop the artifacts that configure monitoring and alerting metrics so support engineers can validate, troubleshoot, and resolve issues proactively and promptly.

    Maintain data integrity and access control by using AWS security tools and services such as HSM, IAM, etc.

    Understand and develop tools to monitor AWS billing for the services, generate cost-related reports, and help develop and implement cost optimization strategies.

    Work with enterprise security architects to design and implement data security tools and measures, data encryption, and key management; design and develop solutions to address security vulnerabilities discovered by the internal security audit team, as well as by vendors, the security community, etc.; and design and develop solutions that let the support team regularly scan for, review, and fix security issues.

    Regularly and proactively monitor and analyze the capacity and performance of the platform; work with the architecture team to design and implement elastic infrastructure to accommodate irregular bursts of user traffic and requests.

    Work with the architecture team to develop a backup strategy and implement the backup solution for critical data and application components for service restoration and disaster recovery purposes.

    Work with architecture, infrastructure, and application teams to provide input on continuous improvement of the design, performance, and security enhancements.


    Desired Skillset:
    Deep understanding of the operations of AWS cloud platforms.
    Must be well versed in automation, scripting, and monitoring, including use of tools from the major cloud platforms, including but not limited to OpenShift, CloudFormation, Terraform, Ansible, Shell, and Python.

    Candidates with significant technical knowledge of infrastructure layers are preferred, including but not limited to: Linux OS, major virtualization platforms, traditional and software-defined networks, load balancers, firewalls, API tools, element/performance/intelligent monitoring tools, storage, backup strategy, etc.

    Significant knowledge and experience in end-to-end operations for enterprise systems and applications, including driving issue resolution for mission critical systems.

    Must have experience working to automate, operationalize, and improve Development/QA using CI/CD tools (GitLab, GitHub, Jenkins, Maven, Gradle, Nexus).
    Working experience with Software Release Management.

    Desired Qualification
    BS degree in Computer Science or a related technical field, or equivalent practical experience.

    Minimum Experience
    3+ years of related DevOps/SysOps engineering experience with a focus on major cloud platforms (AWS preferred).
    2+ years of application development experience, including data streaming and deploying/monitoring high-availability critical application components.
    1+ years in a Site Reliability Engineering organization preferred.
    Overall 4-6 years of experience.

    All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

    NLB is committed to providing access, equal opportunity and reasonable accommodation for individuals with disabilities in employment, its services, programs, and activities.

    To request reasonable accommodation, contact HR department by sending an e-mail to

  • The Select Group

    Reliability Engineer

    3 weeks ago


    The Select Group Atlanta, United States

    SYSTEMS RELIABILITY ENGINEER · The Select Group is currently hiring for a Systems Reliability Engineer to join as a resource for one of our clients within the Telecommunications Industry that will be hybrid sitting in Atlanta, GA. This Engineer will be responsible for assessing t ...


  • The Select Group

    Reliability Engineer

    2 weeks ago


    The Select Group Atlanta, United States

    SYSTEMS RELIABILITY ENGINEER · You can get further details about the nature of this opening, and what is expected from applicants, by reading the below. · The Select Group is currently hiring for a Systems Reliability Engineer to join as a resource for one of our clients within ...

  • Jones Lang LaSalle IP, Inc.

    Reliability Engineer

    3 weeks ago


    Jones Lang LaSalle IP, Inc. Atlanta, United States

    JLL supports the Whole You, personally and professionally. · Our people at JLL are shaping the future of real estate for a better world by combining world class services, advisory and technology to our clients. We are committed to hiring the best, most talented people in our ind ...


  • Austin Allen Inc Atlanta, United States

    Reliability Engineer – Manufacturing – South Carolina · Salary $80,000 - $110,000 + Bonus + Fantastic Benefits + Paid Relocation to South Carolina · Actively recruiting talented Reliability Engineers for this growing manufacturing company with locations nationwide · You'll be ...

  • JLL

    Reliability Engineer

    3 weeks ago


    JLL Atlanta, United States

    JLL supports the Whole You, personally and professionally. · Our people at JLL are shaping the future of real estate for a better world by combining world class services, advisory and technology to our clients. We are committed to hiring the best, most talented people in our ind ...

  • U.S. Bank National Association

    Reliability Engineer

    2 weeks ago


    U.S. Bank National Association Atlanta, United States

    Elavon (Elavon is a part of the U.S. Bank family) seeks a full-time Reliability Engineer in Atlanta, GA. The Reliability Engineer supports production applications and proactively looks for ways to automate discoveries, eliminate incidents from recurring and/or reduce the time it ...

  • Southern Company

    Reliability Engineer

    3 weeks ago


    Southern Company Atlanta, United States

    Job Description · Reliability Engineer - Asset Management · Reliability Engineer - Asset Management Locations: State of Georgia. Not required to report to a specific Operating Headquarters location. · Reliability Engineering - Asset Management will primarily be responsible for ...


  • STONE Resource Group

    Reliability Engineer

    3 weeks ago


    STONE Resource Group Atlanta, United States

    ***NOTE: This role is unable to do C2C and our client is unable to sponsor at this time. This will also be a hybrid model in Atlanta, GA.*** · Overview · STONE Resource Group is partnered with a leading company in the HVAC Industry looking to expand their current team by add ...


  • SUCCESS KOREA Atlanta, United States

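    Job Title · Reliability Engineer (Manager/Deputy General Manager level) (applications closed) · Company Overview · Chemical company · Job Description/Qualifications · Responsibilities · Chemical Engineering, Mechanical Engineering related (major in chemical engineering or mechanical engineering, or a related mechanical field) · Experience in process control or maintenance-related work in the chemical industry · Someone able to manage and analyze various types of machinery, rather than only perform simple maintenance work · Someone who has performed maintenance-team-related work ...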


  • Channel Personnel Services

    Reliability Engineer

    2 weeks ago


    Channel Personnel Services Atlanta, United States

    Job Description · Identify and manage asset reliability risks that could adversely affect plant or business operations. · Develops and maintains plant standards (piping, tankage, insulation, etc.) that influence the selection of materials, equipment, and spare par ...


  • PSC Biotech Atlanta, United States

    Job Description PSC Biotech provides the life sciences with essential services to ensure that health care products are developed, manufactured, and distributed to the highest standards, in compliance with all applicable regulatory requirements. Our goal is to skyrocket our client ...

  • STONE Resource Group

    Reliability Engineer

    2 weeks ago


    STONE Resource Group Sandy Springs, United States

    ***NOTE: This role is unable to do C2C and our client is unable to sponsor at this time. This will also be a hybrid model in Atlanta, GA.*** · Overview · STONE Resource Group is partnered with a leading company in the HVAC Industry looking to expand their current team by addin ...


  • Diverse Lynx Atlanta, United States

    9406496 · Site Reliability Engineering (SRE) · Duration (Months): 8 · TCS - Atlanta, GA · Competencies: Digital : Site Reliability Engineering (SRE), Test Automation · Experience (Years):4-6 · Essential Skills: Site Reliability Engineering (SRE) · Desirable Skills: Site Rel ...


  • QuEST Global Services Pte. Ltd Atlanta, United States

    Quest Global is an organization at the forefront of innovation and one of the world's fastest growing engineering services firms with deep domain knowledge and recognized expertise in the top OEMs across seven industries. We are a twenty-five-year-old company on a journey to beco ...


  • Experis Atlanta, United States Contract

    Experis/ManpowerGroup has partnered with a leading Financial Services organization for a Site Reliability Engineer (SRE) to assist their team. This will be a Hybrid Work Model as per the details below. · Job Title: Site Reliability Engineer (SRE) · Location: Atlanta GA/Chandler A ...


  • The Coca-Cola Company Atlanta, United States

    Location(s): · United States of America · City/Cities: · Atlanta · Travel Required: · 0% - 25% · Relocation Provided: · Job Posting End Date: · May 3, 2024 · Shift: · First Shift (United States of America) · Job Description Summary: · We are seeking an experienced l ...


  • Al Nahiya Group Atlanta, United States

    Job Description Knowledge of risk and reliability management systems Experience in failure mode and effect analysis, with a solid understanding of failure mechanisms Experience in troubleshooting, analyzing and resolving engineering problems independently Effective communication ...