    Site Reliability Engineer - Atlanta, United States - NCR Atleos

    Description
    About NCR Atleos

    NCR Atleos, headquartered in Atlanta, is a leader in expanding financial access. Our dedicated 20,000 employees optimize the branch, improve operational efficiency and maximize self-service availability for financial institutions and retailers across the globe.

    TITLE: Site Reliability Engineer

    LOCATION: Atlanta

    Job Role:

    We are looking for a Site Reliability Engineer (SRE), initially focused on production AppOps, who can build scalable systems using automation best practices. These systems should improve reliability and velocity and enable monitoring of the operational health of stacks throughout their life cycle, including metrics collection, aggregation, and visualization.

    As a member of the SRE team, you will support NCR's Financial Services business unit, product, and technology teams to improve the design and operation of systems, focusing on making them scalable, reliable, and efficient while ensuring production performance and high availability of products and services residing primarily in the cloud. You will influence the development and implementation of reliable production systems and services to address emerging business needs (such as cloud-based SaaS). SREs pride themselves on the resiliency and stability of production systems, yet at the same time are committed to innovation and operational improvement through the application of software engineering practices to operations.

    The SRE will facilitate innovation and operational improvement through the application of software engineering practices to operations. You will make our products easier to adopt and use by making improvements to the product, tools, processes and documentation. You are someone who strives for six 9's or better in availability/uptime.

    Job Description:
    • You will be responsible for maintaining and scaling production services and servers for complex and high throughput cloud services.
    • You will bridge and own the union between development, quality, security and operations.
    • You will improve scalability, service reliability, capacity, and performance.
    • You will write automation code for provisioning and operating infrastructure at massive scale.
    • You are not just an operator; you are an experienced software engineer focused on operations.
    • You will initiate and contribute to continuous improvement of our software delivery processes and practices in a multi-location, multidisciplinary team to empower and accelerate product development.
    • You will use automation extensively to design, configure, manage, and monitor systems in support of our product development teams.
    • You will participate in disaster recovery planning and execution.
    • You will be responsible for maintaining and patching servers that support SaaS products, including Windows and Linux servers running in in-house data centers and/or on cloud PaaS providers (Azure).
    • You'll work hand-in-hand with all teams to ship our code to production using Continuous Integration / Continuous Deployment (CI/CD) and AppSec tooling.
    • You will collaborate with development teams and use intuition, experience and understanding to create SLIs, SLOs, and SLAs.
    • You will provide timely assistance and remediation solutions during critical situations and production incidents to help resolve service problems. You will be on call for periods of time.
    • You will develop monitoring architecture, implement monitoring agents, build dashboards, and manage escalations and alerts.
    • You will participate in incident management and drive root cause analysis (RCA) and risk management processes.
    • You will participate in a rotating on-call schedule during off-hours where you may periodically need to remote-in to systems if a production outage occurs.
    IDEAL TECHNICAL AND PROFESSIONAL SKILLS:
    • BS degree in Computer Science or a related technical field, or 5 years of prior relevant experience.
    • Extensive experience in a DevOps / SRE role, with demonstrable experience deploying and managing large-scale production environments on GCP, AWS or Azure and in multi-datacenter environments.
    • Experience developing and debugging code (e.g., one or more of the following: Java, C, C++, .NET, Python, Ruby, Go, Shell, Perl, JavaScript).
    • 2+ years deploying and supporting high traffic, scalable web applications/services
    • 2+ years with cloud virtualization and PaaS
    • 2+ years with AWS/GCP/Azure
    • 2+ years with Docker, Kubernetes and early versions of OpenShift
    • Experience with Linux, shell scripting, PKI (TLS/SSL), networking, firewalls, load balancers and backups
    • Experience in designing, analyzing and running large-scale distributed systems
    • Experience hosting and solving problems with public-facing services securely in Azure, AWS or GCP
    • Experience with orchestration, automation, and configuration management tools like git, Fabric and Ansible (or Puppet, Chef, Terraform, Helm or related technology)
    • Excellent analysis, debugging, root-cause identification, and troubleshooting skills
    • Experience with Kubernetes, system virtualization, on-prem and/or hybrid cloud computing, cloud identity and security systems, cloud monitoring and logging, and/or local/cloud storage.
    • Experience with one or more CI tools, such as Jenkins, Artifactory, Harness, or CloudBuild
    • Experience with application disaster recovery, migration, roll-back plans, expansion, routine deployments, and system upgrades
    • Experience with log management, including aggregation, alerting, and graphing (e.g., Sensu, StackDriver, Prometheus, or the ELK and TICK stacks)
    • Bonus points for experience with Cassandra, Elasticsearch or Kafka
    • Extra bonus points for Cloud certifications and exposure to Harness
    About NCR Corporation:

    NCR is the global leader in self-service interactions. We are at the forefront of turning everyday transactions into exceptional experiences and making every day easier. NCR's footprint extends over a wide spectrum of areas: from point-of-sale terminals to ATMs, from financial and retail management systems to global payment systems. The industry is changing at an incredible rate with the arrival of new and disruptive technologies and startups, and we want you to be a part of it. This is an exciting time to get involved in the new products and solutions that NCR is developing for this rapidly changing world, applying the latest technologies and development practices.

    EEO Statement
    Integrated into our shared values is NCR's commitment to diversity. NCR is committed to being a globally inclusive company where all people are treated fairly, recognized for their individuality, promoted based on performance and encouraged to strive to reach their full potential. We believe in understanding and respecting differences among all people. NCR does not discriminate in employment based on sex, age, race, color, creed, religion, national origin, disability, sexual orientation, veteran status, military service, genetic information, or any other characteristic or conduct protected by law. Every individual at NCR has an ongoing responsibility to respect and support a globally diverse environment.

    Statement to Third Party Agencies
    To ALL recruitment agencies: NCR only accepts resumes from agencies on the NCR preferred supplier list. Please do not forward resumes to our applicant tracking system, NCR employees, or any NCR facility. NCR is not responsible for any fees or charges associated with unsolicited resumes.

    Offers of employment are conditional upon passage of screening criteria applicable to the job.

    Full time employee benefits include:
    • Medical Insurance
    • Dental Insurance
    • Life Insurance
    • Vision Insurance
    • Short/Long Term Disability
    • Paid Vacation
    • 401k
    EEO Statement
    NCR Atleos is an equal-opportunity employer. It is NCR Atleos policy to hire, train, promote, and pay associates based on their job-related qualifications, ability, and performance, without regard to race, color, creed, religion, national origin, citizenship status, sex, sexual orientation, gender identity/expression, pregnancy, marital status, age, mental or physical disability, genetic information, medical condition, military or veteran status, or any other factor protected by law.

    Statement to Third Party Agencies

    To ALL recruitment agencies: NCR Atleos only accepts resumes from agencies on the NCR Atleos preferred supplier list. Please do not forward resumes to our applicant tracking system, NCR Atleos employees, or any NCR Atleos facility. NCR Atleos is not responsible for any fees or charges associated with unsolicited resumes.

  • The Select Group

    Reliability Engineer

    2 weeks ago


    The Select Group Atlanta, United States

    SYSTEMS RELIABILITY ENGINEER · The Select Group is currently hiring for a Systems Reliability Engineer to join as a resource for one of our clients within the Telecommunications Industry that will be hybrid sitting in Atlanta, GA. This Engineer will be responsible for assessing t ...


  • The Select Group Atlanta, United States

    SYSTEMS RELIABILITY ENGINEER · You can get further details about the nature of this opening, and what is expected from applicants, by reading the below. · The Select Group is currently hiring for a Systems Reliability Engineer to join as a resource for one of our clients within ...


  • Austin Allen Inc Atlanta, United States

    Reliability Engineer – Manufacturing – South Carolina · Salary $80,000 - $110,000 + Bonus + Fantastic Benefits + Paid Relocation to South Carolina · Actively recruiting talented Reliability Engineers for this growing manufacturing company with locations nationwide · You'll be ...

  • U.S. Bank National Association

    Reliability Engineer

    2 weeks ago


    U.S. Bank National Association Atlanta, United States

    Elavon (Elavon is a part of the U.S. Bank family) seeks a full-time Reliability Engineer in Atlanta, GA. The Reliability Engineer supports production applications and proactively looks for ways to automate discoveries, eliminate incidents from recurring and/or reduce the time it ...

  • STONE Resource Group

    Reliability Engineer

    2 weeks ago


    STONE Resource Group Atlanta, United States

    ***NOTE: This role is unable to do C2C and our client is unable to sponsor at this time. This will also be a hybrid model in Atlanta, GA.*** · Overview · STONE Resource Group is partnered with a leading company in the HVAC Industry looking to expand their current team by add ...

  • JLL

    Reliability Engineer

    2 weeks ago


    JLL Atlanta, United States

    JLL supports the Whole You, personally and professionally. · Our people at JLL are shaping the future of real estate for a better world by combining world class services, advisory and technology to our clients. We are committed to hiring the best, most talented people in our ind ...

  • Jones Lang LaSalle IP, Inc.

    Reliability Engineer

    2 weeks ago


    Jones Lang LaSalle IP, Inc. Atlanta, United States

    JLL supports the Whole You, personally and professionally. · Our people at JLL are shaping the future of real estate for a better world by combining world class services, advisory and technology to our clients. We are committed to hiring the best, most talented people in our ind ...


  • Southern Company

    Reliability Engineer

    3 weeks ago


    Southern Company Atlanta, United States

    Job Description · Reliability Engineer - Asset Management · Reliability Engineer - Asset Management Locations: State of Georgia. Not required to report to a specific Operating Headquarters location. · Reliability Engineering - Asset Management will primarily be responsible for ...


  • SUCCESS KOREA Atlanta, United States

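    Job Title · Reliability Engineer (Manager/Deputy General Manager level) (Applications closed) · Company Overview · Chemical company · Responsibilities/Qualifications · Job Duties · Chemical Engineering or Mechanical Engineering related (major in chemical engineering, mechanical engineering, or a related mechanical field) · Experience in process control or maintenance-related work in the chemical industry · Someone able to manage and analyze a variety of machinery, rather than only perform simple maintenance work · Someone who has performed maintenance-team-related work ...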


  • PSC Biotech Atlanta, United States

    Job Description · PSC Biotech provides the life sciences with essential services to ensure that health care products are developed, manufactured, and distributed to the highest standards, in compliance with all applicable regulatory requirements. Our goal is to skyrocket our client ...


  • Channel Personnel Services Atlanta, United States

    Job Description · Identify and manage asset reliability risks that could adversely affect plant or business operations. · Develop and maintain plant standards (piping, tankage, insulation, etc.) that influence the selection of materials, equipment, and spare par ...


  • STONE Resource Group

    Reliability Engineer

    2 weeks ago


    STONE Resource Group Sandy Springs, United States

    ***NOTE: This role is unable to do C2C and our client is unable to sponsor at this time. This will also be a hybrid model in Atlanta, GA.*** · Overview · STONE Resource Group is partnered with a leading company in the HVAC Industry looking to expand their current team by addin ...


  • Datum Software Atlanta, United States

    Site Reliability Engineer · Long Term Contract · Atlanta, GA · Qualifications: · Manage and optimize data streaming and API components in OpenShift On-premises and AWS. · Proactively review the application's APIs and processes to identify opportunities to optimize the response times ...


  • At Datum Tech Group Atlanta, United States

    Site Reliability Engineer · Long Term Contract · Hybrid Schedule · Atlanta, GA · Qualifications: · Minimum Experience: · • 3+ years of related DevOps, SysOps engineering experience with focus on major cloud platforms (AWS preferred). · • 2+ years of application development exp ...


  • Apex Systems Atlanta, United States

    We are seeking a proactive and dynamic Site Reliability Engineer to support our multiple engineering teams by providing robust infrastructure solutions. The ideal candidate will be responsible for patching, building pipelines, and must possess a strong understanding of Terraform. ...


  • Honeywell Atlanta, United States

    As a Site Reliability Engineer here at Honeywell, you will play a critical role in ensuring the reliability, availability, and performance of our systems and applications. You will work closely with cross-functional teams to identify and resolve issu Reliability Engineer, Liabili ...


  • Tech Providers Inc. Atlanta, United States

    Site Reliability Engineer · Atlanta GA (Hybrid) 06+ Months Contract to Hire · Skills: · Top 5 Must Haves: · Extensive/Strong AWS experience, experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---P ...


  • Diverse Lynx Atlanta, United States

    9406496 · Site Reliability Engineering (SRE) · Duration (Months): 8 · TCS - Atlanta, GA · Competencies: Digital : Site Reliability Engineering (SRE), Test Automation · Experience (Years):4-6 · Essential Skills: Site Reliability Engineering (SRE) · Desirable Skills: Site Rel ...