Staff Site Reliability Engineer - Atlanta, United States - Cision

    Default job background
    Description

    Cision Career Opportunity:
    Site Reliability Engineer (SRE)

    Cision is home to the brightest and most passionate individuals in the tech industry, and we invite you to be a part of our expanding team We are dedicated to nurturing our employees through training and professional growth, providing support every step of the way to help you achieve your career aspirations.

    Your success is our top priority.

    We are currently looking for an experienced Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems.

    The ideal candidate will possess extensive knowledge of Kubernetes, containerization technologies, cloud environments (such as AWS and/or GCP), and a solid programming background.

    If you are enthusiastic about automation, security, and maintaining best practices in reliability engineering, we want to hear from you.

    Essential Skills and Experience


    Experience:
    Typically, more than 5 years of experience as an SRE, DevOps, or Software Engineer.


    Kubernetes and Containerization:
    Expert knowledge of Kubernetes at scale and containerization technologies in general.


    Programming Proficiency:
    Strong programming skills in an object-oriented programming (OOP) language with proficiency in script writing.


    Cloud Expertise:
    Expertise in a cloud environment, particularly AWS and/or GCP.


    Security:
    Expert and up-to-date knowledge of security best practices.


    SRE Principles:


    Familiarity with SRE principles, including SLI (Service Level Indicators), SLO (Service Level Objectives), SLA (Service Level Agreements), Toil, Uptime, and Observability.


    Networking:
    Solid understanding of networking fundamentals.


    Standardization and Documentation:
    Commitment to standardization and documentation.


    Linux:
    Experience with Linux.

    Desirable Skills

    Automation and CI/CD:
    Proficiency in software delivery automation, CI/CD (Continuous Integration/Continuous Deployment), and SDLC (Software Development Life Cycle).

    Metrics/Monitoring:
    Experience with metrics, monitoring, and alerting using tools such as Prometheus, Grafana, or similar.

    Infrastructure as Code (IaC): Familiarity with IaC tools, including Ansible and Terraform.

    Additional Technologies:
    Experience with Elasticsearch, Kafka, MySQL, Postgres, Windows, and Redis.

    Key Responsibilities

    Collaborate with cross-functional teams to enhance system reliability, performance, and scalability.

    Implement and maintain automation processes to streamline operations.

    Drive security initiatives and adhere to best practices in cybersecurity.

    Monitor and analyze system performance, identifying areas for improvement.

    Contribute to the development and maintenance of IaC scripts.

    Participate in on-call rotation and incident response activities.

    Education and Certifications
    Bachelor's degree in computer science, Information Technology, or a related field. Relevant certifications in cloud platforms, Kubernetes, or related technologies are a plus.

    Personal Attributes

    Strong problem-solving skills and attention to detail.

    Excellent communication and collaboration skills.

    Ability to thrive in a fast-paced and dynamic environment.

    Continuous learning mindset with a passion for staying abreast of industry trends.

    #J-18808-Ljbffr