Jobs
>
Baltimore

    Senior Kafka Site Reliability Engineer - Baltimore, United States - Leidos

    Leidos background
    Description

    Description

    Leidos is seeking a Senior Kafka Site Reliability Engineer to be part of the mission solution and help lead SSA's Digital Modernization Strategy. Join one of our high performing teams responsible for building the next-generation enterprise APIs and modern applications using data streaming and event driven architecture to modernize the speed at which business is done. We serve the Social Security Administration (SSA) and their mission to meet the changing needs of the public, positively impacting at least 65 million American lives per month. We are a team of forward-looking professionals in need of a strong candidate with these key required skills:Kafka Architecture, Ansible Automation, RHEL/Linux Administration, Scripting (Bash, Shell, Python), Availability Monitoring / Triage (Splunk, Dynatrace, Prometheus).

    If this sounds like a mission you want to be a part of, keep reading

    TEAM CULTURE

    Your passion and values might be a good fit for our teams if you answer "yes" to the following questions:

    • Are you looking for a company that puts employees first, with a focus on career, flexibility, and well-being?
    • Do you enjoy collaborating with colleagues and teammates and believe that the best ideas are fostered in an inclusive environment?
    • Are you searching for a team with a strong sense of ownership, urgency, and drive for daily mission success?
    • Are you comfortable with proactive outward communication and technical leadership?
    • Do you enjoy being a catalyst, solving complex problems, and providing innovative solutions?
    • Do you have the flexibility, creativity, and resilience to pivot the mission for success?
    • Do you have the courage to make tough ethical decisions with pride, transparency, and respect?

    MENTORSHIP & CAREER GROWTH

    Our teams are dedicated to supporting new team members in an environment that celebrates knowledge sharing and mentorship. Experienced team members will be assigned to new hires for one-on-one mentoring, collaborative reviews, and coaching on customer engagement to help each new hire successfully onboard and demonstrate their skills. Projects and tasks are assigned in a way that leverages your strengths and will help you further develop your skillset.

    DAY TO DAY RESPONSIBILITIES

    Every position we take is more rewarding when you know the why behind it.Know your work makes a difference to support those who need it most. If your passion is enabling life changing service to those around, you this is the place for you. Find you passion in a team environment where all members are valued regardless of contractor or employee status. Find your "Why" with us and take your place in our Leidos Family

    • Architect, design, develop, and implement next-generation data streaming and event-based architecture / platform using software engineering best practices in the latest technologies:
      • Data Streaming, Event Driven Architecture, Event Processing Frameworks
      • DevOps (Jenkins, Red Hat OpenShift, Docker, SonarQube)
      • Infrastructure-as-Code and Configuration-as-Code (Ansible, Terraform / CloudFormation, Scripting)
    • Administer Kafka including automating, installing, migrating, upgrading, deploying, troubleshooting, and configuring on Linux.
    • Provide expertise in one or more of these areas: Kafka administration, event-driven architecture, automation, application integration, monitoring and alerting, security, business process management/business rules processing, CI/CD pipeline and containerization, or data ingestion/data modeling.
    • Investigate, repair, and actively ensure business continuity regardless of impacted component: Kafka Platform, business logic, middleware, networking, CI/CD pipeline, or database (PL/SQL and Data Modeling).
    • Brief management, customer, team, or vendors using written or oral skills at appropriate technical level for audience
    • All other duties as assigned or directed

    FOUNDATION FOR SUCCESS(Basic Qualifications)

    • Bachelor's Degree in Computer Science, Mathematics, Engineering or a related field.Experience may be substituted in lieu of degree.
    • Master's or Doctorate degree may substitute for required experience
    • 8+ years of combined experience with Site Reliability Engineering, providing DevOps support, and/or RHEL administration for mission-critical platforms, ideally Kafka.
    • 4+ years of combined experience with Kafka (Confluent Kafka, Apache Kafka, Amazon MSK)
    • 4+ years of experience with Ansible automation
    • Must be able to obtain and maintain a Public Trust. Contract requirement.

    *** Selected candidate must reside within two (2) hours of SSA Headquarters in Woodlawn, MD

    *** Selected candidate must be willing to work on-site at least 2 days a week.

    FACTORS TO HELP YOU SHINE(Required Skills)

    These skills will help you succeed in this position:

    • Strong experience with Ansible Automation and authoring playbooks and roles for installing, maintaining, or upgrading platforms
    • Solid experience using version control software such as Git/Bitbucket including peer reviewing Ansible playbooks
    • Hands-on experience administrating Kafka platform (Confluent Kafka, Apache Kafka, Amazon MSK) via Ansible playbooks or other automation.
    • Understanding of Kafka architecture, including partition strategy, replication, transactions, tiered storage, and disaster recovery strategies.
    • Strong experience in automating tasks with scripting languages like Bash, Shell, or Python
    • Solid foundation of Red Hat Enterprise Linux (RHEL) administration
    • Basic networking skills
    • Solid experience triaging and monitoring complex issues, outages, and incidents
    • Experience with integrating/maintaining various 3rd party tools like ZooKeeper, Flink, Pinot, Prometheus, and Grafana.
    • Experience with Platform-as-a-Service (PaaS) using Red Hat OpenShift/Kubernetes and Docker containers
    • Experience working on Agile projects and understanding Agile terminology.

    HOW TO STAND OUT FROM THE CROWD (Desired Skills)

    Showcase your knowledge of modern development through the following experience or skills:

    • Preferred Confluent Certified Administrator for Apache Kafka (CCAAK) or Confluent Certified Developer for Apache Kafka (CCDAK)
    • Practical experience with event-driven applications and at least one event processing framework, such as Kafka Streams, Apache Flink, or ksqlDB.
    • Understanding of Domain Driven Design (DDD) and experience applying DDD patterns in software development.
    • Experience working with Kafka connectors and/or supporting operation of the Kafka Connect API
    • Experience with Avro / JSON data serialization and schema governance with Confluent Schema Registry.
    • Preferred experience with AWS cloud technologies or other cloud providers; AWS cloud certifications.
    • Experience with Infrastructure-as-Code (CloudFormation / Terraform, Scripting)
    • Solid knowledge of relational databases (PostgreSQL, DB2, or Oracle), NoSQL databases (MongoDB, Cassandra, DynamoDB), SQL, or/and ORM technologies (JPA2, Hibernate, or Spring JPA)
    • Knowledge of Social Security Administration (SSA)

    At Leidos, we deliver innovative solutions through the efforts of our diverse and talented people who are dedicated to our customers' success. We empower our teams and contribute to our communities. Everything we do is built on a commitment to do the right thing for our customers, our people, and our community. Our Mission, Vision, and Values guide the way we do business. Every position we take is more rewarding when you know the why behind it.Know your work makes a difference to support those who need it most. If your passion is enabling life changing service to those around, you this is the place for you. Find your passion in a team environment where all members are valued regardless of contractor or employee status. We are excited for you to take your place in our Leidos Family.

    Are you an US citizen, US resident, or Visa candidate and think you might fit? We recommend you apply and start the conversation today Join us in supporting our SSA contracts in Woodlawn, Maryland.

    ITSSCII

    Original Posting Date:

    While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.

    Pay Range:

    Pay Range $101, $183,300.00

    The Leidos pay range for this job level is a general guideline onlyand not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.



  • Peacefestival Baltimore, United States

    Opportunities for · Reliability Engineer · in · Baltimore, MD · Remote positions only · Search completed. Found 0 matching records. · Opportunities for · Reliability Engineer · in · Baltimore, MD · Remote positions only · Search completed. Found 0 matching records. · Reli ...


  • W. R. Grace Baltimore, United States

    Requisition ID: 22915 · Built on talent, technology, and trust, Grace is a leading global supplier of catalysts and engineered materials. The company's two industry-leading business segments-Catalysts Technologies and Materials Technologies-provide innovative products, technolog ...


  • The Sotland Group Baltimore, United States

    The role of the Corporate Senior Reliability Engineer (Sr. RE) is to identify and mitigate risks that could adversely affect plant or business operations. The Sr. RE provides leadership and technical expertise in support of asset management strategies and associated practices, to ...


  • The Sotland Group Inc Baltimore, United States

    The role of the Corporate Senior Reliability Engineer (Sr. RE) is to identify and mitigate risks that could adversely affect plant or business operations. The Sr. RE provides leadership and technical expertise in support of asset management strategies and associated practices, to ...


  • The Sotland Group Inc Baltimore, United States

    The role of the Corporate Senior Reliability Engineer (Sr. RE) is to identify and mitigate risks that could adversely affect plant or business operations. The Sr. RE provides leadership and technical expertise in support of asset management strategies and associated practices, to ...


  • CACi Baltimore, United States

    Site Reliability EngineerJob Category: EngineeringTime Type: Full timeMinimum Clearance Required to Start: TS/SCI with PolygraphEmployee Type: RegularPercentage of Travel Required: NoneType of Travel: None* * * · What You'll Get to Do: · The Site Reliability Engineer provides su ...


  • SITEC Consulting LLC Baltimore, United States

    About SITEC · SITEC is an employee and customer focused Information Technology and Professional Services Firm specializing in design, development, and delivery of state-of-the-art technology solutions, as well as cybersecurity, software and systems engineering services. · Summa ...


  • W. R. Grace Curtis Bay, United States

    Requisition ID: 22915 · Built on talent, technology, and trust, Grace is a leading global supplier of catalysts and engineered materials. The company's two industry-leading business segments-Catalysts Technologies and Materials Technologies-provide innovative products, technolog ...


  • Amentum Baltimore, United States

    Amentum is hiring a · Reliability & Maintainability Engineer · to support an Army program office at · Aberdeen Proving Ground, MD . · ** This position is eligible for a hybrid telework schedule and requires a minimum of 2 days of onsite work a week. This is not a remote positi ...


  • Amches Baltimore, United States

    Position: Operational Reliability Engineer 2 · Job Description: · Collaborate with product teams to understand the performance-monitoring needs and requirements for approximately 20 applications within the organization. · Work closely with the monitoring team and product teams t ...


  • Booz Allen Hamilton Baltimore, United States

    Engineering to make a system more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network engineering, systems administration, or · sof tware development—if you have a passion for making systems better, we need yo ...


  • Akina Baltimore, United States

    TS/SCI w/Polygraph required · Approved for 60% telework · 06-11-SRE · Description: · DevOps refers to a software development concept that unites and brings together developers and IT staff. The DevOps approach involves consistent, small edits to software coding. This means fr ...


  • Florida Crystals / ASR Group Baltimore, United States

    ASR Group is the world's largest refiner and marketer of cane sugar, with an annual production capacity of more than 6 million tons of sugar. The company produces a full line of grocery, industrial, food service and specialty sweetener products. Across North America, ASR Group ow ...


  • ASR Group/Domino Sugar Baltimore, United States

    ASR Group is the worlds largest refiner and marketer of cane sugar, with an annual production capacity of more than 6 million tons of sugar. The company produces a full line of grocery, industrial, food service and specialty sweetener products. Across North America, ASR Group own ...


  • Platform Science Baltimore, United States

    San Diego, CA or Remote + 5% travel to San Diego headquarters for business reasons · At Platform Science, we're working to connect everything that moves. · Founded in 2015, we are an open IoT platform that partners with innovative fleets, application developers, vehicle manufactu ...


  • VITG Inc Baltimore, United States

    Job DescriptionJob Description · VITG is seeking a skilled Site Reliability Engineer (SRE) to ensure the reliability, availability, and performance of our enterprise services hosted across CSPs and on-prem by implementing best practices, automation, and monitoring. As an SRE, yo ...


  • Fearless Baltimore, United States

    Fearless is looking for a Site Reliability Engineer II to add to our diverse team of 250+ employees (and counting). · What You'll Be Doing · We're looking to change the world by building software with a soul, and we want your help. · The Site Reliability Engineer II implements ...


  • Fearless Baltimore, United States

    About Fearless Digital · Fearless Digital builds software with a soul. As a division inside Fearless, we're part of its digital services integrator model to unlock the power of organizations, people and tech. Our division designs, engineers, and delivers digital solutions to sol ...


  • Fearless Baltimore, United States

    About Fearless Digital · Fearless Digital builds software with a soul. As a division inside Fearless, we're part of its digital services integrator model to unlock the power of organizations, people and tech. Our division designs, engineers, and delivers digital solutions to sol ...


  • Clarity Innovations Baltimore, United States

    Clarity Innovations connects human creativity with emerging technology to design, develop, and deploy software that enhances mission success. Our focus is redefining the Government's relationship with technology by encouraging the use of DevSecOps and Agile methodologies, small-t ...