Jobs
>
Washington, D.C.

    Site Reliability Engineer - Washington DC, United States - Palantir Technologies

    Default job background
    Description
    Palantir builds the world's leading software for data-driven decisions and operations.

    By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more.

    We're looking for Site Reliability Engineers who can help us build, operate, and maintain high-performance, scalable, and reliable services for our production infrastructure, across both cloud & on-prem environments.

    Site Reliability Engineers combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges.

    You'll be the experts for the environments that you operate infrastructure in, helping partner teams build & configure their software to operate reliably within.

    We strongly believe in engineering teams being responsible for the operations of their services in production. Maintaining availability of cloud & physical Linux servers that power the Palantir platform in air-gapped production environments.
    Design, deploy, and operate infrastructure to support customer & product requirements via modern orchestration & monitoring platforms.
    Collaborate closely with product teams on requirements & SLOs for deploying software into air-gapped environments.
    Identifying, troubleshooting, and solving network & systems issues.
    Active US Security clearance, or eligibility and willingness to obtain a US Security clearance.

    Comfort with managing large scale production systems and technologies with configuration management, load balancing, monitoring & alerting infrastructure, and container orchestration.

    Demonstrated ability to continuously learn and work independently, making decisions with minimal supervision while working in secure facilities.

    Preferred Certifications:
    DOD 8570 IAT Level II or greater (CISSP, Sec+), Unix/Linux Computing Environment (e.g Linux+, RHCE).
    Proficiency with scripting in Python or Go is a plus.
    5+ years of experience with Linux system administration (RHEL or equivalent preferred).
    ~ Experience with cloud-based hosting platforms like AWS, Azure, or GCP and/or experience with hardware-based environments.
    ~ Familiarity with monitoring systems using tools like Prometheus and writing health checks.

    Paying attention to the needs of our community enables us to optimize our opportunities to grow and helps ensure many pathways to success at Palantir.

    Promoting health and well-being across all areas of Palantirians' lives is just one of the ways we're investing in our community.

    In keeping consistent with Palantir's values and culture, we believe employees are "better together" and in-person work affords the opportunity for more creative outcomes.

    Based on business need, there are a few roles that allow for "Remote" work on an exceptional basis. If the posting is specified as Onsite, you are required to work from an office.

    Palantir is committed to promoting a culture of diversity, equity, and inclusion and is proud to be an Equal Employment Opportunity and Affirmative Action employer.

    Palantir does not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

    Palantir is committed to working with and providing reasonable accommodations to qualified individuals with physical and mental disabilities. Palantir is committed to making the job application process accessible to everyone.

    If you are living with a disability (visible or not visible) and need to request a reasonable accommodation for any part of the application or hiring process, please reach out and let us know how we can help.

    #

  • Saint-Gobain

    Reliability Engineer

    3 weeks ago


    Saint-Gobain Washington DC, United States

    Consistent with CertainTeed Gypsum Vision, Mission, Values and Objectives, the Reliability Engineer identifies and quantifies Line 1 and Line 2 root cause failure(s), and drives permanent solutions to address systemic or chronic mechanical deficiencies to world class levels of sa ...


  • BuildSubmarines Washington DC, United States

    Reliability Engineer · KMS Solutions, LLC is a technical management/solutions company that specializes in engineering, analysis, and cyber security. Founded in 2005, KMS is a certified small business with over a decade and a half of experience supporting the Department of Defens ...

  • AES Corporation

    Reliability Engineer

    2 weeks ago


    AES Corporation Arlington, United States Full time

    AES's mission is to improve lives by accelerating a safer and greener energy future. We are a global, agile, cohesive organization with an employee engagement level akin to a startup company. AES businesses throughout the world are often recognized as great places to work. Our pe ...


  • KMS Solutions, LLC Washington DC, United States

    Reliability Engineer · KMS Solutions, LLC is a technical management/solutions company that specializes in engineering, analysis, and cyber security. Founded in 2005, KMS is a certified small business with over a decade and a half of experience supporting the Department of Defens ...


  • MetroStar Washington, United States

    As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users. As the ideal candidate, you'll use your extensive experience designing and implementing end-to-end continuous delivery pipelines and experience in AI/ML. You will also u ...


  • Talent Discovery Pros Washington DC, United States

    Job DescriptionJob Description · TITLE : Site Reliability Engineer · LOCATION : Washington DC · CLEARANCE REQUIRED : TS/SCI · WORK AUTHORIZATION : US Citizen · As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observa ...


  • ARCADIS Farragut, United States

    Job Description · Arcadis is the world's leading company delivering sustainable design, engineering, and consultancy solutions for natural and built assets. · We are more than 36,000 people, in over 70 countries, dedicated to improving quality of life. Everyone has an important ...


  • Guidehouse Washington DC, United States

    Job Family · IT Architecture/Cloud (Digital) Travel Required Up to 10% Clearance Required Ability to Obtain Public Trust What You Will Do Site Reliability Engineer (SRE) is the subject matter expert for infrastructure and cloud operations issues. The SRE will design full-sta ...


  • Radius Networks Washington DC, United States

    Who We Are · Radius Networks is the global leader in location technology solutions and powers some of the world's largest restaurant, grocery, retail and hospitality brands with its Flybuy platform. Flybuy helps companies deliver a frictionless customer experience, boost loyalty ...


  • Splunk Inc. Washington DC, United States

    Join us as we pursue our disruptive new vision to make machine data accessible, usable and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we're committed to our w ...


  • Splunk Inc. Washington DC, United States

    Splunk is here to build a safer and more resilient digital world. The world's leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. While customers love our technology, it's our people that make Splunk stand out ...


  • Mount Indie Washington, United States

    Job Description · Job DescriptionAs aSite Reliability Engineer (SRE), youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained wit ...


  • Cinder Washington DC, United States

    Full Time] Site Reliability Engineer at Cinder (United States) | BEAMSTART Jobs Site Reliability Engineer · Full Time · Remote Work · Stock Options · Cinder provides a cutting-edge investigation platform to protect the internet. Companies rely on our software to investigate a ...


  • Mount Indie Washington, United States

    Job Description · Job DescriptionMount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using ...


  • Veradigm Washington DC, United States

    Welcome to Veradigm Our Mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our Vision is a Connected Community of Health that spans continents and borders. With the larg ...


  • Veradigm LLC Washington, United States Full time

    · Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today's healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. A ...


  • Talent Discovery Pros Washington DC, United States

    As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government. · Our client firmly believes that exceptional technology servi ...


  • Mount Indie Washington, United States

    Job Description · Job DescriptionMount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using ...


  • Blue Rose Consulting Group, Inc. Washington, United States

    Job Description · Job DescriptionBlue Rose is seeking a Cloud Site Reliability Engineer (SRE) to support our work with multiple federal clients. This is a Hybrid role supporting our government clients across the Washington, DC area and is open to U.S. Citizens only. · Successful ...


  • Blue Rose Consulting Group, Inc. Washington DC, United States

    Job Description · Job Description · Blue Rose is seeking a Cloud Site Reliability Engineer (SRE) to support our work with multiple federal clients. This is a Hybrid role supporting our government clients across the Washington, DC area and is open to U.S. Citizens only . ...