    Site Reliability Engineer - Washington DC, United States - Splunk Inc.

    Description

    Splunk is here to build a safer and more resilient digital world. The world's leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. While customers love our technology, it's our people that make Splunk stand out as an amazing career destination and why we've won so many awards as a best place to work. If you become a Splunker, we want your whole, authentic self, what we call your "million data points". So bring your work experience, problem-solving skills and talent, of course, but also bring your joy, your passion and all the things that make you, you.

    Role Summary

    The Cloud organization at Splunk focuses on building and maintaining robust and resilient platform solutions for SaaS hosting of Splunk's enterprise software. Our main technologies are cloud-infrastructure based, focusing on Puppet and Terraform. It is the responsibility of the TechOps team to monitor and resolve issues that affect the availability and performance of Splunk for our cloud customers 24/7. As the authority on our customers' experience, the TechOps team provides a backstop for all staff on any questions or issues that arise during their shift related to their technical area of expertise. The TechOps engineers lead their respective queue and ensure all requests coming into that queue are addressed in a timely manner.

    What you'll get to do

    4 x 10 shifts: Sunday - Wednesday or Wednesday - Saturday / Nights, weekends and swing-shift.

    We are looking for a TechOps Engineer to help maintain, contribute to, and improve the next generation of our large-scale Cloud offering. You will be working with providers and supporting the infrastructure that powers Splunk's cloud offering.

    • Provide technical support for the Splunk Cloud fleet
    • Perform impact assessments and problem solving according to established procedures
    • Document issues, remediation steps, and help with follow up problem management
    • Lead support cases and ensure queue management
    • Lead hierarchical and functional requests. Communicate with TechOps engineers as well as business partners around Cloud through email, chat, and in person.
    • Assist other TechOps engineers on your shift on complex tasks
    • Represent the TechOps team in meetings/process changes and make recommendations on new procedures/processes
    • Use the internal tools to restore normal service operations as quickly as possible to minimize the impact to business operations during escalated incidents
    • Lead by example and drive the core values of the company
    • Always ensure a quality customer experience
    • You love large, complex systems. You have experience working on distributed systems or a passion for finding edge cases that appear at scale. You're interested in how to take a small one-off task and implement it across several thousand machines at once.
    • "Can this process be automated?" is a question you constantly ask yourself
    • Data drives your decisions and excites you - you make decisions based on numbers rather than assumptions. If an issue arises, you strive to be alerted before our customers notice.
    • Ability to work nights, weekends, and swing shifts.
    • This is a FULLY REMOTE, US-based position. We encourage applicants anywhere in the US. You must be a US Citizen working on US soil to be considered. Candidates must be able to support FedRAMP High.
    Must-have Qualifications
    • Requires a minimum of 2 years of related experience with a technical Bachelor's degree; or equivalent practical experience
    • Ability to obtain a favorably adjudicated Single Scope Background Investigation (SSBI) and SECRET clearance
    Nice-to-have Qualifications
    We've taken special care to separate the must-have qualifications from the nice-to-haves. "Nice-to-have" means just that: Nice. To. Have. So, don't worry if you can't check off every box. We're not hiring a list of bullet points–we're interested in the whole you.
    • Experience monitoring and troubleshooting Splunk environments.
    • Experience in administering or architecting distributed Splunk environments.
    • Experience with the development and deployment of a hosted cloud environment, preferably AWS.
    • Experience with Python, Golang or Shell for scripting, and Git or similar version control systems.
    • Experience supporting customer facing SaaS infrastructure or similar cloud related services.
    • Have an understanding of systems programming (network stack, file system, OS services) and networking (L2 vs. L3, network architecture, VLANs, etc.)
    • Knowledge of standard methodologies related to security, performance, and disaster recovery.
    • Have knowledge of the Linux Operating System
    Splunk is an Equal Opportunity Employer
    At Splunk, we believe creating a culture of belonging isn't just the right thing to do; it's also the smart thing. We prioritize diversity, equity, inclusion, and belonging to ensure our employees are supported to bring their best, most authentic selves to work where they can thrive. Qualified applicants receive consideration for employment without regard to race, religion, color, national origin, ancestry, sex, gender, gender identity, gender expression, sexual orientation, marital status, age, physical or mental disability or medical condition, genetic information, veteran status, or any other consideration made unlawful by federal, state, or local laws. We consider qualified applicants with criminal histories, consistent with legal requirements.

    Note:

    Base Pay Range

    SF Bay Area, Seattle Metro, and New York City Metro Area

    Base Pay Range: $119, ,900.00 per year

    California (excludes SF Bay Area), Washington (excludes Seattle Metro), Washington DC Metro, and Massachusetts

    Base Pay Range: $107, ,510.00 per year

    All other cities and states excluding California, Washington, Massachusetts, New York City Metro Area and Washington DC Metro Area.

    Base Pay Range: $95, ,120.00 per year

    Splunk provides flexibility and choice in the working arrangement for most roles, including remote and/or in-office roles. We have a market-based pay structure which varies by location. Please note that the base pay range is a guideline and for candidates who receive an offer, the base pay will vary based on factors such as work location as set out above, as well as the knowledge, skills and experience of the candidate. In addition to base pay, this role is eligible for incentive compensation and may be eligible for equity or long-term cash awards.

    Benefits are an important part of Splunk's Total Rewards package. This role is eligible for a competitive benefits package which includes medical, dental, vision, a 401(k) plan and match, paid time off and much more. Learn more about our comprehensive benefits and wellbeing offering at .



  • Saint-Gobain Washington DC, United States

    Consistent with CertainTeed Gypsum Vision, Mission, Values and Objectives, the Reliability Engineer identifies and quantifies Line 1 and Line 2 root cause failure(s), and drives permanent solutions to address systemic or chronic mechanical deficiencies to world class levels of sa ...


  • ALTA IT Services Washington, United States

    Site Reliability Engineering (SRE) Lead · 100% Remote · US Citizenship required per government contract Must be able to obtain a DHS Public Trust clearance As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users. As the ideal ...


  • KMS Solutions, LLC Washington DC, United States

    Reliability Engineer · KMS Solutions, LLC is a technical management/solutions company that specializes in engineering, analysis, and cyber security. Founded in 2005, KMS is a certified small business with over a decade and a half of experience supporting the Department of Defens ...


  • BuildSubmarines Washington DC, United States

    Reliability Engineer · KMS Solutions, LLC is a technical management/solutions company that specializes in engineering, analysis, and cyber security. Founded in 2005, KMS is a certified small business with over a decade and a half of experience supporting the Department of Defens ...


  • AES Corporation Arlington, United States Full time

    AES's mission is to improve lives by accelerating a safer and greener energy future. We are a global, agile, cohesive organization with an employee engagement level akin to a startup company. AES businesses throughout the world are often recognized as great places to work. Our pe ...


  • Cinder Washington DC, United States

    [Full Time] Site Reliability Engineer at Cinder (United States) | BEAMSTART Jobs · Site Reliability Engineer · Cinder United States · Date Posted · 31 Oct, 2022 · Work Location · Washington, DC, United States · Salary Offered · $110 — $220 yearly · Job Type · Full Time ...


  • Palantir Technologies Washington DC, United States

    Palantir builds the world's leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. · We ...


  • ARCADIS Farragut, United States

    Job Description · Arcadis is the world's leading company delivering sustainable design, engineering, and consultancy solutions for natural and built assets. · We are more than 36,000 people, in over 70 countries, dedicated to improving quality of life. Everyone has an important ...


  • Talent Discovery Pros Washington DC, United States

    Job Description · TITLE: Site Reliability Engineer · LOCATION: Washington DC · CLEARANCE REQUIRED: TS/SCI · WORK AUTHORIZATION: US Citizen · As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observa ...


  • StaffWorthy Inc. Washington DC, United States

    We are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value the ...


  • Splunk Inc. Washington DC, United States

    Join us as we pursue our disruptive new vision to make machine data accessible, usable and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we're committed to our w ...


  • Guidehouse Washington DC, United States

    Job Family · IT Architecture/Cloud (Digital) Travel Required Up to 10% Clearance Required Ability to Obtain Public Trust What You Will Do Site Reliability Engineer (SRE) is the subject matter expert for infrastructure and cloud operations issues. The SRE will design full-sta ...


  • Radius Networks Washington DC, United States

    Who We Are · Radius Networks is the global leader in location technology solutions and powers some of the world's largest restaurant, grocery, retail and hospitality brands with its Flybuy platform. Flybuy helps companies deliver a frictionless customer experience, boost loyalty ...


  • Mount Indie Washington, United States

    Job Description · Mount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using ...


  • Veradigm Washington DC, United States

    Welcome to Veradigm Our Mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our Vision is a Connected Community of Health that spans continents and borders. With the larg ...


  • Cinder Washington DC, United States

    [Full Time] Site Reliability Engineer at Cinder (United States) | BEAMSTART Jobs · Site Reliability Engineer · Full Time · Remote Work · Stock Options · Cinder provides a cutting-edge investigation platform to protect the internet. Companies rely on our software to investigate a ...


  • Blue Rose Consulting Group, Inc. Washington, United States

    Job Description · Blue Rose is seeking a Cloud Site Reliability Engineer (SRE) to support our work with multiple federal clients. This is a Hybrid role supporting our government clients across the Washington, DC area and is open to U.S. Citizens only. · Successful ...


  • Blue Rose Consulting Group, Inc. Washington DC, United States

    Job Description · Blue Rose is seeking a Cloud Site Reliability Engineer (SRE) to support our work with multiple federal clients. This is a Hybrid role supporting our government clients across the Washington, DC area and is open to U.S. Citizens only. ...


  • Talent Discovery Pros Washington DC, United States

    As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government. · Our client firmly believes that exceptional technology servi ...