Jobs
>
Remote

    Staff Site Reliability Engineer - Remote, United States - Cribl

    Cribl
    Cribl Remote, United States

    3 weeks ago

    Default job background
    Full time
    Description

    Cribl does differently.

    What does that mean? It means we are a serious company that doesn't take itself too seriously; and we're looking for people who love to get stuff done, and laugh a bit along the way. We're growing rapidly - looking for collaborative, curious, and motivated team members who are passionate about putting customers first. As a remote-first company we believe in empowering our employees to do their best work, wherever they are.

    As the data engine for IT and Security many of the biggest names in the most demanding industries trust Cribl to solve their most pressing data needs. Ready to do the best work of your career? Join the herd and unlock your opportunity.


    Why you'll love this role:

    Cribl Inc is seeking a Staff Site Reliability Engineer to join our mission to unlock the value of all observability data. Cribl provides users a new level of observability, intelligence and control over their real-time data. You will join a team of technical engineers who are committed to shipping only high-quality software and enjoying all the goat gifs the internet has to offer. This role is remote and you will be part of the engineering organization where you will contribute in our efforts to envision, create, deploy, test, and ship Cribl products.

    Not often do you get to be part of something that is fundamentally changing a technology. But here at Cribl we are building the next generation of software that puts our customers in full control of their observability data. If this is something that interests you, and you want to be truly at the center of the wheel helping make this work better every day. Then this opportunity might be something you have been waiting for to be a part of making a real impact.

    We are looking for Cloud Site Reliability Engineers and Developers at all levels at Cribl, who enjoy being in the thick of it. Fixing things at the operational side should always be the last resort, so our SRE engineers are involved from conception to design to development and all the way through production and beyond. You provide your creative input into all things Cloud, Scaling, Reliability, High Availability and much more.

    If reliability is your passion, and you have always had strong opinions on how to make things better and have the desire to build consensus around ideas. Then let's talk

    As An Active Member Of Our Team, You Will...

    • Engage with teams and improve service delivery and reliability across their entire lifecycle
    • Measure and monitor all production systems with an eye towards availability, latency and overall system health
    • Seek out the cause of errors and instability in our production cloud services and drive teams towards better operational excellence
    • Engage with product and platform teams to improve and evolve systems by lobbying for changes that improve reliability, resilience, and observability
    • Help Identify and drive down toil with creative innovation and automation
    • On-call responsibilities

    If You Got It, We Want It

    • Extensive experience with enterprise scale continuous delivery environments
    • 8+ years of experience with a DevOps or SRE job title
    • Development with in a Linux/Mac environment
    • Experience with Configuration Management Tools like Terraform (preferred) or Puppet, Chef, Ansible
    • Experience with sustainable incident response in a blameless environment
    • Knowledge of cloud platforms (prefer AWS) and container + orchestration technologies
    • Experience with APM and Observability and related tools such as, New Relic, Splunk, CloudWatch, Prometheus, Grafana/Kibana, Sentry etc.
    • Background in Linux Systems Engineering
    • Experience with Incident response related tools for instance, PagerDuty, FireHydrant, Blameless etc.
    • Comfortable with a high level of autonomy and working with a distributed team

    Preferred Qualifications

    • Knowledge of Cloud and application security
    • Strong knowledge of cloud design patterns for scale, data management, resiliency, etc.
    • A love for high quality and a knack for testing
    • Opinions about dashboards, metrics, and SLO's

    Salary Range

    ($144,000 - $278,000)

    The salary for this role is dependent on geographic location. The salary offered within the range described will be based on the individual candidate's job-related knowledge, skills, and experience. In addition to a competitive salary, Cribl also offers a generous benefits package which includes health, dental, vision, short-term disability, and life insurance, paid holidays and paid time off, a fertility treatment benefit, 401(k), equity, and eligibility for a discretionary company-wide bonus.

    #LI-MV1

    #Remote


    Bring Your Whole Self
    Diversity drives innovation, enables better decisions to support our customers, and inspires change for the better. We're building a culture where differences are valued and welcomed, and we work together to bring out the best in each other. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or any other applicable legally protected characteristics in the location in which the candidate is applying.

    Interested in joining the Cribl herd? Learn more about the smartest, funniest, most passionate goats you'll ever meet at



  • Coinbase Remote, United States

    We're a group of hard-working overachievers who are deeply focused on building the future of finance and Web3 for our users across the globe, whether they're trading, storing, staking or using crypto. Know those people who always lead the group project? We're a remote-first compa ...


  • Granicus Remote, United States Full time

    The Company Serving the People Who Serve the People Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and its constituents together. We are on a mission to support our cust ...


  • Granicus Remote, United States Full time

    The Company Serving the People Who Serve the People Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and its constituents together. We are on a mission to support our cust ...

  • Ocado Group

    Reliability Engineer

    3 weeks ago


    Ocado Group Remote, United States Full time

    Reliability EngineerAnywhere, USA (West Coast preference) · As a Reliability Engineer at Ocado, you will work directly with customers to maximize uptime of autonomous mobile robots and cloud-based software systems for warehouse automation. · Conduct comprehensive analyses to eva ...


  • Codeworks L.L.C Remote, United States

    Job Description · Job DescriptionNO C2C · From our client:Join a group of innovators who work with a customer-first mindset to build the very best services and experiences in financial services. We enable an ecosystem of large industry players and startups to delight our customer ...


  • Aurora Labs Remote, United States Full time

    About Us · Aurora Labs is the development company behind Aurora—the EVM blockchain that runs on the NEAR Protocol. We are also the developers of, and integration partner behind, Aurora Cloud—a suite of products that allow Web2 companies to capture the value of Web3. · We invite ...


  • Roadie Remote, United States Full time

    Roadie, a UPS Company, is a logistics management and crowdsourced delivery platform. Founded in 2014, Roadie offers businesses fast, flexible and asset-light logistics solutions for last-mile delivery. Roadie enables local delivery to more than 95% of U.S. households by providing ...


  • FiscalNote Remote, United States Full time

    About the Position · The Site Reliability Engineer (SRE) will report to the Lead DevOps Engineer. The ideal candidate will have a proven track record of 3+ years in Site Reliability Engineering. This role is responsible for ensuring the reliability, availability, security, and pe ...


  • Nozomi Networks Remote, United States Full time

    · Now is an amazing time to join Nozomi Networks as we build the future of OT and IoT Cybersecurity. We have hundreds of customers in more than 30 countries and we're just scratching the surface. · As we expand our product portfolio and global presence, our Engineering departmen ...


  • Podium Remote, United States Full time

    · At Podium, our mission is to help local businesses win. Our lead conversion platform, powered by AI and integrations, helps local businesses convert leads faster, communicate easier, and make more sales. Every day, thousands of local businesses utilize our review management, c ...


  • OPENLANE Remote, United States Full time

    Who We Are: · At OPENLANE we make wholesale easy so our customers can be more successful. · We're a technology company building the world's most advanced-and uncomplicated-digital marketplace for used vehicles. · We're a data company helping customers buy and sell smarter with cl ...


  • Sunrun Remote, United States Full time

    Everything we do at Sunrun is driven by a determination to transform the way we power our lives. We know that starts at the individual employee level. We strive to foster an environment you can thrive in through our commitment to diversity, inclusion and belonging. · Objective: · ...


  • AlphaSense Remote, United States Full time

    Location: Remote, United States · Reports to: Engineering Manager, SRE · About AlphaSense: · AlphaSense is a market intelligence platform used by the world's leading companies and financial institutions. Since 2011, our AI-based technology has helped professionals make smarter ...


  • Arcadia (DC) Remote, United States Full time

    Who We Are · Arcadia is the technology company empowering energy innovators and consumers to fight the climate crisis. Our software and APIs are revolutionizing an industry held back by outdated systems and institutions by creating unprecedented access to the data and clean ener ...


  • Modern Health Remote, United States Full time

    · Modern Health · Modern Health is a mental health benefits platform for employers. We are the first global mental health solution to offer employees access to one-on-one, group, and self-serve digital resources for their emotional, professional, social, financial, and physical ...


  • SentinelOne Remote, United States Full time

    · About Us: · SentinelOne is defining the future of cybersecurity through our XDR platform that automatically prevents, detects, and responds to threats in real-time. Singularity XDR ingests data and leverages our patented AI models to deliver autonomous protection. With Sentine ...


  • Edge & Node Remote, United States Full time

    Edge & Node stands as the revolutionary vanguard of web3, a vision of a world powered by individual autonomy, shared self-sovereignty and limitless collaboration. Established by trailblazers behind The Graph, we're on a mission to make The Graph the internet's unbreakable foundat ...


  • Lumin Digital Remote, United States Full time

    Our Site Reliability Engineers (SRE) are good developers with an operations mindset. They enjoy reducing or completely eliminating manual tasks, are excellent problem solvers, and know automation is the key to operating a large-scale system. · SREs make sure that our application ...


  • Zocdoc Remote, United States Full time

    · Our Mission · Healthcare should work for patients, but it doesn't. In their time of need, they call down outdated insurance directories. Then wait on hold. Then wait weeks for the privilege of a visit. Then wait in a room solely designed for waiting. Then wait for a surprise b ...


  • Sojern Remote, United States Full time

    Position Summary: · Sojern is looking for a Senior Site Reliability Engineer in the US to collaborate with Software Engineering teams located primarily in the Pacific Time Zone. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with ...