Jobs
>
Remote

    Principal Site Reliability Engineer - Remote, United States - Zocdoc

    Zocdoc
    Zocdoc Remote, United States

    1 month ago

    Default job background
    Full time
    Description

    Our Mission

    Healthcare should work for patients, but it doesn't. In their time of need, they call down outdated insurance directories. Then wait on hold. Then wait weeks for the privilege of a visit. Then wait in a room solely designed for waiting. Then wait for a surprise bill. In any other consumer industry, the companies delivering such a poor customer experience would not survive. But in healthcare, patients lack market power. Which means they are expected to accept the unacceptable.

    Zocdoc's mission is to give power to the patient. To do that, we've built the leading healthcare marketplace that makes it easy to find and book in-person or virtual care in all 50 states, across +200 specialties and +12k insurance plans. By giving patients the ability to see and choose, we give them power. In doing so, we can make healthcare work like every other consumer sector, where businesses compete for customers, not the other way around. In time, this will drive quality up and prices down.

    We're 15 years old and the leader in our space, but we are still just getting started. If you like solving important, complex problems alongside deeply thoughtful, driven, and collaborative teammates, read on.


    Your Impact on our Mission:

    Zocdoc is looking for a Principal Site Reliability Engineer to help drive the reliability, resiliency, observability, availability, and scalability of our systems and services. You'll be challenged to drive continuous improvements to uptime and performance for our patients and providers in a constantly evolving environment. You'll work with and provide subject matter expertise for our AWS cloud-based environments, distributed systems, monolith, and microservices. We're looking for someone who loves challenging the status quo and strives to make everything they touch easier, faster, and more robust.


    You'll enjoy this role if you are...

    • Passionate about ensuring complex systems never skip a beat
    • Pragmatic in your decision making day-to-day
    • Motivated to learn new technologies, design patterns, and work in the cloud
    • Comfortable with failures and outages and believe in blameless post-mortems
    • Excited to work in a highly collaborative environment with diverse individuals
    • Autonomous, individually accountable, and comfortable working in a remote environment
    • A believer that diverse and inclusive teams and cultures are non-negotiable


    Your day to day is...

    • Analyzing and decomposing complex distributed system challenges to drive sound design decisions targeted toward reliability, availability, scalability, and resiliency
    • Supporting our large product engineering org with their scaling, performance, and uptime needs as well as helping diagnose and debug production related issues
    • Monitoring and maintaining complex cloud-based infrastructure, systems, and services, and ensuring its uptime in order to enable millions of patients to get the care they need
    • Automating and codifying our tooling, processes, and infrastructure to speed up development and make them repeatable and error-proof
    • Analyzing and performance tuning systems, code, and networking for scaling and optimal operation
    • Mentoring your peers and colleagues by giving thoughtful feedback, with the notion that helping others means learning and growing yourself


    You'll be successful in this role if you have...

    • A Bachelor's degree in Computer Science, Computer Engineering, or equivalent engineering experience
    • 8+ years of progressive engineering experience in Site Reliability or adjacent disciplines (DevOps, Platform, Backend Engineering, etc.)
    • 5+ years of experience in deploying, managing, and supporting modern cloud-based environments and infrastructure like AWS/Azure/GCP, Docker, Kubernetes, IaC, etc.
    • 4+ years of production on-call experience in a 24/7 cloud-based environment
    • Exceptional troubleshooting, debugging, and diagnostic skills for cloud and web-based technologies using industry standard observability tooling and frameworks
    • Experience with edge technologies such as load balancers, reverse proxies, web application firewalls, routing, etc.
    • Deep understanding of web applications and ability to troubleshoot HTTP/HTTPS, TLS, DNS, TCP/IP, and similar protocols



    Benefits:

    • Flexible, hybrid work environment
    • Unlimited PTO
    • 100% paid employee health benefit options
    • Employer funded 401(k) match
    • L&D offerings + a free LinkedIn learning account
    • Corporate wellness programs with Headspace and Peloton
    • Sabbatical leave (for employees with 5+ years of service)
    • Competitive parental leave
    • Cell phone reimbursement


    Zocdoc is committed to fair and equitable compensation practices. Salary ranges are determined through alignment with market data. Base salary offered is determined by a number of factors including the candidate's experience, qualifications, and skills. Certain positions are also eligible for variable pay and/or equity; your recruiter will discuss the full compensation package details.

    Remote Base Salary Range

    $184,000—$260,000 USD



    About us
    Zocdoc is the country's leading digital health marketplace that helps patients easily find and book the care they need. Each month, millions of patients use our free service to find nearby, in-network providers, compare choices based on verified patient reviews, and instantly book in-person or video visits online. Providers participate in Zocdoc's Marketplace to reach new patients to grow their practice, fill their last-minute openings, and deliver a better healthcare experience. Founded in 2007 with a mission to give power to the patient, our work each day in pursuit of that mission is guided by our six core values. Zocdoc is a private company backed by some of the world's leading investors, and we believe we're still only scratching the surface of what we plan to accomplish.

    Zocdoc is a mission-driven organization dedicated to building teams as diverse as the patients and providers we aim to serve. In the spirit of one of our core values - Together, Not Alone, we are a company that prides itself on being highly collaborative, and we believe that diverse perspectives, experiences and contributors make our community and our platform better. We're an equal opportunity employer committed to providing employees with a work environment free of discrimination and harassment. Applicants are considered for employment regardless of race, color, ethnicity, ancestry, religion, national origin, gender, sex, gender identity, gender expression, sexual orientation, age, citizenship, marital or parental status, disability, veteran status, or any other class protected by applicable laws.
    Job Applicant Privacy Notice



  • Granicus Remote, United States Full time

    The Company Serving the People Who Serve the People Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and its constituents together. We are on a mission to support our cust ...


  • Granicus Remote, United States Full time

    The Company Serving the People Who Serve the People Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and its constituents together. We are on a mission to support our cust ...


  • Ocado Group Remote, United States Full time

    Reliability EngineerAnywhere, USA (West Coast preference) · As a Reliability Engineer at Ocado, you will work directly with customers to maximize uptime of autonomous mobile robots and cloud-based software systems for warehouse automation. · Conduct comprehensive analyses to eva ...


  • Codeworks L.L.C Remote, United States

    Job Description · Job DescriptionNO C2C · From our client:Join a group of innovators who work with a customer-first mindset to build the very best services and experiences in financial services. We enable an ecosystem of large industry players and startups to delight our customer ...


  • Cribl Remote, United States Full time

    · Cribl does differently. · What does that mean? It means we are a serious company that doesn't take itself too seriously; and we're looking for people who love to get stuff done, and laugh a bit along the way. We're growing rapidly - looking for collaborative, curious, and mot ...


  • Coinbase Remote, United States

    We're a group of hard-working overachievers who are deeply focused on building the future of finance and Web3 for our users across the globe, whether they're trading, storing, staking or using crypto. Know those people who always lead the group project? We're a remote-first compa ...


  • Fireblocks Remote, United States Full time

    · The world of digital assets is accelerating in speed, magnitude, and complexity, opening the door to new ways for leveraging the blockchain. Fireblocks' platform and network provide the simplest and most secure way for companies to work with digital assets and it trusted by so ...


  • Aurora Labs Remote, United States Full time

    About Us · Aurora Labs is the development company behind Aurora—the EVM blockchain that runs on the NEAR Protocol. We are also the developers of, and integration partner behind, Aurora Cloud—a suite of products that allow Web2 companies to capture the value of Web3. · We invite ...


  • FiscalNote Remote, United States Full time

    About the Position · The Site Reliability Engineer (SRE) will report to the Lead DevOps Engineer. The ideal candidate will have a proven track record of 3+ years in Site Reliability Engineering. This role is responsible for ensuring the reliability, availability, security, and pe ...


  • Nozomi Networks Remote, United States Full time

    · Now is an amazing time to join Nozomi Networks as we build the future of OT and IoT Cybersecurity. We have hundreds of customers in more than 30 countries and we're just scratching the surface. · As we expand our product portfolio and global presence, our Engineering departmen ...


  • Roadie Remote, United States Full time

    Roadie, a UPS Company, is a logistics management and crowdsourced delivery platform. Founded in 2014, Roadie offers businesses fast, flexible and asset-light logistics solutions for last-mile delivery. Roadie enables local delivery to more than 95% of U.S. households by providing ...


  • Podium Remote, United States Full time

    · At Podium, our mission is to help local businesses win. Our lead conversion platform, powered by AI and integrations, helps local businesses convert leads faster, communicate easier, and make more sales. Every day, thousands of local businesses utilize our review management, c ...


  • OPENLANE Remote, United States Full time

    Who We Are: · At OPENLANE we make wholesale easy so our customers can be more successful. · We're a technology company building the world's most advanced-and uncomplicated-digital marketplace for used vehicles. · We're a data company helping customers buy and sell smarter with cl ...


  • Brooksource Remote, United States

    Contract to Hire * · *Remote (EST Time Zone)* · Our Fortune 15 health care client is seeking a Site Reliability Engineer (SRE) to assist them as they fully transition to the cloud. You will play a critical role in ensuring the reliability, scalability, and performance of their sy ...


  • Sunrun Remote, United States Full time

    Everything we do at Sunrun is driven by a determination to transform the way we power our lives. We know that starts at the individual employee level. We strive to foster an environment you can thrive in through our commitment to diversity, inclusion and belonging. · Objective: · ...


  • AlphaSense Remote, United States Full time

    Location: Remote, United States · Reports to: Engineering Manager, SRE · About AlphaSense: · AlphaSense is a market intelligence platform used by the world's leading companies and financial institutions. Since 2011, our AI-based technology has helped professionals make smarter ...


  • Arcadia (DC) Remote, United States Full time

    Who We Are · Arcadia is the technology company empowering energy innovators and consumers to fight the climate crisis. Our software and APIs are revolutionizing an industry held back by outdated systems and institutions by creating unprecedented access to the data and clean ener ...


  • Sojern Remote, United States Full time

    Position Summary: · Sojern is looking for a Senior Site Reliability Engineer in the US to collaborate with Software Engineering teams located primarily in the Pacific Time Zone. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with ...


  • SentinelOne Remote, United States Full time

    · About Us: · SentinelOne is defining the future of cybersecurity through our XDR platform that automatically prevents, detects, and responds to threats in real-time. Singularity XDR ingests data and leverages our patented AI models to deliver autonomous protection. With Sentine ...


  • Edge & Node Remote, United States Full time

    Edge & Node stands as the revolutionary vanguard of web3, a vision of a world powered by individual autonomy, shared self-sovereignty and limitless collaboration. Established by trailblazers behind The Graph, we're on a mission to make The Graph the internet's unbreakable foundat ...