Senior/Staff Site Reliability Engineer - San Francisco

Only for registered members San Francisco, United States

1 week ago

Default job background
$230,000 - $280,000 (USD)
We are building a platform that makes it easier for patients to find the right providers access the right medications take control of their health with transparency trust Over past few years we have experienced rapid growth by combining operational excellence clinical expertise innovative technology deliver care more human intuitive effective We believe future healthcare personal building technology power it At Mochi Health you will join team values inclusivity collaboration bold thinking opportunity do most meaningful work career Build AI-driven APM incident management system alert page learns Design feedback loops human-loop RLHF-style guardrails automation let reliability posture improve time Own systems workflows turn incidents intelligence automated triage root cause analysis remediation bug-fix proposals PRs test runs staged rollouts issues code-level If youre excited idea building self-improving SRE copilot this job you What Youll Do Build AI-driven SRE platform ingests telemetry logs/metrics/traces deploy events incident artifacts detect anomalies summarize failures propose mitigations Design human-loop learning loop RLHF-style so system gets better every incident capturing decisions outcomes postmortems training/evaluation data Create safe auto-remediation capabilities runbook execution automated rollbacks feature-flag actions strong guardrails auditability progressive rollout controls Build tooling propose bug fixes generate well-scoped PRs run tests support canary releases-clear handoff approval flows Define operationalize SLOSLIs error budgets critical user journeys patient onboarding provider workflows pharmacy fulfillment billing etc Level observability end-end alert quality dashboarding tracing standards unknown unknown detection Lead incident response excellence on-call improvements incident command blameless postmortems driving systemic fixes reduce repeat failures Partner product engineering teams reduce toil improve reliability architecture load testing resilience testing capacity planning Establish reliability standards patterns org golden signals deployment safety dependency management fault isolation Who You Are Years experience SRE /platform infrastructure engineering track record owning production reliability scale Deep experience operating Kubernetes-based systems cloud AWS preferred networking autoscaling rollout strategies mitigation Strong software ability debug production issues services understand failure modes contribute code Python/Go/TypeScript all great Expert-level grasp observability incident response metrics logs tracing alerting design postmortem-driven improvements Comfortable building automation touches production obsessive safety least-privilege access audit logs approvals canaries rollback Excited AI tooling agentic workflows LLM-based triage summarization retrieval runbooks/postmortems evaluation harnesses feedback loops Strong communication collaboration skills lead during incidents write clearly align teams around reliability priorities Startup mindset move fast take ownership love turning ambiguity shipped systems Excited work-person Team San Francisco Downtown SF HQ designed deep-work casual collaboration Nice Haves Experience building LLM-powered internal tools Incident Copilots Automated Debugging RAG Docs Runbooks familiarity security compliance regulated environments HIPAA SOC Audit Requirements PHI handling Experience chaos engineering game days resilience testing CI CD Guardrails Progressive Delivery Systems Canaries Automated Verification Safe Rollout Policies Prior Work Distributed Tracing Standards OpenTelemetry Service Meshes Large-Scale Event-Driven Systems Our Core Technologies Include AWS Kubernetes Postgres Redis Python SQL Whatever need build world-class reliability platform Life at Mochi Daily Meals Espresso Bar Breakfast lunch dinner weekday barista keeps espresso matcha flowing day Pre-Tax Commuter Perks Save transit parking pre-tax commuter benefits Top-Market Compensation Offer competitive salaries generous equity packages share success create High-Impact Work Help shape future digital healthcare your work here directly improves lives scales nationwide World-Class Team Collaborate teammates Tesla SpaceX Citadel Harvard IIT value excellence humility empathy equal measure Comprehensive Benefits k match generous time off life insurance high-quality medical dental vision plans Mochi Health Membership Cover monthly subscription fee same care patients medications included Time Recharge Enjoy unlimited PTO generous company holidays true flexibility trust take time need rest reset thrive Wellness First Weekly mindfulness sessions group workouts fitness perks physical mental health top priority Team Socials Community Make connect regular socials happy hours spontaneous events stocked kitchen hurt either Downtown SF HQ Designed steps BART Muni food Steps from BART Muni Great Food Location San Francisco Country Code US Workplace Policy In-person company based San Francisco CA team works together person five days week foster collaboration innovation strong connections Believe face-to-face interaction builds culture excellence allow deliver best outcomes patients providers serve For office-based roles standard schedule Monday Friday am pm Actual hours vary business needs role responsibilities All employees receive meal rest breaks accordance state local laws For designated remote roles in-person policy apply Equal Opportunity Employer Provide employment opportunities applicants employees discrimination race religion color national origin gender pregnancy childbirth related medical conditions sexual orientation gender identity gender expression age protected veteran disability status legally protected characteristic Prohibit form discrimination harassment This policy applies all terms conditions employment including hiring Candidate Privacy Notice Please review Candidate Privacy Notice Accommodations Comply Americans Disabilities Act ADA as amended ADA Amendments Act applicable state local laws Will reasonably accommodate qualified individuals disability application process throughout employment required law If need assistance accommodations due disability please contact us
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Only for registered members San Francisco

    Cisco Silicon One ASICs are transforming the Future of the Internet. · Owning reliability test plans for new products. · Supporting High power Burn In, biased HAST and ESD/LU bring-up and debug for reliability qualification and evaluation. · ...

  • Only for registered members San Francisco

    We're growing incredibly fast and need someone who thrives in a dynamic, high-pressure environment to work side-by-side with the our engineering and leadership team. · Write tests, monitoring, and evaluating the health of our platform. · Design, write, and maintain automated test ...

  • Only for registered members San Francisco $130,000 - $190,000 (USD)

    We are making sure that when businesses build AI agents the experience of doing so doesn't suck.Our team is a group of ex-athletes founders and builders with low egos and a high belief that life not about taking the easy road but challenging ourselves to find the most we can be. ...

  • Only for registered members San Francisco

    We are making sure that when businesses build AI agents the experience of doing so doesn't suck. We're growing fast and need someone who thrives in a dynamic environment to work side-by-side with our engineering team.Your job is to write tests, monitoring, and evaluating the heal ...

  • Only for registered members San Francisco

    Verrus is redefining the future of data centers with an emphasis on innovation, flexibility, and sustainability. · ...

  • Only for registered members San Francisco $175,000 - $250,000 (USD)

    We're looking for engineers who are excited to improve the reliability of complex systems and enjoy digging into how things work. · Bring a generalist mindset and are comfortable working across infrastructure layers—from compute and networking to storage, databases, and app runti ...

  • Only for registered members San Francisco

    Join a stealth-mode startup building out their AI and cloud platform powered by thousands of H100s H200s and B200s ready to go for experimentation full-scale model training or inference. · ...

  • Only for registered members San Francisco

    We are looking for a Senior Site Reliability Engineer (SRE) to build the reliability foundation for a mission-critical healthcare platform. · This is not a "keep the lights on" SRE role. You'll own reliability end-to-end, · define what good looks like: SLIs, SLOs, incident respon ...

  • Only for registered members San Francisco Full time

    We're hiring an SRE to join our engineering team at Plenful. · You'll bring strong technical judgment, calm problem solving during incidents and a practical approach to improving reliability. · ...

  • Only for registered members San Francisco, CA

    We are building a new kind of general frontier model designed for real-world reliability, decision-making, and autonomy in production. · Create robust infrastructure for serving inference across multiple cloud providers · Work to ensure inference infrastructure and services are r ...

  • Only for registered members San Francisco

    + Reliability expert to maintain and enhance the stability and scalability of our rapidly evolving infrastructure. · + Design and implement solutions to ensure the scalability of our infrastructure. · + Build and maintain load, chaos and synthetic testing software. · Job summary: ...

  • Only for registered members San Francisco Full time

    We are seeking an experienced Site Reliability Engineer to join our Platform Engineering team in the Bay Area. · Design and implement scalable infrastructure on Google Cloud Platform. · Own critical platform services. · ...

  • Only for registered members San Francisco Full time

    Join the engineering teams that bring OpenAI's ideas safely to the world as a Software Engineer in Reliability role at OpenAI in San Francisco. · ...

  • Only for registered members San Francisco, CA

    We are seeking an experienced Site Reliability Engineer to join our Platform Engineering team in the Bay Area. · ...

  • Only for registered members San Francisco

    We are looking for a Site Reliability Engineer (SRE) with strong experience in Microsoft Azure cloud services and Java-based application development.This role blends software engineering and operations, with a focus on building reliable, · scalable,and highly available systems. · ...

  • Only for registered members San Francisco

    We are looking for a Site Reliability Engineer to keep all user-facing services and production systems running smoothly. You will be responsible for participating in on-call rotation, building and running infrastructure with Ansible, Terraform, and Kubernetes, building monitoring ...

  • Only for registered members San Francisco $150,000 - $200,000 (USD)

    As a Site Reliability Engineer (SRE) at Together AI you are responsible for keeping all user-facing services and production systems running smoothly.You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles operational discipline an ...

  • Only for registered members San Francisco

    Patreon powers creators to do what they love and get paid by the people who love what they do.We're continuing to invest heavily in building the best creator platform with the best team in the creator economy. · ...

  • Only for registered members San Francisco

    we are seeking an experienced site reliability engineer to join our platform engineering team in the bay area you ll be instrumental in ensuring the high availability performance and scalability of coderabbit s ai powered code review platform this role sits at the intersection of ...

  • Air Apps San Francisco

    We are a family-founded company on a mission to create the world's first AI-powered Personal & Entrepreneurial Resource Planner (PRP), and we need your passion and ambition to help us change how people plan, work, and live. · ...

  • Only for registered members San Francisco Full time

    Mercor is creating a new category of work where expertise powers AI advancement. · ...