Jobs
>
San Francisco

    Site Reliability Engineer - San Francisco, United States - Cypress Human Capital Management, LLC

    Default job background
    Description
    Site Reliability Engineer (Grafana)

    Responsibilities

    Collaborate with Service Owners and Observability Leaders to develop a strategy for monitoring the technology stack using Grafana.
    Initiate data ingestion by deploying Telegraf and exporters (if necessary), utilizing discovery to feed data into Grafana Mimir.
    Establish initial alerting by creating alert rules and enabling self-service alerting in Grafana.
    Create initial dashboards to monitor the health and capacity of services.
    Transition responsibilities to the service owner by providing comprehensive documentation and necessary training.

    Required Skills

    3+ years of experience as a Site Reliability Engineer with proficiency in the Grafana platform.
    Expertise in Grafana, particularly in dashboard best practices and writing PromQL to create widgets and alert rules.
    Proficiency in Grafana Mimir or equivalent systems (e.g., Thanos, Cortex).
    Extensive knowledge of Prometheus.
    Advanced skills in Telegraf.
    Expertise in Ansible, including writing playbooks and deploying/configuring services.
    Proficient in using Git (GitLab) for self-service as code.
    Broad expertise in various technology stacks and transitioning their monitoring to the Grafana ecosystem.
    Experience with modern operating systems, specifically CentOS and Ubuntu.


    Pay Rate :
    $60-$70/hour

    #J-18808-Ljbffr


  • BHO Tech San Francisco, United States Full time

    Job Description · We're the driverless car company. We're building the world's best autonomous vehicles to safely connect people to the places, things, and experiences they care about. · Our vehicles are on the road in California, Arizona, and Michigan navigating some of the mos ...


  • PicnicHealth San Francisco, United States

    [Full Time] Site Reliability Engineer at PicnicHealth (United States) | BEAMSTART Jobs · Site Reliability Engineer · PicnicHealth United States · Date Posted · 10 Aug, 2023 · Work Location · San Francisco, United States · Salary Offered · $160 — $190 yearly · Job Type · Full Ti ...


  • Wasmer San Francisco, United States

    [Full Time] Site Reliability Engineer at Wasmer (United States) | BEAMSTART Jobs · Site Reliability Engineer · Wasmer United States · Date Posted · 25 Mar, 2023 · Work Location · San Francisco, United States · Salary Offered · Not Specified · Job Type · Full Time · Experience R ...


  • OpenAI San Francisco, United States

    Join the engineering teams that bring OpenAI's ideas safely to the world · The Applied Engineering team works across research, engineering, product, and design to bring OpenAI's technology to consumers and businesses. We seek to learn from deployment and distribute the benefits ...


  • Together San Francisco, United States

    As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, a ...


  • Instabase San Francisco, United States

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the wo ...


  • Instabase San Francisco, United States

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. · With customers representing some of the largest and most complex organizations in the ...


  • Mission Box Solutions San Francisco, United States Permanent

    As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government. Our client firmly believes that exceptional technology services ...


  • Syndio San Francisco, United States

    Do you want to empower organizations to fairly and equitably hire, promote, retain and compensate their employees? Syndio is a Series-C technology company committed to fairness in the workplace. Fueled by investments of $83M from Bessemer Ventures, Voyager Capital and social chan ...


  • eTeam Inc. San Francisco, United States

    Role: Site Reliability Engineer · Location: 100% remote · Duration: 6+ Months · Primary Skill: · Minimum 8 years exp in Terraform, Ansible, Networking, Jenkins, Python, GCP in Technology companies. · Security (vulnerability management). ...


  • DigitalOcean San Francisco, United States

    Do you ever wonder what happens inside the cloud? · DigitalOcean (NYSE: DOCN) simplifies cloud computing so builders can spend more time creating software that changes the world. With our mission-critical infrastructure and fully managed offerings, DigitalOcean enables startups a ...


  • Best Secret San Francisco, United States

    About BestSecretGroup · We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for our m ...


  • Orb San Francisco, United States

    Mission · Orb is on an ambitious mission to provide every business with the infrastructure to unlock their revenue. Best-in class businesses find ways to effectively align their monetization to product usage—whether that's through seats, consumption, feature limits, or usage-bas ...


  • Anthropic San Francisco, United States

    We are looking for a Site Reliability Engineer who will ensure the high availability and performance of our Kubernetes clusters that power machine learning research and services. · About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems ...


  • Together AI San Francisco, United States

    As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, a ...


  • Replit San Francisco, United States

    [Full Time] Site Reliability Engineer at Replit (United States) | BEAMSTART Jobs · Site Reliability Engineer · Replit United States · Date Posted · 23 Feb, 2023 · Work Location · San Francisco, United States · Salary Offered · $70000 — $175000 yearly · Job Type · Full Time · Ex ...


  • Withorb San Francisco, United States

    Mission · Orb is on an ambitious mission to provide every business with the infrastructure to unlock their revenue. Best-in class businesses find ways to effectively align their monetization to product usage—whether that's through seats, consumption, feature limits, or usage-bas ...


  • Radar San Francisco, United States

    [Full Time] Site Reliability Engineer at RADAR (United States) | BEAMSTART Jobs · Site Reliability Engineer · RADAR United States · Date Posted · 14 Mar, 2023 · Work Location · San Francisco, United States · Salary Offered · $100000 — $230000 yearly · Job Type · Full Time · Exp ...


  • Resource Informatics Group San Francisco, United States

    Job Title: Site Reliability Engineer · Work Location: San Francisco, CA (Hybrid after showing successful engagement) · Duration: 18+ months · Most important skills:10 years of Oracle database administration experience on large production environment · Database hands on skills ...


  • PostHog San Francisco, United States

    [Full Time] Site Reliability Engineer at PostHog (United States) | BEAMSTART Jobs · Site Reliability Engineer · PostHog United States · Date Posted · 31 Oct, 2022 · Work Location · San Francisco, United States · Salary Offered · Not Specified · Job Type · Full Time · Experience ...