-
Site Reliability Engineer
2 weeks ago
BHO Tech San Francisco, United States Full timeJob Description · We're the driverless car company. We're building the world's best autonomous vehicles to safely connect people to the places, things, and experiences they care about. · Our vehicles are on the road in California, Arizona, and Michigan navigating some of the mos ...
-
Site Reliability Engineer
1 week ago
PicnicHealth San Francisco, United States[Full Time] Site Reliability Engineer at PicnicHealth (United States) | BEAMSTART Jobs · Site Reliability Engineer · PicnicHealth United States · Date Posted · 10 Aug, 2023 · Work Location · San Francisco, United States · Salary Offered · $160 — $190 yearly · Job Type · Full Ti ...
-
Site Reliability Engineer
1 week ago
Wasmer San Francisco, United States[Full Time] Site Reliability Engineer at Wasmer (United States) | BEAMSTART Jobs · Site Reliability Engineer · Wasmer United States · Date Posted · 25 Mar, 2023 · Work Location · San Francisco, United States · Salary Offered · Not Specified · Job Type · Full Time · Experience R ...
-
Software Engineer, Reliability
1 week ago
OpenAI San Francisco, United StatesJoin the engineering teams that bring OpenAI's ideas safely to the world · The Applied Engineering team works across research, engineering, product, and design to bring OpenAI's technology to consumers and businesses. We seek to learn from deployment and distribute the benefits ...
-
Site Reliability Engineer
1 day ago
Together San Francisco, United StatesAs a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, a ...
-
Site Reliability Engineer
1 day ago
Instabase San Francisco, United StatesAt Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the wo ...
-
Site Reliability Engineer
3 weeks ago
Instabase San Francisco, United StatesAt Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. · With customers representing some of the largest and most complex organizations in the ...
-
Site Reliability Engineer
6 days ago
Mission Box Solutions San Francisco, United States PermanentAs a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government. Our client firmly believes that exceptional technology services ...
-
Site Reliability Engineer
3 weeks ago
Syndio San Francisco, United StatesDo you want to empower organizations to fairly and equitably hire, promote, retain and compensate their employees? Syndio is a Series-C technology company committed to fairness in the workplace. Fueled by investments of $83M from Bessemer Ventures, Voyager Capital and social chan ...
-
Site Reliability Engineer
1 day ago
eTeam Inc. San Francisco, United StatesRole: Site Reliability Engineer · Location: 100% remote · Duration: 6+ Months · Primary Skill: · Minimum 8 years exp in Terraform, Ansible, Networking, Jenkins, Python, GCP in Technology companies. · Security (vulnerability management). ...
-
Site Reliability Engineer
3 weeks ago
DigitalOcean San Francisco, United StatesDo you ever wonder what happens inside the cloud? · DigitalOcean (NYSE: DOCN) simplifies cloud computing so builders can spend more time creating software that changes the world. With our mission-critical infrastructure and fully managed offerings, DigitalOcean enables startups a ...
-
Site Reliability Engineer
1 week ago
Best Secret San Francisco, United StatesAbout BestSecretGroup · We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for our m ...
-
Site Reliability Engineer
3 days ago
Orb San Francisco, United StatesMission · Orb is on an ambitious mission to provide every business with the infrastructure to unlock their revenue. Best-in class businesses find ways to effectively align their monetization to product usage—whether that's through seats, consumption, feature limits, or usage-bas ...
-
Site Reliability Engineer
1 day ago
Anthropic San Francisco, United StatesWe are looking for a Site Reliability Engineer who will ensure the high availability and performance of our Kubernetes clusters that power machine learning research and services. · About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems ...
-
Site Reliability Engineer
2 days ago
Together AI San Francisco, United StatesAs a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, a ...
-
Site Reliability Engineer
1 week ago
Replit San Francisco, United States[Full Time] Site Reliability Engineer at Replit (United States) | BEAMSTART Jobs · Site Reliability Engineer · Replit United States · Date Posted · 23 Feb, 2023 · Work Location · San Francisco, United States · Salary Offered · $70000 — $175000 yearly · Job Type · Full Time · Ex ...
-
Site Reliability Engineer
2 days ago
Withorb San Francisco, United StatesMission · Orb is on an ambitious mission to provide every business with the infrastructure to unlock their revenue. Best-in class businesses find ways to effectively align their monetization to product usage—whether that's through seats, consumption, feature limits, or usage-bas ...
-
Site Reliability Engineer
1 week ago
Radar San Francisco, United States[Full Time] Site Reliability Engineer at RADAR (United States) | BEAMSTART Jobs · Site Reliability Engineer · RADAR United States · Date Posted · 14 Mar, 2023 · Work Location · San Francisco, United States · Salary Offered · $100000 — $230000 yearly · Job Type · Full Time · Exp ...
-
Site Reliability Engineer
3 weeks ago
Resource Informatics Group San Francisco, United StatesJob Title: Site Reliability Engineer · Work Location: San Francisco, CA (Hybrid after showing successful engagement) · Duration: 18+ months · Most important skills:10 years of Oracle database administration experience on large production environment · Database hands on skills ...
-
Site Reliability Engineer
1 week ago
PostHog San Francisco, United States[Full Time] Site Reliability Engineer at PostHog (United States) | BEAMSTART Jobs · Site Reliability Engineer · PostHog United States · Date Posted · 31 Oct, 2022 · Work Location · San Francisco, United States · Salary Offered · Not Specified · Job Type · Full Time · Experience ...
Site Reliability Engineer - San Francisco, United States - Cypress Human Capital Management, LLC
![Default job background](https://contents.bebee.com/public/img/bg-user-ex-1.jpg)
Description
Site Reliability Engineer (Grafana)Responsibilities
Collaborate with Service Owners and Observability Leaders to develop a strategy for monitoring the technology stack using Grafana.
Initiate data ingestion by deploying Telegraf and exporters (if necessary), utilizing discovery to feed data into Grafana Mimir.
Establish initial alerting by creating alert rules and enabling self-service alerting in Grafana.
Create initial dashboards to monitor the health and capacity of services.
Transition responsibilities to the service owner by providing comprehensive documentation and necessary training.
Required Skills
3+ years of experience as a Site Reliability Engineer with proficiency in the Grafana platform.
Expertise in Grafana, particularly in dashboard best practices and writing PromQL to create widgets and alert rules.
Proficiency in Grafana Mimir or equivalent systems (e.g., Thanos, Cortex).
Extensive knowledge of Prometheus.
Advanced skills in Telegraf.
Expertise in Ansible, including writing playbooks and deploying/configuring services.
Proficient in using Git (GitLab) for self-service as code.
Broad expertise in various technology stacks and transitioning their monitoring to the Grafana ecosystem.
Experience with modern operating systems, specifically CentOS and Ubuntu.
Pay Rate :
$60-$70/hour
#J-18808-Ljbffr