Lead Site Reliability Engineer - San Francisco, United States - JobBoard

JobBoard San Francisco, United States

2 weeks ago

Description

North America

By making evidence the heart of security, we help customers stay ahead of ever-changing cyber-attacks.

Corelight is a cybersecurity company that transforms network and cloud activity into evidence.

Evidence that elite defenders use to proactively hunt for threats, accelerate response to cyber incidents, gain complete network visibility and create powerful analytics using machine-learning and behavioral analysis tools.

Easily deployed, and available in traditional and SaaS-based formats, Corelight is the fastest-growing Network Detection and Response (NDR) platform in the industry.

And we are the only NDR platform that leverages the power of Open Source projects in addition to our own technology to deliver Intrusion Detection (IDS), Network Security Monitoring (NSM), and Smart PCAP solutions.

We sell to some of the most sensitive, mission critical large enterprises and government agencies in the world.

As the Technical Lead - SRE you will collaborate with development and quality engineering to build and maintain our continuous integration pipeline from development to production.

You'll bring a strong systems background and an eye toward automated software engineering and continuous delivery.

Your deep understanding of SaaS and cloud technologies, combined with your leadership skills, will be vital in shaping the future of Corelight's Open NDR SaaS Platform.

Your Role and Responsibilities

Engage in overall software architecture from design to implementation, to monitoring, and to testing.

This includes conducting ongoing analysis of our architecture and designs, CI/CD practices, implementing automated test suites, monitoring tools, and alerting mechanics.

Drive Corelight SaaS Cloud architecture, working closely with Engineering, Product, and other technical leaders

Drive SaaS Operations improvements including Cost, Monitoring, Security, Change Management controls, etc.

Design, develop and maintain robust and scalable Machine Learning pipelines Infrastructure.

Implement automation, disaster recovery, and system resilience best practices.

Work in an Agile development team to design and deliver service features end-to-end from design to production deployment and monitoring.

Engage in hands-on, in-depth analysis, review, and design of the Cloud Infrastructure, high availability, resilience, and meeting stringent SLO objectives.

Work closely with offshore teams on various development projects.

Qualifications

10+ years of Enterprise Distributed System Architecture, Public Cloud Infrastructure, Observability, and Infrastructure as a Code.

Experience programming skills in Python or Golang, Infrastructure such as code (Terraform, Pulumi) and Ansible

Hands-on experience with Kubernetes, Kafka, Elastic Search, Docker, and Containers

Experience with CI/CD practices, pipelines, monitoring and alerting tools, and automated test suite frameworks such as Gitlab, cloud devops tools, etc

Experience with current SRE/DevOps best practices.

Experience in architecting, building, and scaling platforms and distributed systems that require high availability, resilience, and meeting stringent SLO objectives is required.

Knowledgeable in distributed systems and redundancy / high-availability and performance optimizations

Experience in designing and implementing infrastructure for machine learning pipelines using Apache Spark or Apache Flink.

Solid understanding of distributed systems and big data technologies.

Familiarity with AWS, particularly Lambda, APIGW, MSK, EMR, AppSync,EKS, MLOps

Preferred Qualifications

Experience in optimizing and troubleshooting complex, managing/deploying large-scale cloud infrastructure

Experience in backup strategies and Disaster Recovery

Knowledge of Network-based Security Detections and Attack techniques desirable.

Experience with Search and Analytics tools like Splunk, Elasticsearch etc.

Experience working in a distributed team.

Good to have compliance requirements of FedRAMP, GDPR, SOC2, etc.

Familiar with security and risk mitigation (authentication, encryption, anomaly detection) for a cloud-based environment

We understand that no candidate is perfectly qualified. Experience comes in different forms; many skills transfer; and passion goes a long way.

We want you to learn new things in this role, and we encourage you to apply if your experience is close to what we're looking for.

We are proud of our culture and values - driving diversity of background and thought, low-ego results, applied curiosity and tireless service to our customers and community.

Corelight is committed to a geographically dispersed yet connected employee base with employees working from home and office locations around the world.

Fueled by an accelerating revenue stream, and investments from top-tier venture capital organizations such as Crowdstrike, Accel and Insight - we are rapidly expanding our team.

Notice of Pay Transparency:

The compensation for this position ranges from $180,000 - $225,000/year and may vary depending on factors such as your location, skills and experience.

Depending on the nature and seniority of the role, a percentage of compensation may come in the form of a commission-based or discretionary bonus.

Equity and additional benefits will also be awarded.

Check us out at

#J-18808-Ljbffr

Sr. Site Reliability Engineer

3 weeks ago

Outdefine San Francisco, CA, United States

full time $ /yr remote ???????? USD · full time $ /yr hybrid ???????? USD · #J-18808-Ljbffr ...
Reliability Engineer

2 weeks ago

OpenAI San Francisco, United States

Join the engineering teams that bring OpenAI's ideas safely to the world · The Applied Engineering team works across research, engineering, product, and design to bring OpenAI's technology to consumers and businesses. We seek to learn from deployment and distribute the benefits ...
Junior Reliability Engineer

1 hour ago

Jones Lange Lasalle, Inc. West Valley City, United States

The Junior Reliability Engineer is responsible for performing data validation around assets (HVAC, Electrical, Plumbing, etc.) that are managed by both Mobile and Static Facilities Management Technicians at all managed facilities within our West Caro Reliability Engineer, Liabili ...
Site Reliability Engineer

2 weeks ago

Vertisystem San Francisco, United States

Duration: 6 months contract · Pay rate: $90/hr on W2 · Job Summary: · It is an exciting time to be part of the organization's CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The ...
Engineering Director, Reliability

2 weeks ago

StarTree San Francisco, United States

At StarTree we're a group of passionate individuals that desire to improve the lives of many by developing tools and technologies that support availability and speed in the world of real-time analytics. · Our aim is to make it simple for every company to delight their users - ex ...
Site Reliability Engineer

2 days ago

Instabase San Francisco, United States

At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. · With customers representing some of the largest and most complex organizations in the ...
Site Reliability Engineer

2 weeks ago

CAPTIVATEIQ INC San Francisco, United States

[Full Time] Site Reliability Engineer - Remote at CaptivateIQ (United States) | BEAMSTART Jobs · Site Reliability Engineer - Remote · CaptivateIQ United States · Date Posted · 31 Jan, 2023 · Work Location · San Francisco, United States · Salary Offered · $139000 — $186000 yearl ...
Site Reliability Engineer

2 weeks ago

Wasmer San Francisco, United States

[Full Time] Site Reliability Engineer at Wasmer (United States) | BEAMSTART Jobs · Site Reliability Engineer · Wasmer United States · Date Posted · 25 Mar, 2023 · Work Location · San Francisco, United States · Salary Offered · Not Specified · Job Type · Full Time · Experience R ...
Site Reliability Engineer

2 weeks ago

Vertisystem San Francisco, United States

Duration: 6 months contract · Pay rate: $90/hr on W2 · Job Summary: · It is an exciting time to be part of the organizations CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. T ...
Site Reliability Engineer

1 week ago

Vertisystem San Francisco, United States

Duration: 6 months contract · Pay rate: $90/hr on W2 · Job Summary: · It is an exciting time to be part of the organization's CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. Th ...
Site Reliability Engineer

2 weeks ago

Telestream San Francisco, United States

About Us: · Welcome to the forefront of innovation at Telestream, an industry leading digital video delivery company. We are a dynamic and forward-thinking organization committed to leveraging cutting-edge cloud technologies to drive our success. If you're ready to be part of a ...
Plant Reliability Engineer

3 weeks ago

Affinity Executive Search San Francisco, CA, United States

Plant Reliability Engineer · Location: San Francisco, CA area · Description: · This opportunity is with a medium sized specialty chemical manufacturer located outside of San Francisco. The plant is PSM regulated, DCS controlled, with a very high standard of safety and overall ...
Site Reliability Engineer

2 weeks ago

DAOmatch San Francisco, United States

Aptos is a people-first blockchain on a mission to help billions of people achieve universal and fair access to decentralized assets in a safe and scalable way.Founded by some of the original creators and maintainers that researched, designed, and built the Diem blockchain to ser ...
Reliability Engineering Manager

3 weeks ago

OpenAI San Francisco, United States

About the team: · Reliable services are what enables Open AI to train the best AI models in the world and to bring the promise of safe, effective AI to the world. The SRE team in research is responsible for defining, measuring, and improving the reliability of the research platf ...
Site Reliability Engineer

2 days ago

Talkdesk San Francisco, United States

At Talkdesk, we are courageous innovators focused on helping organizations around the world create better customer experiences. Our AI-powered cloud contact center solutions optimize our customers' most critical customer service processes. We are recognized as a Contact Center as ...
Site Reliability Engineer

2 days ago

Cypress Human Capital Management, LLC San Francisco, United States

Site Reliability Engineer (Grafana) · Responsibilities · Collaborate with Service Owners and Observability Leaders to develop a strategy for monitoring the technology stack using Grafana. · Initiate data ingestion by deploying Telegraf and exporters (if necessary), utilizing di ...
Site Reliability Engineer

2 days ago

DigitalOcean San Francisco, United States

Do you ever wonder what happens inside the cloud? · DigitalOcean (NYSE: DOCN) simplifies cloud computing so builders can spend more time creating software that changes the world. With our mission-critical infrastructure and fully managed offerings, DigitalOcean enables startups ...
Site Reliability Engineer

2 weeks ago

Pelago San Francisco, United States

Role Overview: · At Pelago, we run a serverless architecture on AWS, with infrastructure managed using Terraform. Our system has been built to deliver our virtual clinic for Substance Use Management, and we are looking for a talented Site Reliability Engineer to join the engineer ...
Site Reliability Engineer

18 hours ago

WEX San Francisco, United States

(*) This is a remote position; however, the candidate must reside within 30 miles of one of the following locations: Boston, MA; Dallas, TX; San Francisco Bay Area, CA; Portland, ME; and Washington, D.C. · About the Team/Role · The WEX Site Reliability Engineering (SRE) team is ...
Site Reliability Engineer

2 weeks ago

PostHog San Francisco, United States

[Full Time] Site Reliability Engineer at PostHog (United States) | BEAMSTART Jobs · Site Reliability Engineer · PostHog United States · Date Posted · 31 Oct, 2022 · Work Location · San Francisco, United States · Salary Offered · Not Specified · Job Type · Full Time · Experience ...

Lead Site Reliability Engineer - San Francisco, United States - JobBoard

Description

Sr. Site Reliability Engineer

Reliability Engineer

Junior Reliability Engineer

Site Reliability Engineer

Engineering Director, Reliability

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Plant Reliability Engineer

Site Reliability Engineer

Reliability Engineering Manager

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

Kineret Stanley

kshitija gupte

Raj Ray

Paul Campbell

McKenzie Friel

Mustafa Hafeez

for Recruiters

Information

Lead Site Reliability Engineer - San Francisco, United States - JobBoard

Description

Lead Site Reliability Engineer professionals in San Francisco