Jobs
>
San Mateo

    Senior Site Reliability Engineer - San Mateo, United States - Qualys

    Default job background
    Description

    Come work at a place where innovation and teamwork come together to support the most exciting missions in the world

    Description
    We are seeking a highly motivated and talented Site Reliability Engineer to work on Qualys' Cloud Platform & Middleware technologies. Working with a team of engineers and architects, you will combine software development and systems engineering skills to build and run scalable, distributed and fault-tolerant systems.

    The ideal candidate will write software to optimize day to day work through better automation, monitoring, alerting, testing and deployment.
    Responsibilities

    Co-develop and participate in the full lifecycle development of cloud platform services from inception and design, deployment, operation andimprovement by applying scientific principles.
    Increase the effectiveness, reliability and performance of cloud platform technologies by identifying and measuring key indicators, making changes to the production systems in an automated way and evaluating the results.
    Support cloud platform team before the technologies are pushed for production release through activities such as system design, capacity planning, automation of key deployments, engaging in building a strategy for production monitoring and alerting and participate in testing/verification process.
    Ensure that the cloud platform technologies are maintained properly by measuring and monitoring availability, latency, performance and system health.
    Advice the cloud platform team to improve the reliability of the systems in production and scale them based on need.
    Participate in the development process by supporting new features, services, releases and hold an ownership mindset for the cloud platform technologies
    Develop tools and automate the process for achieving large scaleprovisioning and deployment of cloud platform technologies
    Participate in on-call rotation for cloud platform technologies. At times of incidents, lead incident response and be part of writing detailed postmortem analysis reports which are brutally honest with no-blame.
    Propose improvements and drive efficiencies in systems and processes related to capacity planning, configuration management, scaling services, performance tuning, monitoring, alerting and root cause analysis

    Requirements

    5 years of relevant experience in running distributed systems at scale in production.
    Expertise in one of the programming language: Java, Python or Go.
    Proficient in writing bash scripts
    Good understanding of SQL and NoSQL systems
    Good understanding of systems programming (network stack, file system, OS services)
    Understanding of network elements such as firewalls, load balancers, DNS, NAT, TLS/SSL, VLANs etc
    Skilled in identifying performance bottlenecks, identifying anomalous system behavior, and determining the root cause of incidents.
    Knowledge of JVM concepts like garbage collection, heap, stack, profiling, class loading, etc.
    Knowledge of best practices related to security, performance, high-availability, and disaster recovery.
    Demonstrate a proven record of handling production issues, planning escalation procedures, conducting post-mortems, impact analysis, risk assessments and other related procedures.
    Able to drive results and set priorities independently
    BS/MS degree in Computer Science, Applied Math or related field
    Bonus Points if you have:
    Experience with managing large scale deployments of search engines like Elasticsearch
    Experience with managing large scale deployments of message-oriented middleware such as Kafka
    Experience with managing large scale deployments of RDBMS systems such as oracle
    Experience with managing large scale deployments of NoSQL databases such as Cassandra
    Experience with managing large scale deployments of In-memory caching using Redis, Memcached, etc.
    Experience with container and orchestration technologies such as Docker, Kubernetes etc
    Experience with monitoring tools such as Graphite, Grafana andPrometheus
    Experience with Hashicorp technologies such as Consul, Vault, Terraform and Vagrant
    Experience with configuration management tools such as Chef, Puppet or Ansible
    In-depth experience with continuous integration and continuous deployment pipelines
    Exposure to Maven, Ant or Gradle for builds

    **********************

    Annual Salary Guidelines: $115,000 - $135,000

    Qualys is an Equal Opportunity Employer, please see our EEO policy.



  • Outdefine San Francisco, CA, United States

    full time $ /yr remote ???????? USD · full time $ /yr hybrid ???????? USD · #J-18808-Ljbffr ...


  • Zoox Foster City, United States

    At Zoox we have set the goal to provide our customers with the highest level of safety and a best-in-class experience while using our fully autonomous vehicles. You will work with a team of world-class engineers with diverse backgrounds such as robotics, control, and vehicle engi ...


  • Zoox Foster City, United States

    At Zoox we have set the goal to provide our customers with the highest level of safety and a best-in-class experience while using our fully autonomous vehicles. You will work with a team of world-class engineers with diverse backgrounds such as robotics, control, and vehicle engi ...


  • The Spear Group Belmont, United States

    Job Description · Job Description · The position requires technical expertise and leadership to implement and execute asset reliability strategies across operating site. Roles and responsibilities for this position are listed below. · Key Responsibilities: · Leads site team of ...


  • Verkada San Mateo, United States

    Who We Are · Verkada is the largest cloud-based B2B physical security platform company in the world. Only Verkada offers six product lines - video security cameras, access control, environmental sensors, alarms, workplace and intercoms - integrated with a single cloud-based soft ...


  • Zoox Foster City, United States

    At Zoox we have set the goal to provide our customers with the highest level of safety and a best-in-class experience while using our fully autonomous vehicles. You will work with a team of world-class engineers with diverse backgrounds such as robotics, control, and vehicle engi ...


  • Zoox Foster City, United States

    Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service fro ...


  • Zoox Foster City, United States Full time

    Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service fro ...


  • Arkose Labs San Mateo, United States

    Job Description · Job DescriptionThe mission of Arkose Labs is to create an online environment where all consumers are protected from online spam and abuse. Recognized by G2 as the 2023 Leader in Bot Detection and Mitigation, with the highest score in customer satisfaction and la ...


  • Arkose Labs San Mateo, United States

    The mission of Arkose Labs is to create an online environment where all consumers are protected from online spam and abuse. Recognized by G2 as the 2023 Leader in Bot Detection and Mitigation, with the highest score in customer satisfaction and largest market presence four quarte ...


  • Arkose Labs San Mateo, CA, United States

    The mission of Arkose Labs is to create an online environment where all consumers are protected from online spam and abuse. Recognized by G2 as the 2023 Leader in Bot Detection and Mitigation, with the highest score in customer satisfaction and largest market presence four quarte ...


  • Sentry Burlingame, United States

    Bad software is everywhere, and we're tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology. · With more than $217 million in funding and 90,000 organizations that believe we're on to something, we're buildi ...


  • Electric Hydrogen San Carlos, United States

    Electric Hydrogen's mission is to make molecules to decarbonize our world Our outstanding people are our most important asset and will allow us to deliver hydrogen from renewable electrolysis for heavy industry, at prices below fossil fuels. · In this role, you will use your exp ...


  • Qualys Foster City, United States

    Come work at a place where innovation and teamwork come together to support the most exciting missions in the world · Description · We are seeking a highly motivated and talented Site Reliability Engineer to work on Qualys' Cloud Platform & Middleware technologies. Working with a ...


  • PDS Tech Commercial, Inc. San Carlos, CA, United States

    PDS Tech Commercial, Inc is currently looking for Aircraft/Aerospace Reliability Engineers for our electric VTOL Aircraft client in Northern California. · CONTRACT (12 months+) · Location: San Carlos. CA · Pay Rate: $60 + DOE · Looking for those that can Drive Design-FMEA effo ...


  • PDS Tech San Carlos, United States

    PDS Tech Commercial, Inc is currently looking for Reliability Engineers for our electric VTOL Aircraft client in Northern California. · CONTRACT (12 months+) · Location: San Carlos. CA · Pay Rate: $60 + DOE · Responsibilities · Drive Design-FMEA efforts · , · Assist with defini ...


  • Electric Hydrogen San Carlos, United States

    Electric Hydrogen 's mission is to make molecules to decarbonize our world Our outstanding people are our most important asset and will allow us to deliver hydrogen from renewable electrolysis for heavy industry, at prices below fossil fuels. · In this role, you will use your exp ...


  • PDS Tech Commercial, Inc. San Carlos, CA, United States

    PDS Tech Commercial, Inc is currently looking for Reliability Engineers for our electric VTOL Aircraft client in Northern California. · CONTRACT (12 months+) · Location: San Carlos. CA · Pay Rate: $60 + DOE · Responsibilities · Drive Design-FMEA efforts , Assist with defining ...


  • Electric Hydrogen San Carlos, United States

    Electric Hydrogen's mission is to make molecules to decarbonize our world Our outstanding people are our most important asset and will allow us to deliver hydrogen from renewable electrolysis for heavy industry, at prices below fossil fuels. · In this role, you will use your exp ...


  • PDS Tech San Carlos, United States

    **PDS Tech Commercial, Inc is currently looking for Reliability Engineers for our electric VTOL Aircraft client in Northern California.** · **CONTRACT (12 months+)** · **Location: San Carlos. CA** · **Pay Rate: $60 + DOE** · **Responsibilities** · + **Drive Design-FMEA efforts** ...