Jobs
>
San Jose

    Tech Lead Manager, Site Reliability Engineer - San Jose, United States - Selby Jennings

    Selby Jennings background
    Technology / Internet
    Description

    The Company: Our client is one of the world's leading social media companies. This platform allows innovative avenues to express creativity, explore interests, and most importantly global connectivity. Having over a billion users, this company pursues the best of the best engineering talent, while also forming dynamic teams who, like the users, are intelligent, compassionate, and creative

    Responsibilities:

    • Hire and lead a team of 8-12 engineers to design, develop, and operate large-scale cloud infrastructure.
    • Give technical leadership and support to your team and stakeholders.
    • Coordinate with different teams and stakeholders in areas like Advertising, Machine Learning, and E-commerce.
    • Design and implement software platforms and monitoring frameworks to govern service-oriented architecture (SOA) efficiently, automatically, and intelligently.
    • Develop and manage components of cloud-managed data infrastructure, encompassing technologies such as Kubernetes, Redis, MySQL, Flink, and more.
    • Establish sustainable mechanisms for scaling systems, such as automation, to drive enhancements in reliability, efficiency, and velocity.

    Qualifications:

    • 3 YOE hiring and managing teams of eight or more engineers.
    • 7 YOE in software development or SRE, particularly in the design, building, scaling, and troubleshooting of cloud systems.
    • Hands-on experience and understanding of system architecture and performance bottlenecks in at least one of these areas: Database (SQL/Non-SQL), Kubernetes, or Big Data processing and storage systems, for both streaming and batch scenarios.
    • Strong verbal and written communication skills, proficient in collaborating with various stakeholders like development teams, product teams, data scientists, and leadership.

    Additional Information:

    • Same position available in San Jose and Seattle
    • Work Model: Fully onsite
    • Pay Structure: Base + Bonus + RSUs Complete, Competitive Benefits Package
    • No C2C at this time


  • Advanced Micro Devices , Inc. San Jose, United States

    Overview: · WHAT YOU DO AT AMD CHANGES EVERYTHING · We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building bloc ...


  • TEKsystems San Jose, United States Contract

    Description: · Adobe is looking for an experienced Site Reliability Engineer to join the internal tooling team support, configure, integrate, upgrade, and automate the use of enterprise tools used across their large Engineering organization. Role will be focused on user interact ...


  • HCLTech San Jose, United States

    About HCLTech: · HCLTech is a global technology company, home to 221,000+ people across 60 countries, delivering industry-leading capabilities centered around digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients ac ...


  • Natron Energy Santa Clara, United States

    Natron is seeking a Reliability Engineer to support the development and test of our high-power battery systems for data center UPS and EV charging applications. The occupant of this position will work with the Product Engineering, Reliability, Technology, and Operations teams to ...

  • Comtech Telecom

    Reliability Engineer

    3 weeks ago


    Comtech Telecom Santa Clara, United States

    Comtech Telecommunications Corp. has an opportunity in Santa Clara, CA for a Reliability/Failure Analysis Engineer. In this important role, you will collaborate with a diverse team of technical professionals and interact with outside customers, providing solutions to a variety of ...

  • COMTECH TELECOMMUNICATIONS

    Reliability Engineer

    3 weeks ago


    COMTECH TELECOMMUNICATIONS Santa Clara, United States

    Job Description · Job DescriptionComtech Telecommunications Corp. has an opportunity in Santa Clara, CA for a Reliability/Failure Analysis Engineer. In this important role, you will collaborate with a diverse team of technical professionals and interact with outside customers, pr ...


  • Diverse Lynx San Jose, United States

    Semiconductor Reliability Senior Engineer · 5+ experience in IC reliability engineering with hands-on experience in 1 or more related areas such as Product Engineering, Test Engineering, Failure Analysis. · •Good understanding of Semiconductor, manufacturing process (Fab, Assembl ...


  • MRINetwork Jobs San Jose, United States

    Job Description · Job Description · We are working with a company operating in the best of both worlds – an innovative start-up inside of a $6 billion parent company building the next generation of solar. They have developed an industry-leading building-integrated solar technol ...


  • Antora Energy San Jose, United States

    Job Description · Job DescriptionAt Antora, we're on a mission to stop climate change. And we can't do that unless we tackle the 30% of global emissions that come from industry. · Antora is unlocking zero-emissions industrial energy, cheaper than fossil fuels. Antora's thermal ba ...


  • Intel San Jose, United States

    Job Details: · Job Description: · Microelectronic Quality Reliability Engineers provide project management, product, process design/development and sustaining support for integrated circuit or semiconductor assemblies, various other electronic components, sub systems and/or com ...


  • HCLTech San Jose, United States

    About HCLTech: · HCLTech is a global technology company, home to 221,000+ people across 60 countries, delivering industry-leading capabilities centered around digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients ac ...


  • Antora Energy San Jose, United States

    At Antora, we're on a mission to stop climate change. And we can't do that unless we tackle the 30% of global emissions that come from industry. · Antora is unlocking zero-emissions industrial energy, cheaper than fossil fuels. Antora's thermal batteries store energy from renewa ...


  • Myriad Consulting Inc San Jose, United States

    This role also open for junior (3+ yoe) candidates, and SRE lead (7+ yoe). · Site Reliability Engineering(SRE) team combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you ll have the opportunity ...


  • Analog Devices San Jose, United States

    Come join Analog Devices (ADI) - a place where Innovation meets Impact. For more than 55 years, Analog Devices has been inventing new breakthrough technologies that transform lives. At ADI you will work alongside the brightest minds to collaborate on solving complex problems that ...


  • The Dignify Solutions LLC San Jose, United States

    AWS Infra SRE/DevOps engineer with proven work experience ensuring reliability, availability and performance of cloud infra and platform. · - Specialist on Cisco Cloud run-on for infrastructure management, who can install, run, and maintain software like docker, and containers. ...


  • Cisco San Jose, United States

    The successful applicant will be performing work in FedRAMP environments, and therefore, must be a U.S. Person (i.e. U.S. citizen, U.S. national, lawful permanent resident, asylee, or refugee). This position may also perform work that the U.S. government has specified can only be ...


  • Cisco San Jose, United States

    The successful applicant will be performing work in FedRAMP environments, and therefore, must be a U.S. Person (i.e. U.S. citizen, U.S. national, lawful permanent resident, asylee, or refugee). This position may also perform work that the U.S. government has specified can only be ...


  • Analog Devices San Jose, United States

    Come join Analog Devices (ADI) – a place where Innovation meets Impact. For more than 55 years, Analog Devices has been inventing new breakthrough technologies that transform lives. At ADI you will work alongside the brightest minds to collaborate on solving complex problems that ...


  • IBM San Jose, United States

    Automation: Develop and maintain automation tools and scripts to streamline deployment, monitoring, and management of the infrastructure and · applications. · Monitoring and Alerting: Set up and maintain monitoring and alerting systems to proactively identify and resolve issues b ...


  • Zoom San Jose, United States

    ** Sponsorship is not available for this position ** · What you can expect · As a senior level Product Resilience SRE, you will define, scope, plan, and schedule Disaster Recovery Testing at Zoom. You will document any gaps identified by our testing, and drive technical solutions ...