Jobs
>
Redmond

    Site Reliability Engineering Manager - Redmond, United States - 3MD Inc.

    Default job background
    Description

    Job Description

    Job Description

    Summary of Position:

    As the Site Reliability Engineering (SRE) Manager you will be responsible for leading a team of Cloud Operations, Site Reliability Engineers, Cloud Security, and Cloud Architecture experts in AWS supporting technologies. This position will ensure that the team delivers value to internal development teams, cloud security, operations, and manages operations to maximize uptime, availability, and system stability.

    Essential Functions:

    • Lead a team involved with Cloud Operations, Site Reliability Engineering, Cloud Infrastructure, Cloud Security, and Cloud Architecture using AWS and other supporting technologies.
    • Provide thought leadership on SRE topics including definition of metrics, production system management, change management, continuous deployment management, and incident response.
    • Define and implement best practices for Site Reliability including Observability, Resiliancy, Automation and Security.
    • Provide infrastructure as code and operational support for full-stack software applications
    • Collaborate with development staff to create, monitor, and troubleshoot the system infrastructure
    • Increase system resilience and serve larger customer volumes with expert-level infrastructure as code, bulletproof release, and change management skills
    • Deliver on Commitments on time with high quality and sense of ownership and accountability. Ensure that the team delivers value to delivery teams, cloud security, operations, and manages operations to maximize uptime, availability, and system stability.
    • Manage AWS and on-premise infrastructure and services, including EC2, CloudWatch, S3, Redshift, RDS, ECS, EKS, On-Premise VMWare, CloudFormation, Terraform, Ansible, Lambda, CloudWatch Alarms, Alerts, and Automation, VPC Management, network security groups, Network Access Control List, VerneMQ, Kafka, OpenStack, Kubernetes, Docker, Java, and Ruby on Rails.
    • Work closely with the team and client stakeholders to prevent incidents by delivering robust solutions, as well as resolve priority incidents and other production issues.
    • Ensure the resolution of issues using professionalism, technical skills, common sense, and leadership.
    • Reduce the cycle time between problem or incident identification and problem or incident resolution following policies and processes set by our team, the client, and industry best practices.
    • Create, maintain, publish and delivery of the technology roadmap.
    • Develop, maintain and communicate detailed engineering resource plans and schedules.
    • Consult with business decision-makers and senior engineering staff to inform discussions related to technical decisions.
    • Serve as primary point of contact for reviewing user and/or technical requirements, scope estimating, identifying tasks, assigning and coordinating technical resources.
    • Proactively identify project dependencies and develop and implement strategies to mitigate risks.
    • Facilitate strong communication amongst team members and cross-functional leads.

    Competencies:

    1. Ensures Accountability
    2. Tech Savvy
    3. Communicates Effectively
    4. Values Differences
    5. Customer Focus
    6. Resourcefulness
    7. Drives Results
    8. Plans and Prioritizes
    9. Decision Quality
    10. Self-Development

    Work Environment:

    This position requires occasional onsite work at the client. This job operates in a professional office environment. This role routinely uses standard office equipment such as computers, phones, photocopiers, filing cabinets, and fax machines.

    Physical Demands:

    The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job.

    While performing the duties of this job, the employee in this position frequently communicates with other co-workers/clients who have inquiries about the various projects and other needs. Must be able to exchange accurate information in these situations. The employee must be able to remain in a stationary position 75% of the time. The employee in this position needs to occasionally move about inside the office to access file cabinets, office machinery, etc. Constantly operate a computer and office machinery such as a calculator, keyboard, copy machine, and printer. Frequently moves boxes with equipment weighing up to 25lbs across the building and/or to other offsite buildings for various project needs.

    Required Education and Experience:

    • Bachelor's degree in computer science or a related field
    • 10-15 Years of Experience

    Qualifications:

    • At least 8 years of experience in managing AWS cloud infrastructure and services
    • Experience with DevOps and Site Reliability Engineering practices
    • Strong leadership and management skills with experience leading and managing a 24x7 team
    • Experience in Incident, Problem, and Change Management
    • Experience in managing cloud security and architecture
    • Strong problem-solving skills and a quality focus
    • Strong communication and interpersonal skills
    • Experience working in a fast-paced, dynamic environment
    • Relevant industry certifications such as AWS Certified Solutions Architect, AWS Certified DevOps Engineer, or similar

    AAP/EEO Statement:

    3MD Inc. is an equal opportunity employer and does not discriminate based on gender, sex, age, race and color, religion, marital status, national origin, disability, sexual orientation, gender identity or expression, veteran status, or any other category that is protected by applicable law.

    Other Duties:

    Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties, or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice.



  • Galaxy Group Bellevue, United States

    You. With us? For our team of system development, located in Arnsberg, Berzdorf, Dortmund or Essen, we are looking for dedicated reinforcement to join us as soon as possible for a fixed term of four years. · Your work should be meaningful and you want to contribute with all your ...


  • GForce Life Sciences Redmond, United States

    Internal Title: Project Manager, Engineering · Duration: 12-month contract · Location: Redmond, WA · Responsibilities: · Assist in the management of multiple engineering projects; prioritize tasks and assign team members to ensure that the team's overall resources are used effec ...


  • Microsoft Corporation Redmond, United States

    Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsofts expanding Cloud Infrastructure and responsible for powering Microsofts Intelligent Cloud mission. SCHIE delivers the core infrastructure and foundational technologies for Micro ...


  • META Redmond, United States

    Summary: · As an Electrical Engineering Manager at Meta Reality Labs (RL), you will lead the team designing Augmented Reality (AR) headsets for Facebook. You will work cross-functionally with teams that will dynamically change depending on the problem and potentially include mech ...


  • Meta Platforms, Inc. Redmond, United States

    As an Electrical Engineering Manager at Meta Reality Labs (RL), you will lead the team designing Augmented Reality (AR) headsets for Facebook. You will work cross-functionally with teams that will dynamically change depending on the problem and potentially include mechanical, opt ...


  • Meta Redmond, WA, United States

    As a Firmware Engineering Manager at Reality Labs (RL), you will lead, manage, and inspire engineering teams developing platforms for Augmented Reality (AR). · Firmware for AR systems spans multiple target classes, requires deep collaboration across engineering disciplines and th ...


  • GForce Life Sciences Redmond, United States

    Internal Title: Project Manager, Engineering · Duration: 12-month contract · Location: Redmond, WA · Responsibilities: · Assist in the management of multiple engineering projects; prioritize tasks and assign team members to ensure that the team's overall resources are used ...


  • Microsoft Corporation Redmond, United States

    The Microsoft Advertising Service Engineering Manager is responsible for a team of Service engineers that provide advanced technical support for our online Microsoft advertising business while delivering customer experiences unmatched by our competitors. The primary focus will be ...


  • iMPact Business Group Redmond, United States

    Our client, a Global Leader in the Medical Device Industry has an immediate opening for an Engineer Project Manager *for a * 12 month + Contract to Hire opportunity. Our client offers results-driven people a place where they can make a difference - every day You will also have th ...


  • iMPact Business Group Redmond, United States

    Job Description · Our client, a Global Leader in the Medical Device Industry has an immediate opening for an · Engineer Project Manager · for a · 12 month + Contract to Hire opportunity . Our client offers results-driven people a place where they can make a difference - every ...


  • Microsoft Corporation Redmond, United States

    Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsofts expanding Cloud Infrastructure and responsible for powering Microsofts Intelligent Cloud mission. SCHIE delivers the core infrastructure and foundational technologies for Micro ...


  • Microsoft Corporation Redmond, United States

    Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for M ...


  • Microsoft Corporation Redmond, United States

    Do you see yourself as a coach, a guide, a collaborator, and an engineer? Would you be excited to drive major changes to a high scale service that is critical to the company? Do you care deeply about fostering a great team culture? We are looking for a Principal Engineering Manag ...


  • META Redmond, United States

    Summary: · As a Firmware Engineering Manager at Reality Labs (RL), you will lead, manage, and inspire engineering teams developing platforms for Augmented Reality (AR).Firmware for AR systems spans multiple target classes, requires deep collaboration across engineering discipline ...


  • Microsoft Corporation Redmond, United States

    Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for M ...


  • Microsoft Corporation Redmond, United States

    Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world. · Microsoft's Azure Data ...


  • iMPact Business Group Redmond, United States

    Job Description · Our client, a Global Leader in the Medical Device Industry has an immediate opening for an Engineer Project Manager for a 12 month + Contract to Hire opportunity. Our client offers results-driven people a place where they can make a difference - every day You w ...


  • Microsoft Corporation Redmond, United States

    Microsoft Silicon and Cloud Hardware Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for Mic ...


  • Microsoft Corporation Redmond, United States

    Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for M ...


  • Insight Global Redmond, United States

    Assignment and tracking of work that is being done · * Strong project management skillset · * Ensure we are staying on top of tasks (whether it is tests, development projects, etc.) · * Connecting with team 1:1 and build status report outs for all testing in all labs · * Repo ...