Senior Manager, SRE - Seattle
6 hours ago

Job description
At F5, we strive to bring a better digital world to life.Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world.
We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.
About The Role
We are seeking a highly experienced Senior Manager to lead our Platform SRE, Virtualization, Networking, and AI Infrastructure organizations.
This leader will oversee teams operating mission-critical infrastructure across:
Kubernetes platforms:
OpenShift, Titan‑k8s, Robin, Vanilla Kubernetes
Virtualization & hypervisors:
Proxmox, VMware, XCP‑ng, KVM
Private cloud platforms:
OpenStack
Networking:
Data center & cloud networking, L4/L7 services, Kubernetes CNI/service mesh
AI/GPU compute:
BNK‑AI‑LAB & TMOS AI Lab clusters
This role is responsible for multi-team leadership, strategic platform roadmap, operational excellence, and end‑to‑end reliability across hybrid compute environments (VMs, containers, and AI workloads).
What You'll Lead
Multi-team ownership: SRE, Networking, Virtualization, AI/GPU Infrastructure
Lead hybrid data centers — spanning routing, switching, firewalls, SDN/overlay, Kubernetes CNI, and service‑mesh/L4‑L7 traffic — to drive network reliability, performance, security, and automation.
Reliability strategy:
SLO/SLI programs, incident management, automation, scaling
Kubernetes platform operations across multiple distros
Virtualization & private cloud:
Proxmox, VMware, XCP-ng, KVM, OpenStack
Provide executive oversight for OpenStack compute storage, and networking services.
Ensure scalable VM lifecycle management, resource optimization, and operational maturity.
Networking:
datacenter/cloud networking, CNI, service mesh, L4/L7 traffic
Own end‑to‑end reliability and performance of AI compute platforms, including model training/inference pipelines, GPU scheduling and autoscaling, and high‑performance compute environments
Partner with ML, Data, and Product to build next-gen AI compute platforms.
Drive adoption of automation-first operations, GitOps, and infrastructure-as-code.
Own the multi‑year platform roadmap across hybrid compute, Kubernetes, virtualization, AI, and networking while driving cross‑org alignment and leading large‑scale modernization across CI/CD, observability, and infrastructure.
Drive organizational strategy, prioritization, staffing plans, hiring, and budgeting.Build a high-performance, inclusive culture focused on ownership, excellence, and continuous improvement.
What You Bring
10+ years infrastructure/SRE/platform engineering experience
5+ years managing engineering teams (including managers or tech leads)
Deep experience with Kubernetes, virtualization, and cloud/networking
Strong leadership, communication, and cross-functional alignment
Proven record of accomplishment improving platform uptime, performance, and reliability
Required Qualifications
10+ years of experience in SRE, Platform Engineering, Virtualization, Networking, or Infrastructure.
5+ years managing engineering teams.
Proven leadership in:
Kubernetes platforms (OpenShift, Titan‑k8s, Robin, Vanilla K8s)
Virtualization (Proxmox, VMware, XCP‑ng, KVM)
OpenStack (Nova, Neutron, Cinder, Keystone)
Data center/cloud networking and distributed systems
Strong executive communication skills and cross-org influencing ability.
Demonstrated experience improving operational maturity and reliability for large-scale systems.
Strong background in automation, CI/CD, observability, and infrastructure architecture.
Preferred Qualifications:
Experience running large-scale multi-cluster Kubernetes environments.
Experience with service mesh, ingress controllers, and network policy frameworks.
Familiarity with GPU scheduling, Ray, Kubeflow, MLflow, or Triton Inference Server.
Experience with storage backends (Ceph, vSAN, ZFS, CSI-based solutions).
Experience driving multi-year infrastructure transformation programs.
Expertise in GitOps and IaC (Terraform, Ansible, Pulumi).
The Job Description is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change.
The annual base pay for this position is: $196, $295,200.00
F5 maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, geographic locations, and market conditions, as well as to reflect F5's differing products, industries, and lines of business.
You may also be offered incentive compensation, bonus, restricted stock units, and benefits.
More details about F5's benefits can be found at the following link:
F5 reserves the right to change or terminate any benefit plan without notice.
Please note that F5 only contacts candidates through F5 email address (ending with ) or auto email notification from Workday (ending with or ).
It is the policy of F5 to provide equal employment opportunities to all employees and employment applicants without regard to unlawful considerations of race, religion, color, national origin, sex, sexual orientation, gender identity or expression, age, sensory, physical, or mental disability, marital status, veteran or military status, genetic information, or any other classification protected by applicable local, state, or federal laws.
This policy applies to all aspects of employment, including, but not limited to, hiring, job assignment, compensation, promotion, benefits, training, discipline, and termination.
F5 offers a variety of reasonable accommodations for candidates. Requesting an accommodation is completely voluntary.F5 will assess the need for accommodations in the application process separately from those that may be needed to perform the job.
Request by contactingSimilar jobs
The ASE Cassandra SRE team develops applications and tooling that are safe, reliable, scalable, and fast. As a leader in this organization you will manage develop and grow · a team responsible for Cassandra's scalability and performance across Apple.You will also be responsible ...
1 month ago
We're seeking an experienced database systems engineering manager to join our Databases SRE organization. · The databases organization is expanding its footprint into Seattle and is seeking an experienced engineering manager to join and build out this site. · nThe Databases SRE ...
1 month ago
The Databases SRE organization covers multiple database technologies. · ...
1 month ago
We are seeking an experienced database systems engineering manager to join our Databases SRE organization. The databases organization is expanding its footprint into Seattle and is seeking an experienced engineering manager to join and build out this site. · The databases organiz ...
1 month ago
The Apple Service Engineering(ASE) team builds and provides systems and infrastructure that fuel Apple's services (such as iCloud, iTunes, Siri, and Maps). We are the foundation on which Apple's software developers build the products that our customers love. We are looking for a ...
1 week ago
We are looking for a passionate and dedicated Site Reliability Engineering Manager to lead a team which focuses on providing our customers the highest quality Apple Services experience. Our services have to scale globally, stay highly available, · Minimum 5+ years of handling ser ...
1 month ago
The Apple Service Engineering(ASE) team builds and provides systems and infrastructure that fuel Apple's services (such as iCloud, iTunes, Siri, and Maps). · We are looking for a passionate and dedicated Site Reliability Engineering Manager to lead a team which focuses on providi ...
1 month ago
We are looking for a high-impact SRE Ops Manager who can lead independently, · drive critical support initiatives, · and communicate effectively with senior stakeholders and clients.Own and drive resolution of production issues end-to-end · Proactively identify issues using obser ...
1 month ago
The Apple Service Engineering(ASE) team builds and provides systems and infrastructure that fuel Apple's services (such as iCloud, iTunes, Siri, · Maps). We are looking for a passionate and dedicated Site Reliability Engineering Manager · to lead teams which focuses on providing ...
1 week ago
Site Reliability Manager, Site Reliability Engineering, BigQuery SRE
Only for registered members
The Site Reliability Engineer (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services—both our internally critical and our externally-visible systems—have reliability, uptime a ...
1 week ago
Senior Site Reliability Engineer with strong background and experience in Azure Cloud, Terraform, Azure DevOps. · ...
4 days ago
Dice is the leading career destination for tech experts at every stage of their careers. Our client, ClifyX, is seeking the Site Reliability Engineer SRE with strong background and experience in Azure Cloud, Terraform, Azure DevOps, Azure API Services and Microsoft D365 Infrastru ...
1 week ago
Job Title: SRE Infrastructure Engineer · Work Location: Seattle WA (onsite) · Minimum years of experience:10 years · Job Details: · Must Have Skills (Top 3 technical skills only) * · 1. Handson experience as an SRE, DevOps, or Infrastructure Engineer in enterprise environments · ...
4 hours ago
Dice is the leading career destination for tech experts at every stage of their careers. · ...
1 week ago
Deploy and manage AI resources on Microsoft Azure. · Deploy and manage AI resources on Microsoft Azure, including AI Foundry and RAG solutionsMonitor and ensure service uptime, availability, reliability, and latencyTrack and integrate SRE metrics with enterprise monitoring system ...
2 weeks ago
The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes, tools, and automation for managing distributed systems in production environments. · You will help building next generation search infrastructure and platfo ...
1 month ago
The storage SRE teams of Apple Cloud are building and running the next generation distributed storage systems to support Apple's most critical services. · As a Storage SRE at Apple, you'll need to solve these problems using your deep understanding of storage, data analysis, progr ...
1 month ago
We are looking for seasoned software and systems engineers to join the Block Storage SRE team at Apple The role involves tremendous amount of individual responsibility and influence over the direction the platform, shaping its use by many critical Apple Cloud services for years t ...
1 month ago
The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes, tools, · and automation for managing distributed systems in production environments. · Bachelor's Degree in Computer Science, · ,,an engineering-related fie ...
1 month ago
We are looking for an SRE with experience in building and supporting highly available customer-facing services. · solve problems using data, teamwork, and your own expertise. · ...
1 month ago