- Responsible for the reliability and support of the Container Platform on-prem and in external clouds (Azure/AWS/Google).
- Monitor and troubleshoot performance, connectivity, and security issues in the container platform (OpenShift) and Azure (AKS) environments.
- Perform deep dives into systemic and latent reliability issues; handle incident management and problem management.
- Identify, analyze, and resolve infrastructure vulnerabilities and application deployment issues.
- Perform blameless RCA and partner with engineering and operations teams across the organization to roll out fixes.
- Own application onboarding and provide troubleshooting support throughout the lifecycle of applications on the container platform.
- Identify and drive opportunities to improve automation, reduce toil, and improve operational excellence.
- Partner with risk and compliance teams to bring visibility, implement the right controls, and remediate vulnerabilities.
- Ensure resiliency during implementation, and identify and fix resiliency problems by collaborating with engineering teams.
- Be a key stakeholder in the design of cloud services and work with architecture, engineering, and product teams.
- Participate in 24x7 on-call coverage under a follow-the-sun model.
- BS/MS degree in Computer Science or a related technical field involving systems, or equivalent practical experience.
- 5+ years of hands-on experience supporting Kubernetes/OpenShift/AKS/EKS container platforms.
- Experience with Python, Ansible, Golang, and shell scripting
- Strong experience in major services related to Compute, Storage, Network and Security
- Experience with monitoring tools like Prometheus and Dynatrace, as well as cloud native tools like Azure Monitor and Log Analytics
- Strong understanding of, and background working with, complex IAM infrastructure, including Active Directory, Azure AD Connect, Azure AD, and Ping Identity or other SSO solutions.
- Advanced knowledge of Linux OS, DNS, DHCP, Kerberos and Windows Authentication
- Experience with CI/CD tools (Git/Jenkins) and the GitOps model
- Excellent understanding of Linux/Windows operating systems administration
- Experience in Container security and vulnerability remediation.
- Systematic problem-solving approach, sense of ownership and drive
- Ability to juggle competing priorities and adapt to changes in project scope.
- Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.
- Proven ability to work independently with minimal supervision and as part of a team with direct responsibilities.
- Experience with OpenShift and CSP Kubernetes services such as AKS and EKS
- Kubernetes/OpenShift/Terraform certifications are a plus
- Experience in Terraform, ArgoCD, Tekton, and Knative technologies.
- Experience in agile deployment methodologies (GitOps)
- Knowledge of various container runtimes
- Familiarity with the operator deployment pattern.
- Experience working in a highly available multi-datacenter environment
- Experience working with monitoring tools such as Prometheus, Splunk, Dynatrace, Sysdig, or similar tools.
- Understanding of cost management, inventory management, and the FinOps model
Site Reliability Engineer - Chandler, United States - Bank of America
Description
At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day.
One of the keys to driving Responsible Growth is being a great place to work for our teammates around the world. We're devoted to being a diverse and inclusive workplace for everyone. We hire individuals with a broad range of backgrounds and experiences and invest heavily in our teammates and their families by offering competitive benefits to support their physical, emotional, and financial well-being.
Bank of America believes both in the importance of working together and offering flexibility to our employees. We use a multi-faceted approach for flexibility, depending on the various roles in our organization.
Working at Bank of America will give you a great career with opportunities to learn, grow and make an impact, along with the power to make a difference. Join us!
About Bank of America – Global Technology:
Global Technology delivers technology services globally across the bank's eight lines of business that serve individuals, companies, and institutions. The team also focuses on digital banking, payments, infrastructure, data management and technology that enhances cyber security, and risk and capital management. Innovation is at the heart of all Global Technology does.
Enterprise Cloud Platforms Team:
The Enterprise Cloud Platforms team in the CTO organization offers private and public cloud platforms that help Bank of America's developers achieve faster time-to-market, innovate with private and public cloud capabilities, and reduce complexity through built-in integrations. We believe in a high-quality engineering culture: we engineer our platforms with a customer and platform mindset, design for large enterprise scale and resilience, and accelerate market innovation into the technical platforms we deliver.
As part of this team, you will have a large impact on the evolution of next generation Cloud services for Bank of America and explore an extensive list of new technologies that will drive innovation across our company.
We are seeking an experienced Site Reliability Engineer (SRE) to support and administer our Hybrid Cloud Container (OpenShift/AKS) platform.
Our HCCP Service Reliability Engineers (SREs) ensure that our platform meets the reliability and uptime requirements of our demanding enterprise customers. This is achieved through the best engineering practices, resilient design, and a well-defined, effective global on-call rotation that runs 24x7.
The role provides an opportunity to work with a wide range of technologies and a unique perspective on how various services (on-prem/off-prem) interact with each other. You will work with colleagues who are as smart, hardworking, and driven as you. You will get an opportunity to work in a team that keeps growing and innovating, and that gives you room to be proactive and creative.
Are you ready for the next step in your career? Then we'd love to hear from you!
Position Summary
Required Skills
Desired Skills
Shift:
1st shift (United States of America)
Hours Per Week:
40