- Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles.
- Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs).
- Proactively identify and address potential issues before they impact operations, utilizing observability tools like New Relic, Scalyr/Splunk, bash scripts, and Python scripts.
- Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices.
- Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence.
- Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions.
- Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices.
- Design and maintain robust production monitoring systems to ensure timely detection and resolution of issues, following SRE guidelines for effective monitoring and alerting.
- Utilize a diverse array of tools to troubleshoot performance and stability issues effectively, employing SRE methodologies to identify and mitigate bottlenecks.
- Evaluate and enhance application and environment security measures, integrating SRE-driven security practices into the development and deployment pipelines.
- Provide support for globally distributed, multi-cloud (public and/or private) environments, implementing SRE strategies for resilience and fault tolerance.
- Automate repetitive tasks at scale to streamline operational workflows and enhance efficiency, focusing on the implementation of SRE-driven automation solutions.
- Adhere to change management processes during implementations and utilize version control for application infrastructure, following SRE principles for reliable and auditable change management.
- Foster a SRE mindset throughout the organization, promoting collaboration and shared responsibility for reliability and performance
- Bachelor's Degree in Computer Science or related field, or foreign equivalent.
- Demonstrated curiosity and self-drive to tackle complex challenges and drive change in a diverse organizational landscape.
- Excellent written and verbal communication skills, with the ability to effectively communicate with engineering management, developers, and leadership.
- Proven ability to adapt to new technologies and learn quickly.
- Minimum of 5 years of experience in Site Reliability Engineering (SRE) or related roles.
-
Site Reliability Engineer
3 weeks ago
Amtex Systems Marlborough, United StatesJob Title: Lead Site Reliability Engineer · Duration: 6 months Contract to hire · Location: Marlborough, MA (Hybrid) · Responsibilities: · Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engin ...
-
Lead Site Reliability Engineer
3 days ago
BJ's Wholesale Club Marlborough, United StatesBJs Wholesale Club · Lead Site Reliability Engineer · BJ's Club Support Center Marlborough , · Massachusetts · Apply Now · Join our team of more than 34,000 team members, supporting our members and communities in our Club Support Center, 235+ clubs and eight distribution cen ...
-
Lead Site Reliability Engineer
3 weeks ago
BJ's Wholesale Club Fayville, United StatesJoin our team of more than 34,000 team members, supporting our members and communities in our Club Support Center, 235+ clubs and eight distribution centers. BJ's Wholesale Club offers a collaborative and inclusive environment where all team members can learn, grow and be their a ...
-
Senior Reliability Engineer II
3 weeks ago
Saint-Gobain Northborough, United StatesPourquoi a-t-on besoin de vous ? · CertainTeed, a subsidiarity of Saint-Gobain, develops innovative solar roofing that combines aesthetics, high energy efficiency, and durability. Our R&D team designs and validates electrical and mechanical assemblies which together with the pho ...
-
Senior Reliability Engineer II
5 days ago
Saint-Gobain Northborough, United StatesPourquoi a-t-on besoin de vous ? · CertainTeed, a subsidiarity of Saint-Gobain, develops innovative solar roofing that combines aesthetics, high energy efficiency, and durability. Our R&D team designs and validates electrical and mechanical assemblies which together with the pho ...
-
Reliability Engineering
3 weeks ago
Akamai Cambridge, United StatesJob Title: Manager.Senior.Site Reliability Engineering · Work Location: 145 Broadway, Cambridge, MA 02142 · Job Description: · Akamai Technologies, Inc. is hiring for the following role in Cambridge, MA (multiple openings): · Manager.Senior.Site Reliability Engineering · Col ...
-
Reliability Engineering
3 weeks ago
Akamai Cambridge, United StatesJob Title: Manager.Senior.Site Reliability Engineering · Work Location: 145 Broadway, Cambridge, MA 02142 · Job Description: · Akamai Technologies, Inc. is hiring for the following role in · Cambridge, MA · (multiple openings): · Manager.Senior.Site Reliability Engineering · Co ...
-
Senior Reliability Engineer
2 weeks ago
Takeda Pharmaceutical Company Ltd Hopkinton, United StatesBy clicking the Apply button, I understand that my employment application process with Takeda will commence and that the information I provide in my application will be processed in line with Takedas Privacy Notice and Terms of Use. I further attest that all information I submit ...
-
Lead Site Reliability Engineer
3 weeks ago
BJ's Wholesale Club Stow, United StatesJoin our team of more than 34,000 team members, supporting our members and communities in our Club Support Center, 235+ clubs and eight distribution centers. BJs Wholesale Club offers a collaborative and inclusive environment where all team members can learn, grow and be their au ...
-
HW Reliability Engineer, Amazon Robotics
2 weeks ago
Amazon Inc Westborough, United StatesAre you inspired by invention? Is problem solving through teamwork in your DNA? Do you like the idea of seeing how your work impacts the bigger picture? Answer yes to any of these and youll fit right in here at Amazon Robotics. We are a smart team of Reliability Engineer, Robotic ...
-
Senior Reliability Engineer
3 weeks ago
Takeda Pharmaceutical Company Ltd Cambridge, United StatesBy clicking the "Apply" button, I understand that my employment application process with Takeda will commence and that the information I provide in my application will be processed in line with Takeda's Privacy Notice and Terms of Use. I further attest that all information I subm ...
-
Equipment Reliability Engineer
1 week ago
Entegris Bedford, United StatesJob Title: · Equipment Reliability Engineer · Job Description: · The Role: · The Entegris Manufacturing site in Bedford, MA is seeking for Equipment Reliability Engineer to join our team. · The Equipment Reliability engineer is responsible for improving the reliability of mu ...
-
Senior Reliability Engineer
2 weeks ago
Takeda Pharmaceutical Company Ltd Cambridge, United StatesBy clicking the "Apply" button, I understand that my employment application process with Takeda will commence and that the information I provide in my application will be processed in line with Takeda's Privacy Notice and Terms of Use . I further attest that all information I sub ...
-
Site Reliability Engineer
2 weeks ago
SS&C Technologies Holdings, Inc. Watertown, United StatesDay to Day:Responds to and resolves escalated incidents for customer issues or monitoring alerts. In-depth analysis of incident root cause;Working with R&D and architecture teams on defects and runtime inefficiencies identified in the production envi Reliability Engineer, Liabili ...
-
Equipment Reliability Engineer
3 weeks ago
Entegris Bedford, United StatesThe Role: · The Entegris Manufacturing site in Bedford, MA is seeking for Equipment Reliability Engineer to join our team. · The Equipment Reliability engineer is responsible for improving the reliability of multiple manufacturing equipment by increasing Mean Time to Failure wi ...
-
Site Reliability Engineer
4 days ago
Dentsply Sirona Waltham, United StatesDentsply Sirona is the world's largest manufacturer of professional dental products and technologies, with a 130-year history of innovation and service to the dental industry and patients worldwide. Dentsply Sirona develops, manufactures, and markets a comprehensive solutions off ...
-
Lead Reliability Engineer
5 days ago
The MITRE Corporation Bedford, United StatesDepartment Summary:The Mechanical & Reliability Systems and Prototype Development Department provides innovative solutions for electronic prototypes designed for advanced sensing, communication, and navigation systems. Areas of expertise include grou Reliability Engineer, Liabili ...
-
Lead Reliability Engineer
6 days ago
The MITRE Corporation Bedford, United StatesThe MITRE Corporation · Lead Reliability Engineer · Bedford , · Massachusetts · Apply Now · Why choose between doing meaningful work and having a fulfilling life? At MITRE, you can have both. That's because MITRE people are committed to tackling our nation's toughest challen ...
-
WBG Reliability Engineer
3 weeks ago
Analog Devices Wilmington, United StatesAnalog Devices, Inc. (NASDAQ: ADI) is a global semiconductor leader that bridges the physical and digital worlds to enable breakthroughs at the Intelligent Edge. ADI combines analog, digital, and software technologies into solutions that help drive advancements in digitized facto ...
-
Site Reliability Engineer
1 week ago
Sun Life Financial Wellesley, United StatesSite Reliability Engineer page is loaded · Site Reliability Engineer · Apply · locations · Wellesley Hills, Massachusetts · U.S. Employees (Remote) · time type · Full time · posted on · Posted 2 Days Ago · job requisition id · JR · You are as unique as your background ...
Site Reliability Engineer - Marlborough, United States - Amtex Systems Inc.
Description
Job Title: Lead Site Reliability Engineer
Duration: 6 months Contract to hire
Location: Marlborough, MA (Hybrid)
Responsibilities:
Qualifications: