Data Center Facility Operations Reliability Engineer - Denver, United States - META

    Description

    Meta is seeking an experienced and self-motivated Reliability Engineer to join our Global Asset Management team within Facility Operations. This person will work at the leading edge of Facility Operations to identify and manage asset reliability risks that could adversely affect data center operations. Managing stakeholders spread across time zones is a significant challenge and key to the success of our individual projects and overall asset management, quality and reliability program.

    Data Center Facility Operations Reliability Engineer Responsibilities

    • Support the asset care and maintenance strategies for critical assets based on Meta processes
    • Support the development of standards, guidelines and processes to execute reliability program function
    • Lead and facilitate asset criticality assessments, RCM studies, PM Optimization and other reliability studies
    • Perform reliability analytics, including Weibull distribution analysis, Monte Carlo simulation and other reliability analyses
    • Act as liaison between Reliability and other partner teams
    • Support the development of a standardized PM template to facilitate trending
    • Work with appropriate technical teams to evaluate the reliability and maintainability of data center equipment and drive significant improvements in both
    • Work with Asset Management and Quality teams to evaluate failure data and other information and build it into a global reliability database
    • Provide input for key documents such as reliability process playbooks, executive briefs/presentations and program metrics
    • Define, design, develop, monitor, and refine an asset maintenance plan that includes both (a) value-added preventive maintenance tasks and (b) predictive and other non-destructive testing methods designed to identify and isolate inherent reliability issues
    • Provide technical support to Operations, Maintenance management, and technical personnel
    • Apply value analysis to repair/replace, repair/redesign, and make/buy decisions
    • Develop Reliability Improvement Process (RIP) reports on critical asset failures
    • Develop or recommend engineering solutions to repetitive failures and all other problems that adversely affect plant operations
    • Support the Master Data and asset onboarding processes
    • Support the development and stewardship of maintenance strategies
    • Support the spares development and sustainment program
    • Work with Maintenance to analyze asset characteristics, including asset availability, overall equipment effectiveness and remaining useful life

    Minimum Qualifications

    • 10+ years of experience in reliability engineering (related to electrical or mechanical cooling equipment)
    • Bachelor's degree in Mechanical, Electrical or Reliability Engineering, or a similar technical discipline
    • Knowledgeable of relevant ISO standards (ISO 14224, ISO 17359, ISO
    • Proficient in using EAM solutions to extract data and develop meaningful insights
    • Experienced in Reliability Centered Maintenance (RCM) and Failure Mode and Effects Analysis (FMEA) activities for maintenance/process/equipment design optimization to meet reliability requirements

    Preferred Qualifications

    • Proficient in developing and executing Design for Reliability activities (reliability prediction models, accelerated life testing, environmental stress testing, equipment qualification criteria, etc.)
    • Experience in building Reliability, Availability and Maintainability (RAM) models
    • Experience with data center equipment such as critical cooling systems, generators, main switchboards and network gear
    • Certifications in Maintenance & Reliability such as CMRP, CRL, CRE