Sr. Systems Engineer - Cincinnati, United States - Vaco

    Default job background
    Description

    Vaco has partnered with a leading healthcare provider in their search for a Senior Systems Engineer. In this role you will: provide administration of HPC scheduler and resource management systems; build custom software modules in an HPC setting, requiring knowledge of programming language compilers (e.g. gcc) and the typical software build process on Linux systems; perform advanced HPC job troubleshooting with end users and Linux system build and operations support for RHEL/OEL/CentOS based distributions.

    The senior role is expected to be able to lead medium sized projects and ensure project tasks are moving forward with minimal supervision. This would include (but not limited to): coordination with other teams across IS, developing proof-of-concepts, scheduling downtimes, training other team members, and documentation. Individuals in this position will work closely with developers, data scientists and platform users to assess needs and develop innovative and/or custom technology solutions.

    RESPONSIBILITIES

    *Project management

    Participate in the design, development, and implementation of new and enhanced application requests. Identify the appropriate resources needed to complete small projects. Support the communication between internal and external parties on project related issues and developments. Participate in developing and managing project plans. Determine the scope and complexity of small to midsized projects. Work with cross functional teams.

    *Analysis

    Analyze, design, implement, and maintain moderately complex systems that greatly improves clinical care and patient management. Support system testing. Document testing outcomes. Work to develop technical solutions. Work to design, write, and prepare complete user and technical documentation. Analyze existing documentation and provide corrections and enhancement. Utilizes Development lifecycle process, operating procedures and documentation to implement and support system solutions.

    *Technical support

    Provide technical support and problem resolution assistance for production and process issues. Troubleshoot and decipher error messages. Identify required resources to resolve minor to midsized issues. Utilizes appropriate Change Control methods to implement system solutions. Serve as a resource person for and as a liaison between various departments and Information Services. Support departmental efforts to improve customer satisfaction. Evaluate and monitor system performance and functionality to avoid potential issues as well as gathering information for future development needs or feasibility studies. Participate in on-call support rotation and handle incident resolution, problem determination and resolution during that time.

    *Technical support

    Ensure outstanding end-user support is provided, including ongoing monitoring of Service Level Agreements for incident management and collaboration with other areas to ensure customer-centered incident management and support. Adhere to and promote continual adoption of change management policies and procedures. Model outstanding customer service behavior, including timely and effective follow-up with customers.

    *Knowledge

    Develop knowledge and professional skills through cross-training, literature and attendance at department meetings and vendor education. Develop and maintain positive relationships, both internal and external. Motivate people and encourage teamwork. Work well with others and fosters a positive team environment. Prepare oral and written presentations. Conduct and participate in instructional seminars. Develop expertise in several computer-based systems.

    QUALIFICATIONS:

    • Bachelor's degree in a related field or equivalent combination of education and experience
    • 5-7 years of work experience
    • RHEL-based system administration
    • Minimum 2 years of HPC cluster administration experience
    • TCP/IP and networking fundamentals (DNS, DHCP)
    • Scripting/Programming (bash, Python)
    • Docker/Container knowledge
    • Kubernetes (Rancher) exposure
    • Basic C/C++/Java know
    • Experience with Puppet/Satellite or related configuration management tools
    • Experience configuring LDAP, DNS, networking, storage, services and logging
    • AI or Machine learning knowledge
    • Experience with NVIDIA GPUs and related toolsets