Junior HPC Systems Engineer - Mooresville, United States - Corvid Technologies

    Corvid Technologies Mooresville, United States

    4 weeks ago

    Description


    Corvid Technologies is seeking a Junior HPC Systems Engineer with a strong background in and enthusiasm for Linux to support our Linux-based High Performance Computer consisting of 80,000+ processor cores.

    If you enjoy learning, playing with hardware, and optimizing performance and efficiency, and you spend most of your time on the command line, this is the job for you.


    This candidate will be responsible for the following:
    Supporting software installation and configuration of license management servers (e.g., FlexLM or RLM)
    Troubleshooting slow, hanging, or failing HPC jobs on internal or customer HPC clusters
    Automating repetitive tasks and implementing custom solutions using scripting/programming languages such as bash or Python
    Learning to troubleshoot hardware and software issues on Linux servers, and how to install, configure, and design HPC systems
    Designing, testing, and implementing an HPC environment consisting of a provisioner (Warewulf), scheduler (Slurm), RDMA connections (e.g., InfiniBand), a subnet manager, head node, storage node, and 5+ compute nodes within the first 180 days of employment
    Obtaining a CompTIA Security+ certification within the first year of employment

    Requirements:
    Bachelor's degree in Engineering or related STEM field
    2+ years of scripting experience (including during education)
    2+ years of professional/personal experience using command line Linux (RHEL derivatives preferred)
    Experience with one or more engineering computational codes OR 2+ years of IT-related experience (e.g., basic networking, a home Linux environment)
    US Citizenship
    Ability to obtain and maintain a U.S. security clearance

    Preferred Skills:
    Past experience as an HPC user on a large-scale cluster
    Experience configuring, installing, and troubleshooting MPI and OpenMP applications
    Experience configuring, installing, tuning, and maintaining scientific software on large-scale systems
    Experience with configuration management tools such as Ansible or Puppet
    Experience configuring, installing, maintaining, and using performance monitoring and optimization tools
    Experience with Linux operating systems (e.g., Ubuntu, Rocky Linux), including RHEL-derivative operating systems
    Experience with at least one Linux filesystem (e.g., ZFS, XFS, ext4)
    Experience with RAID arrays

    Why Corvid:


    Founded in 2004, we are a group of over 300 engineers and scientists, about 3/4 with master's degrees or Ph.D.s, who provide end-to-end solutions including concept development, design and optimization, prototype build, test, and manufacture.

    We leverage the predictive capability of our high-fidelity computational physics solvers, indigenous massively parallel supercomputer system, prototyping plant, and ballistics and mechanics lab to investigate a variety of high-rate physics phenomena.

    The results are complex engineering solutions for a variety of applications: aircraft, ballistic missile defense, cybersecurity, motorsports, armor development, biological systems, and missile and warhead design and development.

    These results are achieved with optimal design and cost efficiency due to the predictive capability of Corvid's tools and our in-house, end-to-end integrated approach, which differentiates Corvid from the market.

    We value our people and offer employees a broad range of benefits.

    Benefits for full-time employees include:
    Paid gym membership
    Flexible schedules
    Blue Cross Blue Shield insurance including Medical, Dental and Vision
    401k match up to 6%
    Three weeks of starting PTO, increasing with tenure
    Continued education and training opportunities
