
    High Performance Computing Engineer - Washington, United States - Howard University

    Full time
    Description

    High Performance Computing Engineer

    Location: Interdisciplinary Research Building
    Time type: Full time
    Job requisition ID: JR104450
    The Talent Acquisition department hires qualified candidates to fill positions which contribute to the overall strategic success of Howard University. Hiring staff "for fit" makes significant contributions to Howard University's overall mission.

    BASIC FUNCTION:

    The purpose of this position is to serve as the High-Performance Computing Engineer for the Research Institute for Tactical Autonomy (RITA), a University Affiliated Research Center (UARC).

    SUPERVISORY ACCOUNTABILITY:

    Typically responsible for performing some non-supervisory duties in addition to supervisory responsibilities.

    NATURE AND SCOPE:

    Serves as the High-Performance Computing Engineer for the Research Institute for Tactical Autonomy (RITA). Internal contacts include executives, administrators, faculty, staff, and students of the departments and the University at large, with special emphasis on HR, ETS, Finance, the Office of Research, and the Office of the CFO. External contacts include representatives from the federal government, other colleges and universities, professional associations, consultants, vendors, and the general public.

    PRINCIPAL ACCOUNTABILITIES:

    Responsible for understanding the concepts, procedures, and guidelines needed to solve highly complex problems in the maintenance of hardware/software network infrastructure.

    Performs system set-up, experiments, and diagnostics to evaluate printed circuit board exchanges; troubleshoots and makes component repairs based on test results.

    Configures and operates multipurpose, multi-tasking computer systems.

    Supports day-to-day operations for the Computing team by monitoring computing resource performance, managing configurations, and addressing security administration. Applies revisions to system firmware and software. Engages and collaborates with vendors to assist with support activities as required.

    Performs training and support for technical staff in the use of new software and hardware, either developed or acquired.

    Creates, deletes, maintains and manages HPC researcher accounts and logins for RITA staff, performs system back-ups, and maintains system configuration files.

    Installs, configures, modifies, tunes and maintains various research software applications for access on HPC clusters.

    Performs researcher support and documentation for software applications, programs and enterprise services.

    Designs, installs, configures, and performs document management for cluster infrastructure, including operating systems, job schedulers, resource managers, provisioning managers, configuration managers, network devices, and other components.

    Investigates, debugs, and addresses researcher inquiries and requests efficiently through a customer issue ticketing system. Communicates complex technical concepts in simple, straightforward language.

    Explores emerging technologies and technical developments to address expanding analytical requirements. Identifies new services and develops implementation plans. Stays current with best practices in the HPC field. Maintains collaborative relationships with peer HPC research organizations.

    Performs other related duties as assigned or requested.

    CORE COMPETENCIES:

    Familiarity with low-latency/high-bandwidth, interconnected infrastructure (including InfiniBand, 10/100GigE, and others).

    Expertise with HPC system software, including cluster management tools, job schedulers, and other HPC tools such as Slurm and Ansible.

    Proficiency in fundamental programming (TensorFlow, PyTorch, ML/AI tools, Python, C/C++, or similar languages). Expertise in administering, monitoring, and maintaining secure Linux/Unix operating systems (CentOS).

    Knowledge of HPC storage (FC, SAS) principles, file systems (NFS, Lustre, BeeGFS, ZFS, etc.), compute node storage, and Network Attached Storage.

    Proficiency with web interfacing of ML/AI tools such as TensorFlow and PyTorch.

    Ability to drive technical leadership and management of complex, large-scale computing system projects.

    Proficiency with multi-vendor management, security and network/Internet protocols.

    Demonstrated expertise in design, configuration, and planning, with excellent organization skills and the ability to identify and resolve problems and manage performance.

    Excellent written and oral communication skills, with experience presenting technical topics to nontechnical audiences.

    Ability to establish processes for maintaining system performance and managing best-in-class standards.

    Knowledge of computer applications and experience with accompanying user-friendly software, e.g., Workday, word processing, spreadsheet, database, Outlook, presentation, etc.

    Excellent leadership, training and developmental skills.

    Skill in oral and written (English) communications, with the ability to explain complicated fiscal and budgetary processes to lay persons and to make public presentations.

    Strong organizational skills to establish priorities, meet deadlines, and perform in a responsible, professional manner.

    Ability to maintain harmonious working relationships with staff, students, faculty, University officials, and the general public.

    Skill in leadership with ability to delegate tasks and assignments appropriately.

    Ability to manage cross-functional teams, delegate tasks, and promote and direct staff development.

    Ability to conduct research and to compile and prepare comprehensive, complex financial and budget reports.

    Ability to keep abreast of and adhere to new policies initiated by changes in federal, District of Columbia or University regulations and to communicate this information to others.

    Strong decision-making skills.

    MINIMUM REQUIREMENTS:

    Bachelor's degree (foreign equivalent or higher) in a relevant field, such as computer science, computer information systems, etc.

    Four or more years of experience in one of the following fields: information technology, HPC system administration, network engineering, or large-scale HPC file systems.

    Relevant experience must include Linux and scripting (for example, Python) and ML/AI programming such as TensorFlow, PyTorch, etc. In addition, it may also include maintaining hardware or software over their lifecycle (i.e., requirements analysis, implementation, testing, integration, deployment/installation, and maintenance), or computer/network security, or high-performance computing.

    Experience with cloud computing and container technologies.

