HPC Professional Engineer - Arlington, VA - TS/SCI IT
Arlington, VA 
Share
Posted 15 days ago
Job Description

HPC Professional Engineer - Arlington, VA - TS/SCI

As the on-site HPC Professional Engineer you will assist in tasks and directions provided by the customer to include:

  • HPC configuration, management and maintenance.
  • Extended knowledge transfer.
  • SID documentation (planning, cables, labeling, switches, etc.)
  • Addressing hardware failures and support tickets.
  • Validation of firmware versions and settings.
  • Validation of HPC software versions and settings.
  • Validation of Dell HPC best practices.
  • Assist with benchmark testing.
    • High Performance Linpack (HPL), Alltoall, bidirectional bandwidth and Stream.
  • Knowledge of gpfs storage.
  • Experience Ubuntu and SLES.
  • Network (IB & Enet) testing and management.
  • Experience with Kubernetes is a plus.

During the residency service, Services personnel may perform the following over the duration of the engagement:

Administration:
  • Monitors, reviews, and manages Dell infrastructure listed in the SOW.
  • Manages user requests, Manages and reviews log files
  • Generate regular operational reports.
  • Provide capacity planning.
  • Assist with disaster recovery planning and design.
Problem Management:
  • Isolates and troubleshoots incidents.
  • Performs service incident coordination.
  • Opens service requests on behalf of the Customer.
  • Participates in root cause analysis review.
Change Management:
  • Performs software/firmware management assistance and collaboration.
  • Implements change management requests.
  • Assists with solution documentation of policies and procedures in conjunction with the compliance manager(s) and with key stakeholders.
  • Monitor's migration activities
Continual Service Improvement:
  • Recommends procedure changes that result in operational optimization.
  • Shares best practices from other engagements.
  • Provides performance tuning recommendations.
  • Post Implementation Planning and Knowledge Sharing:
  • Works with customer's technical leadership on an ongoing basis to ensure they have awareness of system status and discuss architectural design, strategies and plans for the future.
  • Performs transition planning with deployment team.
  • Performs incremental host and network configuration beyond deployment scope.

Standard Operating Procedures:
  • Provides recommendations on product enhancements and upgrades.
  • Implements Dell EMC System Management Tools
  • Works with customer staff to develop Run Books (document products and environment, including system information, code level, access instructions, configuration, "how tos")
Change Evaluation and Recommendations:
  • Reviews IT processes and policies (Incident, capacity, performance and change management, user, and back up policy) - as part of new solution or continuous improvement.
  • Assists with the solution documentation of policies and procedures in conjunction with the compliance manager(s) and with other key stakeholders.
  • Conducts knowledge transfer to address the Customer's skills and resource gaps as well as technology recommendations.
Task:
Seeking experience with HPC environments, background in Ubuntu Linux, and knowledge of front end IP networking.

Qualifications:
  • Must have an active TS/SCI security clearance
  • Bachelors Degree and 10 years experience, Associates and 12-15 years of equivalent years experience or 16 plus years experience


Employment Pre-requisites
The following requirements must be met to be eligible for this position: Successful completion of a background investigation, and drug urinalysis.

SOC, a Day & Zimmermann company, is an Equal Opportunity Employer, EOE AA M/F/Vet/Disability.

#INDSOC

 

Job Summary
Company
Start Date
As soon as possible
Employment Term and Type
Regular, Full Time
Required Education
Bachelor's Degree
Required Experience
10+ years
Email this Job to Yourself or a Friend
Indicates required fields