Our client is searching for a Senior Linux Storage Engineer to join their Unix Engineering team. This is an excellent opportunity for someone who thrives in an exciting and challenging work environment. You will play a key role in defining and maintaining the storage environment. The position involves designing, planning, and implementing storage technologies, as well as the day-to-day administration of storage systems in the team's high-performance computing (HPC) environment and its Linux server and desktop environment. The team is responsible for 21 HPC clusters containing over 500 nodes and approximately 5 petabytes of storage, along with over 200 physical servers and desktops.
Core responsibilities will include:
Storage Engineering Duties – 75%
Design and deploy parallel or distributed filesystems
Work to ensure the deployed storage systems have high availability, stability, and performance.
Recommend standards for storage systems and configurations
Use configuration management and out-of-band management tools (Ansible, IPMI) to help maintain distributed computing environments.
Manage existing storage services, including capacity increases, data migrations between systems, and monitoring and metric collection
Plan and implement changes; review and validate change requests for all storage environments
Evaluate new technologies and develop roadmaps for storage services
Develop system processes and procedures so that users and administrators can find information and other system administrators can provide support coverage.
Perform root cause analysis of performance issues related to filesystems and I/O.
Collaborate with faculty and systems engineers on designs and storage architectures for new HPC clusters and servers
Senior Systems Engineering Duties – 25%
System installation, upgrades, and maintenance for high-performance computing clusters and Linux servers and workstations
Problem diagnosis and resolution on Linux workstations, servers, and HPC clusters
Qualifications
Experience maintaining high-performance filesystems such as Lustre, Ceph, or GlusterFS in production
Experience with provisioning and administering ZFS filesystems in production
Experience with ZFS snapshots and replication
Experience with NFS performance monitoring and tuning
Knowledge of InfiniBand and RDMA networking and configuration
HDJ + Associates is consistently named one of the top recruiting firms in the Pittsburgh area. We are a professional employment and search solutions company focused on recruiting the best possible talent available in today’s demanding marketplace.
Our clients often tell us that finding the right candidates to join their company is one of the most difficult tasks on their already overburdened task list.
Likewise, our candidates often tell us that finding the right position in today’s crowded marketplace is frustrating and overwhelming.
Let HDJ + Associates take the pressure out of the recruiting process for both candidates and employers. We will hit the employment bullseye each time, streamlining the recruiting process for success.