Thursday, August 05, 2010

National Renewable Energy Laboratory - Senior HPC Systems Administrator

Senior HPC Systems Administrator - National Renewable Energy Laboratory
Posted by: "huntaroundnow" penny.hunt@prodigy.net

The National Renewable Energy Laboratory (NREL), located at the foothills of the Rocky Mountains in Golden, Colorado, is the nation's primary laboratory for research, development, and deployment of renewable energy and energy efficiency technologies.

The Data, Informatics, and Systems Group in the NREL Computational Science Center has an immediate opening for a full-time senior high performance computing (HPC) system administrator with an emphasis on HPC environments, Linux system administration, and networking.

The Computational Science Center hosts and supports NREL's HPC systems with capability at the petascale.

The position involves integrating the technologies needed to provide direct and immediate support of high performance computing environments, including hardware, software, and users.

Perform basic UNIX system administration duties - install and maintain UNIX operating systems and applications; administer user IDs; back up and restore data; troubleshoot system and network problems; perform minor hardware replacement such as disks, memory, and adapter cards; and respond to user requests for technical assistance.

High Performance Computing administration:

- Manage nodes and the HPC network; configure and support scheduling tools, queue management, job submission, and job status; support the HPC software and application stack; assist the scientific community with HPC support issues and topics; compile using standard tools and scripting.

Documentation - Document procedures, best-practice system use, server upgrades, configuration changes, application installations, hardware changes, and backup schedules.

Relevant Bachelor's degree and 10 years of experience, or equivalent relevant education/experience.

• Minimum of 5 years of experience in UNIX system administration

• Extremely strong Linux (RedHat/CentOS) skills including architecture, implementation, monitoring and troubleshooting

• Knowledge and understanding of the UNIX environment, including standard daemons, protocols, and applications (DNS, DHCP, NFS, SSH, SNMP, Apache, sendmail, etc.)

• Familiarity with programming languages and compilers

• Familiarity with Linux clusters and batch scheduling software

• Experience with tape libraries and archival/backup strategies

• Knowledge and experience with UNIX scripting

• Knowledge and experience with network concepts including Ethernet, InfiniBand, and Myrinet

• Familiarity with parallel file systems such as Lustre

• Ability to communicate clearly, verbally and in writing, with varying audiences regarding technical issues

• Ability to problem solve in a group setting and participate in pair problem-solving sessions


EEO Policy: NREL's policy is to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, religion, national origin, marital or veteran status, or any other legally protected status.

Find more great opportunities at: www.nrel.gov/employment