HPC Storage Systems Analyst

Lawrence Berkeley National Lab’s (LBNL) NERSC Division has an opening for an HPC Storage Systems Analyst to join the team.

 

In this exciting role, you will join the Storage Systems Group which is made up of system engineers and programmers providing NERSC’s 300 petabyte High Performance Storage System and center-wide, parallel file systems. Our storage systems are utilized by more than 8,000 scientists who use NERSC to perform unclassified, scientific research across a wide range of disciplines, including climate modeling, research into new materials, simulations of the early universe, high energy physics and a host of other scientific endeavors.

 

In addition to system operation responsibilities, you will lead and contribute to ongoing efforts to develop a storage strategy and plan the associated architecture and integration with NERSC’s HPC infrastructure. You will lead and contribute to the evaluation of existing and emerging storage systems for High Performance Computing (HPC) and AI/ML, including analyzing the performance characteristics of leading-edge DOE Office of Science workloads and workflows on these systems. This position requires knowledge of storage system architectures as well as associated interconnects and networks. The HPC Storage Systems Analyst will also work with peers at other leading HPC facilities and vendor engineering teams to evaluate emerging storage technologies and define future directions for deployment.

 

You will also participate in regular cross-team efforts to integrate our storage systems with NERSC’s computational and networking infrastructure, troubleshoot performance issues at scale, and develop innovative solutions to continuously optimize operational and user productivity. 

 

What You Will Do:

  • Monitor, administer, and optimize NERSC’s distributed parallel file systems, block storage arrays, and auxiliary Linux-based storage servers.
  • Analyze, troubleshoot, and resolve complex problems that arise in NERSC’s production storage hardware, software systems, storage networks and systems that utilize NERSC storage systems.
  • Participate in the planning and execution of cross-team maintenance activities, upgrades, and deployments at scale.
  • Provide off-hours emergency support in a shared, on-call rotation for a subset of NERSC storage systems.

 

Additional Responsibilities as needed:

  • Contribute to evaluation and benchmarking of existing and emerging storage systems.
  • Measure and analyze the performance of NERSC’s evolving workloads on current and future storage systems.
  • Propose remedies to identified bottlenecks via tuning and/or architectural improvement with comprehensive understanding of any trade-offs in design, cost, and operational effects.
  • Prepare timely reports, papers, and lectures describing significant results for dissemination within NERSC and throughout the broader HPC research community.
  • Provide technical conceptual guidance to other group members and management and suggest directions for investigation.
  • Participate in the NERSC decision-making process for acquisition of new HPC storage systems.
  • Participate in NERSC’s outreach activities through written documents, presentations, and developing peer-to-peer contacts with other professionals in the HPC field.
  • Conduct research related to NERSC’s interests and ensure that any relevant research outcomes are deployed in production at NERSC.
  • Proactively seek opportunities to collaborate with researchers, operators, and vendors across the global HPC community to apply the best ideas and solutions to solving NERSC’s technical challenges.

 

What is Required:

  • Bachelor’s degree and a minimum of eight years of computing or storage experience; or six years and a Master’s degree; or equivalent experience.
  • Demonstrated expertise with programs used for performance evaluation (e.g., IOR, fio, SPECstorage).
  • Experience with the use of scripting languages and system utilities such as configure, Perl, Python, UNIX shell scripts, and “make.”
  • Experience leading technical projects in a highly collaborative team environment.
  • Strong understanding of Linux fundamentals including file systems, networking, and virtual memory management.
  • Understanding of file system internals, prior work developing storage systems, or experience troubleshooting and optimizing parallel I/O.
  • Knowledge of storage system and computer architecture used in HPC.
  • Working knowledge of parallel storage technologies such as distributed storage systems, parallel file systems, object stores, hierarchical storage management, storage networking, and/or relevant hardware technologies.
  • Proven record of working effectively in a team, seeing projects through to completion, meeting deadlines, interacting with users, and thoroughly documenting contributions.
  • Familiarity with industry-standard benchmark programs and methods.
  • A demonstrated ability to lead technical efforts with teams of people.
  • Ability to write and present technical talks at conferences and other venues.
  • Excellent written and oral communication skills.
  • Strong organizational skills and ability to effectively manage priorities across many projects ranging from immediate problem resolution to long-term strategic planning.
  • Ability to work effectively and collaboratively on a team, as well as give and receive constructive feedback to foster communication and trust.

 

Want to learn more about Berkeley Lab’s Culture, Benefits and answers to FAQs? Please visit: https://recruiting.lbl.gov/

 

Notes:

  • This is a full-time, career appointment, exempt (monthly paid) from overtime pay.
  • This position may be subject to a background check. Any convictions will be evaluated to determine if they directly relate to the responsibilities and requirements of the position. Having a conviction history will not automatically disqualify an applicant from being considered for employment.
  • Work will be primarily performed at Lawrence Berkeley National Lab, 1 Cyclotron Road, Berkeley, CA.

 

Based on University of California Policy – SARS-CoV-2 (COVID-19) Vaccination Program and U.S. Federal Government requirements, Berkeley Lab requires that all members of our community obtain the COVID-19 vaccine as soon as they are eligible. As a condition of employment at Berkeley Lab, all Covered Individuals must Participate in the COVID-19 Vaccination Program by providing proof that vaccination requirements have been met or submitting a request for Exception or Deferral. Visit covid.lbl.gov for more information.

 

Berkeley Lab is committed to Inclusion, Diversity, Equity and Accountability (IDEA) and strives to continue building community with these shared values and commitments. Berkeley Lab is an Equal Opportunity and Affirmative Action Employer. We heartily welcome applications from women, minorities, veterans, and all who would contribute to the Lab’s mission of leading scientific discovery, inclusion, and professionalism. In support of our diverse global community, all qualified applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, or protected veteran status.

 

Equal Opportunity and IDEA Information Links:

Know your rights: see the supplement “Equal Employment Opportunity is the Law” and the Pay Transparency Nondiscrimination Provision under 41 CFR 60-1.4.

 
