AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
Article Link
Collect
Submit Manuscript
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Survey

Ad Hoc File Systems for High-Performance Computing

Zentrum für Datenverarbeitung, Johannes Gutenberg University Mainz, Mainz 55128, Germany
Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, CA 94550, U.S.A.
Department of Computer Science, Florida State University, Tallahassee, FL 32306, U.S.A.
Mathematics and Computer Science Division, Argonne National Laboratory, Lemont, IL 60439, U.S.A.
Department of Computer Architecture, Universitat Politecnica de Catalunya, Barcelona 08034, Spain
Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, U.S.A.
Computer Science Department, Barcelona Supercomputing Center, Barcelona 08034, Spain
Fraunhofer Institute for Industrial Mathematics ITWM, Fraunhofer-Platz 1, Kaiserslautern 67663, Germany
Show Author Information

Abstract

Storage backends of parallel compute clusters are still based mostly on magnetic disks, while newer and faster storage technologies such as flash-based SSDs or non-volatile random access memory (NVRAM) are deployed within compute nodes. Including these new storage technologies into scientific workflows is unfortunately today a mostly manual task, and most scientists therefore do not take advantage of the faster storage media. One approach to systematically include nodelocal SSDs or NVRAMs into scientific workflows is to deploy ad hoc file systems over a set of compute nodes, which serve as temporary storage systems for single applications or longer-running campaigns. This paper presents results from the Dagstuhl Seminar 17202 “Challenges and Opportunities of User-Level File Systems for HPC” and discusses application scenarios as well as design strategies for ad hoc file systems using node-local storage media. The discussion includes open research questions, such as how to couple ad hoc file systems with the batch scheduling environment and how to schedule stage-in and stage-out processes of data between the storage backend and the ad hoc file systems. Also presented are strategies to build ad hoc file systems by using reusable components for networking and how to improve storage device compatibility. Various interfaces and semantics are presented, for example those used by the three ad hoc file systems BeeOND, GekkoFS, and BurstFS. Their presentation covers a range from file systems running in production to cutting-edge research focusing on reaching the performance limits of the underlying devices.

Electronic Supplementary Material

Download File(s)
jcst-35-1-4-Highlights.pdf (421.3 KB)

References

【1】
【1】
 
 
Journal of Computer Science and Technology
Pages 4-26

{{item.num}}

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Close
Close
Cite this article:
Brinkmann A, Mohror K, Yu W, et al. Ad Hoc File Systems for High-Performance Computing. Journal of Computer Science and Technology, 2020, 35(1): 4-26. https://doi.org/10.1007/s11390-020-9801-1

929

Views

30

Crossref

N/A

Web of Science

33

Scopus

0

CSCD

Received: 30 June 2019
Revised: 30 August 2019
Published: 17 January 2020
©Institute of Computing Technology, Chinese Academy of Sciences 2020