Predicting file lifetimes for data placement in multi-tiered storage systems for HPC - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Operating Systems Review Année : 2021

Predicting file lifetimes for data placement in multi-tiered storage systems for HPC

(1) , (2) , (1) , (2) , (3, 1)
1
2
3

Résumé

The emergence of Exascale machines in HPC will have the foreseen consequence of putting more pressure on the storage systems in place, not only in terms of capacity but also bandwidth and latency. With limited budget we cannot imagine using only storage class memory, which leads to the use of a heterogeneous tiered storage hierarchy. In order to make the most efficient use of the high performance tier in this storage hierarchy, we need to be able to place user data on the right tier and at the right time. In this paper, we assume a 2-tier storage hierarchy with a high performance tier and a high capacity archival tier. Files are placed on the high performance tier at creation time and moved to capacity tier once their lifetime expires (that is once they are no more accessed). The main contribution of this paper lies in the design of a file lifetime prediction model solely based on its path based on the use of Convolutional Neural Network. Results show that our solution strikes a good trade-off between accuracy and under-estimation. Compared to previous work, our model made it possible to reach an accuracy close to previous work (around 98.60% compared to 98.84%) while reducing the underestimations by almost 10x to reach 2.21% (compared to 21.86%). The reduction in underestimations is crucial as it avoids misplacing files in the capacity tier while they are still in use.
Fichier principal
Vignette du fichier
CHEOPS_21_paper_6.pdf (617.69 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03349823 , version 1 (20-09-2021)

Identifiants

Citer

Luis Thomas, Sebastien Gougeaud, Stéphane Rubini, Philippe Deniel, Jalil Boukhobza. Predicting file lifetimes for data placement in multi-tiered storage systems for HPC. Operating Systems Review, 2021, 55 (1), pp.99-107. ⟨10.1145/3469379.3469392⟩. ⟨hal-03349823⟩
40 Consultations
35 Téléchargements

Altmetric

Partager

Gmail Facebook Twitter LinkedIn More