DeepHash: An End-to-End Learning Approach for Metadata Management in Distributed File Systems

被引:0
|
作者
Gao, Yuanning [1 ]
Gao, Xiaofeng [1 ]
Chen, Guihai [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Key Lab Scalable Comp & Syst, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019) | 2019年
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
metadata management; neural network; locality preserving hashing; distributed file system;
D O I
10.1145/3337821.3337924
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In distributed file systems, distributed metadata management can be considered as a mapping problem, i.e., how to effectively map the metadata namespace tree to multiple metadata servers (MDS's). In general, all traditional distributed metadata management schemes simply presume a rigid mapping function, thus failing to adaptively meet the requirements of different applications. To better take advantage of the current distribution of the metadata, in this exploratory paper, we present the first machine learning based model called DeepHash, which leverages the deep neural network to learn a locality preserving hashing (LPH) mapping. To help learn a good position relationship of metadata nodes in the namespace tree, we first present a metadata representation strategy. Due to the absence of training labels, i.e., the hash values of metadata nodes, we design two kinds of loss functions with distinctive characters to train DeepHash respectively, including a pair loss and a triplet loss, and introduce some sampling strategies for these two approaches. We conduct extensive experiments on Amazon EC2 platform to compare the performance of DeepHash with traditional and state-of-the-art schemes. The results demonstrate that DeepHash can preserve the metadata locality well while maintaining a high load balancing, which denotes the effectiveness and efficiency of DeepHash.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] An End-to-End Learning-Based Metadata Management Approach for Distributed File Systems
    Gao, Yuanning
    Gao, Xiaofeng
    Zhang, Ruisi
    Chen, Guihai
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (05) : 1021 - 1034
  • [2] An End-to-end Log Management Framework for Distributed Systems
    He, Pinjia
    2017 IEEE 36TH INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2017, : 266 - 267
  • [3] On Modular Learning of Distributed Systems for Predicting End-to-End Latency
    Liang, Chieh-Jan Mike
    Fang, Zilin
    Xie, Yuqing
    Yang, Fan
    Li, Zhao Lucis
    Zhang, Li Lyna
    Yang, Mao
    Zhou, Lidong
    PROCEEDINGS OF THE 20TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, NSDI 2023, 2023, : 1081 - 1095
  • [4] Distributed End-to-End testing management
    Bai, XY
    Tsai, WT
    Paul, R
    Shen, TC
    Li, B
    FIFTH IEEE INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE, PROCEEDINGS, 2001, : 140 - 151
  • [5] Dynamic end-to-end QoS management middleware for distributed multimedia systems
    Ecklund, DJ
    Goebel, V
    Plagemann, T
    Ecklund, EF
    MULTIMEDIA SYSTEMS, 2002, 8 (05) : 431 - 442
  • [6] Dynamic end-to-end QoS management middleware for distributed multimedia systems
    Denise J. Ecklund
    Vera Goebel
    Thomas Plagemann
    Earl F. Ecklund Jr.
    Multimedia Systems, 2002, 8 : 431 - 442
  • [7] SECURE END-TO-END DELEGATIONS IN DISTRIBUTED SYSTEMS
    HARDJONO, T
    OHTA, T
    COMPUTER COMMUNICATIONS, 1994, 17 (03) : 230 - 238
  • [8] A distributed approach to end-to-end network topology inference
    Jin, Xing
    Xia, Qiuyan
    Chan, S. -H. Gary
    2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, : 1704 - 1709
  • [9] End-to-end Resource Reservations in Distributed Embedded Systems
    Ashjaei, Mohammad
    Mubeen, Saad
    Behnam, Moris
    Almeida, Luis
    Nolte, Thomas
    2016 IEEE 22ND INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA), 2016, : 1 - 11
  • [10] Distributed multimedia information systems: an end-to-end perspective
    Arif Ghafoor
    Multimedia Tools and Applications, 2007, 33 : 31 - 56