On the use of big data frameworks for big service composition

被引:19
作者
Sellami, Mokhtar [1 ]
Mezni, Haithem [2 ,3 ]
Hacid, Mohand Said [4 ]
机构
[1] Higher Inst Technol Studies Jendouba, RIADI Lab, Jendouba, Tunisia
[2] Taibah Univ, Medina, Saudi Arabia
[3] Univ Tunis, Higher Inst Managements, SMART Lab, Tunis, Tunisia
[4] Univ Claude Bernard Lyon 1, LIRIS, Lyon, France
关键词
Big data; Big service; Big service composition; Quality of big services; Fuzzy RCA; Spark; RELATIONAL CONCEPT ANALYSIS; QUALITY;
D O I
10.1016/j.jnca.2020.102732
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Over the last years, big data has emerged as a new paradigm for the processing and analysis of massive volumes of data. Big data processing has been combined with service and cloud computing, leading to a new class of services called "Big Services". In this new model, services can be seen as an abstract layer that hides the complexity of the processed big data. To meet users' complex and heterogeneous needs in the era of big data, service reuse is a natural and efficient means that helps orchestrating available services' operations, to provide customer on demand big services. However different from traditional Web service composition, composing big services refers to the reuse of, not only existing high-quality services, but also high-quality data sources, while taking into account their security constraints (e.g., data provenance, threat level and data leakage). Moreover, composing heterogeneous and large-scale data-centric services faces several challenges, apart from security risks, such as the big services' high execution time and the incompatibility between providers' policies across multiple domains and clouds. Aiming to solve the above issues, we propose a scalable approach for big service composition, which considers not only the quality of reused services (QoS), but also the quality of their consumed data sources (QoD). Since the correct representation of big services requirements is the first step towards an effective composition, we first propose a quality model for big services and we quantify the data breaches using L-Severity metrics. Then to facilitate processing and mining big services' related information during composition, we exploit the strong mathematical foundation of fuzzy Relational Concept Analysis (fuzzy RCA) to build the big services' repository as a lattice family. We also used fuzzy RCA to cluster services and data sources based on various criteria, including their quality levels, their domains, and the relationships between them. Finally, we define algorithms that parse the lattice family to select and compose high-quality and secure big services in a parallel fashion. The proposed method, which is implemented on top of Spark big data framework, is compared with two existing approaches, and experimental studies proved the effectiveness of our big service composition approach in terms of QoD-aware composition, scalability, and security breaches.
引用
收藏
页数:22
相关论文
共 46 条
[1]  
[Anonymous], 2016, J SOFTWARE SYST DEV, DOI DOI 10.5171/2016.356668
[2]   Context-aware data quality assessment for big data [J].
Ardagna, Danilo ;
Cappiello, Cinzia ;
Sama, Walter ;
Vitali, Monica .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 89 :548-562
[3]   Link key candidate extraction with relational concept analysis [J].
Atencia, Manuel ;
David, Jerome ;
Euzenat, Jerome ;
Napoli, Amedeo ;
Vizzini, Jeremy .
DISCRETE APPLIED MATHEMATICS, 2020, 273 :2-20
[4]  
Ballou DP, 2003, IEEE T KNOWL DATA EN, V15, P240, DOI 10.1109/TKDE.2003.1161595
[5]  
Barhamgi M., 2013, EDBT ICDT, V55, P86
[6]  
Bertino E., 2018, COMPREHENSIVE GUIDE, P425
[7]  
Boulakbech M., 2017, ICFNDS
[8]  
Cai L, 2015, Data science journal, V14, P1, DOI DOI 10.5334/DSJ-2015-002
[9]   Formal and relational concept analysis for fuzzy-based automatic semantic annotation [J].
De Maio, C. ;
Fenza, G. ;
Gallo, M. ;
Loia, V. ;
Senatore, S. .
APPLIED INTELLIGENCE, 2014, 40 (01) :154-177
[10]   Distributed online Temporal Fuzzy Concept Analysis for stream processing in smart cities [J].
De Maio, Carmen ;
Fenza, Giuseppe ;
Loia, Vincenzo ;
Orciuoli, Francesco .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 110 :31-41