Instance segmentation on distributed deep learning big data cluster

被引:0
作者
Mohammed Elhmadany
Islam Elmadah
Hossam E. Abdelmunim
机构
[1] Ain Shams University,Computer and Systems Engineering, Faculty of Engineering
来源
Journal of Big Data | / 11卷
关键词
Distributed deep learning; Big data cluster; BigDl; Spark; Instance segmentation; YOLACT; OpenVINO; ONNX; Azure databricks;
D O I
暂无
中图分类号
学科分类号
摘要
Distributed deep learning is a promising approach for training and deploying large and complex deep learning models. This paper presents a comprehensive workflow for deploying and optimizing the YOLACT instance segmentation model as on big data clusters. OpenVINO, a toolkit known for its high-speed data processing and ability to optimize deep learning models for deployment on a variety of devices, was used to optimize the YOLACT model. The model is then run on a big data cluster using BigDL, a distributed deep learning library for Apache Spark. BigDL provides a high-level programming interface for defining and training deep neural networks, making it suitable for large-scale deep learning applications. In distributed deep learning, input data is divided and distributed across multiple machines for parallel processing. This approach offers several advantages, including the ability to handle very large data that can be stored in a distributed manner, scalability to decrease processing time by increasing the number of workers, and fault tolerance. The proposed workflow was evaluated on virtual machines and Azure Databricks, a cloud-based platform for big data analytics. The results indicated that the workflow can scale to large datasets and deliver high performance on Azure Databricks. This study explores the benefits and challenges of using distributed deep learning on big data clusters for instance segmentation. Popular distributed deep learning frameworks are discussed, and BigDL is chosen. Overall, this study highlights the practicality of distributed deep learning for deploying and scaling sophisticated deep learning models on big data clusters.
引用
收藏
相关论文
共 10 条
[1]  
Najafabadi MM(2015)Deep learning applications and challenges in big data analytics J Big Data 2 1-21
[2]  
Villanustre F(2010)Reverse engineering a gene network using an asynchronous parallel evolution strategy BMC Syst Biol 4 1-16
[3]  
Khoshgoftaar TM(2013)Scaling big data mining infrastructure: the twitter experience ACM SIGKDD Explorations Newsl 14 6-19
[4]  
Seliya N(undefined)undefined undefined undefined undefined-undefined
[5]  
Wald R(undefined)undefined undefined undefined undefined-undefined
[6]  
Muharemagic E(undefined)undefined undefined undefined undefined-undefined
[7]  
Jostins L(undefined)undefined undefined undefined undefined-undefined
[8]  
Jaeger J(undefined)undefined undefined undefined undefined-undefined
[9]  
Lin J(undefined)undefined undefined undefined undefined-undefined
[10]  
Ryaboy D(undefined)undefined undefined undefined undefined-undefined