Implementing Clustering and Classification Approaches for Big Data with MATLAB

被引:0
|
作者
Pitz, Katrin [1 ]
Anderl, Reiner [1 ]
机构
[1] Tech Univ Darmstadt, D-64283 Darmstadt, Germany
来源
PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2018, VOL 1 | 2019年 / 880卷
关键词
Big Data; Clustering; Classification; K-means; Discriminant analysis; Neural networks; MATLAB;
D O I
10.1007/978-3-030-02686-8_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data sets grow rapidly, driven by increasing storage capacities as well as by the wish to equip more and more devices with sensors and connectivity. In mechanical engineering Big Data offers new possibilities to gain knowledge from existing data for product design, manufacturing, maintenance and failure prevention. Typical interests when analyzing Big Data are the identification of clusters, the assignment to classes or the development of regression models for prediction. This paper assesses various Big Data approaches and chooses adequate clustering and classification solutions for a data set of simulated jet engine signals and life spans. These solutions include kmeans clustering, linear discriminant analysis and neural networks. MATLAB is chosen as the programming environment for implementation because of its dissemination in engineering disciplines. The suitability of MATLAB as a tool for Big Data analysis is to be evaluated. The results of all applied clustering and classification approaches are discussed and prospects for further adaption and transferability to other scenarios are pointed out.
引用
收藏
页码:458 / 480
页数:23
相关论文
共 50 条
  • [1] The Survey on Approaches to Efficient Clustering and Classification Analysis of Big Data
    Gandhi, Bhagyashri S.
    Deshpande, Leena A.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2016,
  • [2] Big Data and Clustering Algorithms
    Ajin, V. W.
    Kumar, Lekshmy D.
    2016 INTERNATIONAL CONFERENCE ON RESEARCH ADVANCES IN INTEGRATED NAVIGATION SYSTEMS (RAINS), 2016,
  • [3] Big Data clustering validity
    Tlili, Monia
    Hamdani, Tarek M.
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 348 - 352
  • [4] Vehicle Industry Big Data Analysis Using Clustering Approaches
    Seixas, Lenon Diniz
    Correa, Fernanda Cristina
    Siqueira, Hugo Valadares
    Trojan, Flavio
    Afonso, Paulo
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, PT II, OL2A 2023, 2024, 1982 : 312 - 325
  • [5] Iterative subsampling in solution path clustering of noisy big data
    Marchetti, Yuliya
    Zhou, Qing
    STATISTICS AND ITS INTERFACE, 2016, 9 (04) : 415 - 431
  • [6] COMPUTATIONAL APPROACHES TO AUTOMATIC DATA CLUSTERING AND CLASSIFICATION
    SUMPTER, BG
    NOID, DW
    COMPUTATIONAL POLYMER SCIENCE, 1995, 5 (03): : 121 - 134
  • [7] Metaheuristic Based Clustering with Deep Learning Model for Big Data Classification
    Krishnaswamy, R.
    Subramaniam, Kamalraj
    Nandini, V
    Vijayalakshmi, K.
    Kadry, Seifedine
    Nam, Yunyoung
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (01): : 391 - 406
  • [8] Scalable Clustering Algorithms for Big Data: A Review
    Mahdi, Mahmoud A.
    Hosny, Khalid M.
    Elhenawy, Ibrahim
    IEEE ACCESS, 2021, 9 : 80015 - 80027
  • [9] A survey on parallel clustering algorithms for Big Data
    Dafir, Zineb
    Lamari, Yasmine
    Slaoui, Said Chah
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (04) : 2411 - 2443
  • [10] A clustering-classification approach in categorizing vulnerability of roads and bridges using public assistance big data
    Bhattacharyya, Arkaprabha
    Morshedi, Mohamadali
    Hastak, Makarand
    INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION, 2023, 84