Data Column Prediction: Experiment in Automated Column Tagging Using Machine Learning

被引:0
|
作者
McCabe, S. [1 ]
Cropp, B. [1 ]
Coles, J. [1 ]
Del Vecchio, J. [1 ]
Ekstrum, J. [1 ]
机构
[1] CUBRC Inc, 4455 Genesee St, Buffalo, NY 14226 USA
关键词
machine learning; column prediction; ontology; intelligence analysis;
D O I
10.1117/12.2519305
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The lack of tools to rapidly identify and align data from different sources is a critical, needed capability for the Department of Defense especially when it comes to automated ingestion. In the current open source Karma Mapping Tool, the Steiner tree optimization algorithm suggests semantic types during data alignment. We hypothesize that Machine Learning (ML) may perform better than the Steiner approach on a subset of column types, or "labels", where 1.) the data is extremely similar in structure and content and 2.) inferring column type correctly is highly dependent on the interrelated components of the dataset. In this session we discuss the experimental design, our initial results, and a path toward future work in broader applications beginning with intelligence analysis in the maritime domain. The initial results from this experiment show there is promise in using ML to do column prediction in analysis environments where there are many similar or overlapping data.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Optimized machine learning models for prediction of effective stiffness of rectangular reinforced concrete column sections
    Sapkota, Sanjog Chhetri
    Das, Sourav
    Saha, Prasenjit
    STRUCTURES, 2024, 62
  • [32] Near duplicate column identification: a machine learning approach
    Chevallier, Marc
    Boufares, Faouzi
    Grozavu, Nistor
    Rogovschi, Nicoleta
    Clairmont, Charly
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [33] Comparison of machine learning algorithms for slope stability prediction using an automated machine learning approach
    Kurnaz, Talas Fikret
    Erden, Caner
    Dagdeviren, Ugur
    Demir, Alparslan Serhat
    Kokcam, Abdullah Hulusi
    NATURAL HAZARDS, 2024, 120 (08) : 6991 - 7014
  • [34] Prediction of Air Pressure Change Inside the Chamber of an Oscillating Water Column-Wave Energy Converter Using Machine-Learning in Big Data Platform
    Seo, Dongwoo
    Huh, Taesang
    Kim, Myungil
    Hwang, Jaesoon
    Jung, Daeyong
    ENERGIES, 2021, 14 (11)
  • [35] Predicting reliable H2 column density maps from molecular line data using machine learning
    Shimajiri, Yoshito
    Kawanishi, Yasutomo
    Fujita, Shinji
    Miyamoto, Yusuke
    Ito, Atsushi M.
    Arzoumanian, Doris
    Andre, Philippe
    Nishimura, Atsushi
    Tokuda, Kazuki
    Kaneko, Hiroyuki
    Takekawa, Shunya
    Ueda, Shota
    Onishi, Toshikazu
    Inoue, Tsuyoshi
    Nishimoto, Shimpei
    Yoneda, Ryuki
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2023, 526 (01) : 966 - 981
  • [36] Predicting reliable H2 column density maps from molecular line data using machine learning
    Shimajiri, Yoshito
    Kawanishi, Yasutomo
    Fujita, Shinji
    Miyamoto, Yusuke
    Ito, Atsushi M.
    Arzoumanian, Doris
    Andre, Philippe
    Nishimura, Atsushi
    Tokuda, Kazuki
    Kaneko, Hiroyuki
    Takekawa, Shunya
    Ueda, Shota
    Onishi, Toshikazu
    Inoue, Tsuyoshi
    Nishimoto, Shimpei
    Yoneda, Ryuki
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2023, : 966 - 981
  • [37] Classification of failure mode and prediction of shear strength for reinforced concrete beam-column joints using machine learning techniques
    Mangalathu, Sujith
    Jeon, Jong-Su
    ENGINEERING STRUCTURES, 2018, 160 : 85 - 94
  • [38] C/N ratio effect on oily wastewater treatment using column type SBR: machine learning prediction and metagenomics study
    Khan, Nadeem A.
    Majumder, Abhradeep
    Singh, Simranjeet
    Ramamurthy, Praveen C.
    Prakash, Sandra Kathott
    Farooqi, I. H.
    Mozaffari, Nastaran
    Lawal, Dahiru U.
    Aljundi, Isam H.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [39] Prediction of multi-inputs bubble column reactor using a novel hybrid model of computational fluid dynamics and machine learning
    Mosavi, Amir
    Shamshirband, Shahaboddin
    Salwana, Ely
    Chau, Kwok-Wing
    Tah, Joseph H. M.
    ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2019, 13 (01) : 482 - 492
  • [40] Estimation of Column Aerosol Contribution in Seoul and Gangneung Using Machine Learning Clustering Technique
    Pyo, Seong-Hun
    Lee, Kwon-Ho
    Lee, Kyu-Tae
    JOURNAL OF KOREAN SOCIETY FOR ATMOSPHERIC ENVIRONMENT, 2021, 37 (06) : 931 - 945