Discovering the Arrow of Time in Machine Learning

Cited by: 0
Authors
Kasmire, J. [1]
Zhao, Anran [1]
Affiliations
[1] Univ Manchester, UK Data Serv & Cathie Marsh Inst, Manchester M13 9PL, Lancs, England
Keywords
machine learning; time; naive Bayes classification; recurrent neural networks; Twitter; social media data; automatic classification; INFORMATION;
DOI
10.3390/info12110439
Chinese Library Classification (CLC)
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Machine learning (ML) is increasingly useful as data grow in volume and accessibility. ML can perform tasks (e.g., categorisation, decision making, anomaly detection) through experience and without explicit instruction, even when the data are too vast, complex, highly variable, or full of errors to be analysed in other ways. Thus, ML is well suited to natural language, images, or other complex and messy data available in large and growing volumes. Selecting ML models for tasks depends on many factors, as models vary in the supervision they need, the error levels they tolerate, and their ability to account for order or temporal context, among many other things. Importantly, ML methods for tasks that use explicitly ordered or time-dependent data struggle with errors or data asymmetry. Most data are (implicitly) ordered or time-dependent, potentially allowing a hidden 'arrow of time' to affect ML performance on non-temporal tasks. This research explores the interaction of ML and implicit order using two ML models to automatically classify (a non-temporal task) tweets (temporal data) under conditions that balance volume and complexity of data. Results show that performance was affected, suggesting that researchers should carefully consider time when matching appropriate ML models to tasks, even when time is only implicitly included.
Pages: 16