Discovering the Arrow of Time in Machine Learning

被引:0
|
作者
Kasmire, J. [1 ]
Zhao, Anran [1 ]
机构
[1] Univ Manchester, UK Data Serv & Cathie Marsh Inst, Manchester M13 9PL, Lancs, England
关键词
machine learning; time; naive Bayes classification; recurrent neural networks; Twitter; social media data; automatic classification; INFORMATION;
D O I
10.3390/info12110439
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML) is increasingly useful as data grow in volume and accessibility. ML can perform tasks (e.g., categorisation, decision making, anomaly detection, etc.) through experience and without explicit instruction, even when the data are too vast, complex, highly variable, full of errors to be analysed in other ways. Thus, ML is great for natural language, images, or other complex and messy data available in large and growing volumes. Selecting ML models for tasks depends on many factors as they vary in supervision needed, tolerable error levels, and ability to account for order or temporal context, among many other things. Importantly, ML methods for tasks that use explicitly ordered or time-dependent data struggle with errors or data asymmetry. Most data are (implicitly) ordered or time-dependent, potentially allowing a hidden 'arrow of time' to affect ML performance on non-temporal tasks. This research explores the interaction of ML and implicit order using two ML models to automatically classify (a non-temporal task) tweets (temporal data) under conditions that balance volume and complexity of data. Results show that performance was affected, suggesting that researchers should carefully consider time when matching appropriate ML models to tasks, even when time is only implicitly included.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Discovering unusual structures from exception using big data and machine learning techniques
    Jie, Jianshu
    Hu, Zongxiang
    Qian, Guoyu
    Weng, Mouyi
    Li, Shunning
    Li, Shucheng
    Hu, Mingyu
    Chen, Dong
    Xiao, Weiji
    Zheng, Jiaxin
    Wang, Lin-Wang
    Pan, Feng
    SCIENCE BULLETIN, 2019, 64 (09) : 612 - 616
  • [32] Discovering unusual structures from exception using big data and machine learning techniques
    Jianshu Jie
    Zongxiang Hu
    Guoyu Qian
    Mouyi Weng
    Shunning Li
    Shucheng Li
    Mingyu Hu
    Dong Chen
    Weiji Xiao
    Jiaxin Zheng
    Lin-Wang Wang
    Feng Pan
    ScienceBulletin, 2019, 64 (09) : 612 - 616
  • [33] Entropy production and the arrow of time
    Parrondo, J. M. R.
    Van den Broeck, C.
    Kawai, R.
    NEW JOURNAL OF PHYSICS, 2009, 11
  • [34] The physics of time and the arrow thereof
    Hecht, Eugene
    EUROPEAN JOURNAL OF PHYSICS, 2018, 39 (01)
  • [35] Machine Learning Approach for Maximizing Thermoelectric Properties of BiCuSeO and Discovering New Doping Element
    Parse, Nuttawat
    Pongkitivanichkul, Chakrit
    Pinitsoontorn, Supree
    ENERGIES, 2022, 15 (03)
  • [36] A machine learning attack on illegal trading
    James, Robert
    Leung, Henry
    Prokhorov, Artem
    JOURNAL OF BANKING & FINANCE, 2023, 148
  • [37] Efficient Observation Time Window Segmentation for Administrative Data Machine Learning
    Taib, Musa
    Messier, Geoffrey G.
    IEEE ACCESS, 2024, 12 : 158647 - 158656
  • [38] Time-reversal scatterer detection method using machine learning
    Han LingHe
    Di BangRang
    Hu ZiDuo
    Liu Wei
    Wang GuoQing
    Xu ZhongHua
    CHINESE JOURNAL OF GEOPHYSICS-CHINESE EDITION, 2021, 64 (09): : 3304 - 3315
  • [39] Machine Learning Techniques for THz Imaging and Time-Domain Spectroscopy
    Park, Hochong
    Son, Joo-Hiuk
    SENSORS, 2021, 21 (04) : 1 - 25
  • [40] Machine Learning in Power Systems: Is It Time to Trust It?
    Chatzivasileiadis, Spyros
    Venzke, Andreas
    Stiasny, Jochen
    Misyris, Georgios
    IEEE POWER & ENERGY MAGAZINE, 2022, 20 (03): : 32 - 41