Discovering the Arrow of Time in Machine Learning

Cited by: 0
Authors
Kasmire, J. [1]
Zhao, Anran [1]
Affiliations
[1] Univ Manchester, UK Data Serv & Cathie Marsh Inst, Manchester M13 9PL, Lancs, England
Keywords
machine learning; time; naive Bayes classification; recurrent neural networks; Twitter; social media data; automatic classification; information
DOI
10.3390/info12110439
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Machine learning (ML) is increasingly useful as data grow in volume and accessibility. ML can perform tasks (e.g., categorisation, decision making, anomaly detection) through experience and without explicit instruction, even when the data are too vast, complex, highly variable, or full of errors to be analysed in other ways. ML is therefore well suited to natural language, images, and other complex and messy data available in large and growing volumes. Selecting an ML model for a task depends on many factors, because models vary in the supervision they need, the error levels they tolerate, and their ability to account for order or temporal context, among other things. Importantly, ML methods for tasks that use explicitly ordered or time-dependent data struggle with errors or data asymmetry. Most data are (implicitly) ordered or time-dependent, potentially allowing a hidden 'arrow of time' to affect ML performance on non-temporal tasks. This research explores the interaction of ML and implicit order by using two ML models to automatically classify tweets (temporal data), a non-temporal task, under conditions that balance volume and complexity of data. Results show that performance was affected, suggesting that researchers should carefully consider time when matching appropriate ML models to tasks, even when time is only implicitly included.
Pages: 16