Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning

被引:13
|
作者
Ferrario, Andrea [1 ]
Demiray, Burcu [2 ,3 ,4 ]
Yordanova, Kristina [5 ,6 ,7 ]
Luo, Minxia [2 ,3 ]
Martin, Mike [2 ,3 ,4 ,8 ]
机构
[1] Swiss Fed Inst Technol, Dept Management Technol & Econ, Weinbergstr 56-58, CH-8092 Zurich, Switzerland
[2] Univ Zurich, Dept Psychol, Zurich, Switzerland
[3] Univ Zurich, Univ Res Prior Program, Zurich, Switzerland
[4] Coll Helveticum, Zurich, Switzerland
[5] Univ Rostock, Inst Comp Sci, Rostock, Germany
[6] Univ Rostock, Inst Visual & Analyt Comp, Rostock, Germany
[7] Univ Rostock, Interdisciplinary Fac Ageing Individuals & Soc, Rostock, Germany
[8] Univ Queensland, Fac Hlth & Behav Sci, Sch Psychol, Brisbane, Qld, Australia
关键词
aging; dementia; reminiscence; real-life conversations; electronically activated recorder (EAR); natural language processing; machine learning; imbalanced learning; ACTIVATED RECORDER EAR; AUTOBIOGRAPHICAL MEMORY; HEALTH; CHALLENGES; IMBALANCE; FUTURE;
D O I
10.2196/19133
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Reminiscence is the act of thinking or talking about personal experiences that occurred in the past. It is a central task of old age that is essential for healthy aging, and it serves multiple functions, such as decision-making and introspection, transmitting life lessons, and bonding with others. The study of social reminiscence behavior in everyday life can be used to generate data and detect reminiscence from general conversations. Objective: The aims of this original paper are to (1) preprocess coded transcripts of conversations in German of older adults with natural language processing (NLP), and (2) implement and evaluate learning strategies using different NLP features and machine learning algorithms to detect reminiscence in a corpus of transcripts. Methods: The methods in this study comprise (1) collecting and coding of transcripts of older adults' conversations in German, (2) preprocessing transcripts to generate NLP features (bag-of-words models, part-of-speech tags, pretrained German word embeddings), and (3) training machine learning models to detect reminiscence using random forests, support vector machines, and adaptive and extreme gradient boosting algorithms. The data set comprises 2214 transcripts, including 109 transcripts with reminiscence. Due to class imbalance in the data, we introduced three learning strategies: (1) class-weighted learning, (2) a meta-classifier consisting of a voting ensemble, and (3) data augmentation with the Synthetic Minority Oversampling Technique (SMOTE) algorithm. For each learning strategy, we performed cross-validation on a random sample of the training data set of transcripts. We computed the area under the curve (AUC), the average precision (AP), precision, recall, as well as F1 score and specificity measures on the test data, for all combinations of NLP features, algorithms, and learning strategies. Results: Class-weighted support vector machines on bag-of-words features outperformed all other classifiers (AUC=0.91, AP=0.56, precision=0.5, recall=0.45, F1=0.48, specificity=0.98), followed by support vector machines on SMOTE-augmented data and word embeddings features (AUC=0.89, AP=0.54, precision=0.35, recall=0.59, F1=0.44, specificity=0.94). For the meta-classifier strategy, adaptive and extreme gradient boosting algorithms trained on word embeddings and bag-of-words outperformed all other classifiers and NLP features; however, the performance of the meta-classifier learning strategy was lower compared to other strategies, with highly imbalanced precision-recall trade-offs. Conclusions: This study provides evidence of the applicability of NLP and machine learning pipelines for the automated detection of reminiscence in older adults' everyday conversations in German. The methods and findings of this study could be relevant for designing unobtrusive computer systems for the real-time detection of social reminiscence in the everyday life of older adults and classifying their functions. With further improvements, these systems could be deployed in health interventions aimed at improving older adults' well-being by promoting self-reflection and suggesting coping strategies to be used in the case of dysfunctional reminiscence cases, which can undermine physical and mental health.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] Detection of social media platform insults using Natural language processing and comparative study of machine learning algorithms
    Chiramel, Sruthi
    Logofatu, Doina
    Goldenthal, Gheorghe
    2020 24TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2020, : 98 - 101
  • [12] Machine learning and Natural Language Processing of social media data for event detection in smart cities
    Hodorog, Andrei
    Petri, Ioan
    Rezgui, Yacine
    SUSTAINABLE CITIES AND SOCIETY, 2022, 85
  • [13] Predicting Working Memory in Healthy Older Adults Using Real-Life Language and Social Context Information: A Machine Learning Approach
    Ferrario, Andrea
    Luo, Minxia
    Polsinelli, Angelina J.
    Moseley, Suzanne A.
    Mehl, Matthias R.
    Yordanova, Kristina
    Martin, Mike
    Demiray, Burcu
    JMIR AGING, 2022, 5 (01)
  • [14] Semi-automated title-abstract screening using natural language processing and machine learning
    Pilz, Maximilian
    Zimmermann, Samuel
    Friedrichs, Juliane
    Woerdehoff, Enrica
    Ronellenfitsch, Ulrich
    Kieser, Meinhard
    Vey, Johannes A.
    SYSTEMATIC REVIEWS, 2024, 13 (01)
  • [15] SmishGuard: Leveraging Machine Learning and Natural Language Processing for Smishing Detection
    Samad, Saleem Raja Abdul
    Ganesan, Pradeepa
    Rajasekaran, Justin
    Radhakrishnan, Madhubala
    Ammaippan, Hariraman
    Ramamurthy, Vinodhini
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 586 - 593
  • [16] Machine Learning Driven Mental Stress Detection on Reddit Posts Using Natural Language Processing
    Shaunak Inamdar
    Rishikesh Chapekar
    Shilpa Gite
    Biswajeet Pradhan
    Human-Centric Intelligent Systems, 2023, 3 (2): : 80 - 91
  • [17] Network Intrusion Detection using Natural Language Processing and Ensemble Machine Learning
    Das, Saikat
    Ashrafuzzamant, Mohammad
    Sheldon, Frederick T.
    Shiva, Sajjan
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 829 - 835
  • [18] Crime Detection and Analysis from Social Media Messages Using Machine Learning and Natural Language Processing Technique
    Lombo, Xolani
    Oyelade, Olaide N.
    Ezugwu, Absalom E.
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART V, 2022, 13381 : 502 - 517
  • [19] Subjective Answers Evaluation Using Machine Learning and Natural Language Processing
    Bashir, Muhammad Farrukh
    Arshad, Hamza
    Javed, Abdul Rehman
    Kryvinska, Natalia
    Band, Shahab S.
    IEEE ACCESS, 2021, 9 : 158972 - 158983
  • [20] Detecting Phishing Attacks Using Natural Language Processing And Machine Learning
    Banu, Reshma
    Anand, M.
    Kamath, Akshatha C.
    Ashika, S.
    Ujwala, H. S.
    Harshitha, S. N.
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 1210 - 1214