Natural Language Processing to Classify Caregiver Strategies Supporting Participation Among Children and Youth with Craniofacial Microsomia and Other Childhood-Onset Disabilities

Cited by: 3
Authors
Kaelin, Vera C. [1 ,2 ,3 ]
Boyd, Andrew D. [4 ]
Werler, Martha M. [5 ]
Parde, Natalie [2 ,6 ]
Khetani, Mary A. [1 ,3 ,7 ]
Affiliations
[1] Univ Illinois, Dept Occupat Therapy, 1919 West Taylor St,Room 316A, Chicago, IL 60612 USA
[2] Univ Illinois, Dept Comp Sci, 851 South Morgan St,Room 1132, Chicago, IL 60607 USA
[3] Univ Illinois, Childrens Participat Environm Res Lab, Chicago, IL USA
[4] Univ Illinois, Biomed & Hlth Informat Sci, Chicago, IL USA
[5] Boston Univ, Epidemiol, Boston, MA USA
[6] Univ Illinois, Nat Language Proc Lab, Chicago, IL USA
[7] McMaster Univ, CanChild Ctr Childhood Disabil Res, Hamilton, ON, Canada
Keywords
Pediatric rehabilitation; Artificial intelligence; Activities; Preferences; Sense of self; Environment; IMPROVING PARTICIPATION; ENVIRONMENT MEASURE; PERFORMANCE; OUTCOMES;
DOI
10.1007/s41666-023-00149-y
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Customizing participation-focused pediatric rehabilitation interventions is an important but complex and potentially resource-intensive process that may benefit from automated and simplified steps. This research aimed to apply natural language processing to develop and identify the best-performing predictive model for classifying caregiver strategies into participation-related constructs while filtering out non-strategies. We created a dataset of 1,576 caregiver strategies obtained from 236 families of children and youth (11-17 years) with craniofacial microsomia or other childhood-onset disabilities. These strategies were annotated to four participation-related constructs and a non-strategy class. We experimented with manually created features (i.e., speech and dependency tags, predefined likely sets of words, dense lexicon features (i.e., Unified Medical Language System (UMLS) concepts)) and three classical methods (i.e., logistic regression, naive Bayes, support vector machines (SVM)). We tested a series of binary and multinomial classification tasks, applying 10-fold cross-validation on the training set (80%) before testing the best-performing model on the held-out test set (20%). SVM using term frequency-inverse document frequency (TF-IDF) was the best-performing model for all four classification tasks, with accuracy ranging from 78.10% to 94.92% and a macro-averaged F1-score ranging from 0.58 to 0.83. Manually created features increased model performance only when filtering out non-strategies. Results suggest pipelined classification tasks (i.e., filtering out non-strategies; classification into intrinsic and extrinsic strategies; classification into participation-related constructs) for implementation into participation-focused pediatric rehabilitation interventions like Participation and Environment Measure Plus (PEM+) among caregivers who complete the Participation and Environment Measure for Children and Youth (PEM-CY).
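The abstract reports both accuracy and a macro-averaged F1-score, and the two can diverge when one class (such as non-strategies) is much rarer than the others. The following is a minimal Python sketch of how the two metrics are computed, not the authors' evaluation code. The function names and the toy labels are hypothetical and chosen only for illustration.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of items whose predicted label equals the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (each class counts equally)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative
    f1_scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs
        f1_scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores)


# Toy, made-up labels (assumption: not data from the study).
y_true = ["strategy"] * 8 + ["non-strategy"] * 2
y_pred = ["strategy"] * 8 + ["strategy", "non-strategy"]

print(f"accuracy = {accuracy(y_true, y_pred):.3f}")
print(f"macro F1 = {macro_f1(y_true, y_pred):.3f}")
```

In this toy case a single error on the rare class lowers the macro-averaged F1 more than it lowers accuracy, which is one reason the two figures are usually reported together for imbalanced classification tasks.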
Pages: 480-500
Page count: 21
Related papers
3 items in total
  • [1] Natural Language Processing to Classify Caregiver Strategies Supporting Participation Among Children and Youth with Craniofacial Microsomia and Other Childhood-Onset Disabilities
    Vera C. Kaelin
    Andrew D. Boyd
    Martha M. Werler
    Natalie Parde
    Mary A. Khetani
    Journal of Healthcare Informatics Research, 2023, 7 (4) : 480 - 500
  • [2] School participation among young people with craniofacial microsomia and other childhood-onset disabilities
    Kaelin, Vera C.
    Anaby, Dana
    Werler, Martha M.
    Khetani, Mary A.
    DEVELOPMENTAL MEDICINE AND CHILD NEUROLOGY, 2024, 66 (07) : 939 - 947
  • [3] Caregiver strategies supporting community participation among children and youth with or at risk for disabilities: a mixed-methods study
    Kaelin, Vera C.
    Saluja, Shivani
    Bosak, Dianna L.
    Anaby, Dana
    Werler, Martha
    Khetani, Mary A.
    FRONTIERS IN PEDIATRICS, 2024, 12