ProcData: An R Package for Process Data Analysis

被引:0
作者
Xueying Tang
Susu Zhang
Zhi Wang
Jingchen Liu
Zhiliang Ying
机构
[1] University of Arizona,
[2] University of Illinois at Urbana-Champaign,undefined
[3] Columbia University,undefined
来源
Psychometrika | 2021年 / 86卷
关键词
process data analysis; multidimensional scaling; autoencoder; sequence model;
D O I
暂无
中图分类号
学科分类号
摘要
Process data refer to data recorded in log files of computer-based items. These data, represented as timestamped action sequences, keep track of respondents’ response problem-solving behaviors. Process data analysis aims at enhancing educational assessment accuracy and serving other assessment purposes by utilizing the rich information contained in response processes. The R package ProcData presented in this article is designed to provide tools for inspecting, processing, and analyzing process data. We define an S3 class ‘proc’ for organizing process data and extend generic methods summary and print for ‘proc’. Feature extraction methods for process data are implemented in the package for compressing information in the irregular response processes into regular numeric vectors. ProcData also provides functions for making predictions from neural-network-based sequence models. In addition, a real dataset of response processes from the climate control item in the 2012 Programme for International Student Assessment is included in the package.
引用
收藏
页码:1058 / 1083
页数:25
相关论文
共 45 条
[1]  
Bengio Y(1994)Learning long-term dependencies with gradient descent is difficult IEEE Transactions on Neural Networks 5 157-166
[2]  
Simard P(1970)The convergence of a class of double-rank minimization algorithms 1. General considerations IMA Journal of Applied Mathematics 6 76-90
[3]  
Frasconi P(2019)Statistical analysis of complex problem-solving process data: An event history analysis approach Frontiers in Psychology 10 486-322
[4]  
Broyden CG(1970)A new approach to variable metric algorithms The Computer Journal 13 317-26
[5]  
Chen Y(1970)A family of variable-metric methods derived by variational means Mathematics of Computation 24 23-17
[6]  
Li X(2016)Taming log files from game/simulation-based assessments: Data models and data analysis tools ETS Research Report Series 2016 1-1780
[7]  
Liu J(1997)Long short-term memory Neural Computation 9 1735-939
[8]  
Ying Z(2018)Multidimensional scaling with very large datasets Journal of Computational and Graphical Statistics 27 935-407
[9]  
Fletcher R(2018)Data mining techniques in analyzing process data: A didactic Frontiers in Psychology 9 2231-656
[10]  
Goldfarb D(2019)Exploring multiple goals balancing in complex problem solving based on log data Frontiers in Psychology 10 1975-501