An Interpersonal Dynamics Analysis Procedure With Accurate Voice Activity Detection Using Low-Cost Recording Sensors

被引:0
作者
Lim, Dongcheol [1 ]
Kang, Hyewon [2 ]
Choi, Beomseok [3 ]
Hong, Woonki [4 ]
Lee, Junghye [1 ,5 ,6 ]
机构
[1] Seoul Natl Univ, Technol Management Econ & Policy Program, Seoul 08826, South Korea
[2] Sungkyunkwan Univ, Dept Intelligent Precis Healthcare Convergence, Suwon 16419, South Korea
[3] Ulsan Natl Inst Sci & Technol, Dept Business Analyt, Ulsan 44919, South Korea
[4] Konkuk Univ, Sch Business Adm, Seoul 05029, South Korea
[5] Seoul Natl Univ, Grad Sch Engn Practice, Seoul 08826, South Korea
[6] Seoul Natl Univ, Inst Engn Res, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
Feature extraction; Sensors; Recording; Smoothing methods; Voice activity detection; Wearable sensors; Machine learning; Behavioral sciences; Verbal activity detection; organizational behavior; conversational feature; machine learning; noise smoothing filter; PERFORMANCE; TECHNOLOGY; NETWORKS;
D O I
10.1109/ACCESS.2024.3387279
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Voice data is of special interest to organizational behavior researchers since it can be easily collected and provides a wealth of information for understanding interpersonal dynamics. Due to these benefits, voice activity detection (VAD) using wearable sensor badges has received considerable attention as one of the more objective and effective data-driven analysis methods rather than self-report methods for analyzing interpersonal dynamics. Moreover, with the VAD results, several prior works extracted conversational features, such as speaking times, turn-taking, and overlapped speaking, for attaining high-level organizational insights. However, there is still room for improvement in the accuracy of the VAD models, and there is a lack of research to develop reliable conversational feature extracting algorithms in the previous works. Most of these prior studies relied on sociometric badges, which are costly electronic devices, including voice recording machines and redundant sensors. In this paper, we propose an interpersonal dynamics analysis procedure based on low-cost commercial recording sensors, consisting of data-driven VAD modeling and conversational feature extracting. To accurately identify voice presence, the VAD modeling incorporates three steps: signal preprocessing, derived-variable generation, and VAD with machine learning-based classification and smoothing technique. We conducted experiments, from collecting datasets comprising the human verbal interaction of 15 three-person groups by commercial recording sensors to implementing our procedure on the datasets. Our results show that the proposed procedure excels in voice activity detection, achieving superior accuracy compared to prior studies. This remarkable accuracy subsequently ensures greater reliability in our conversational feature extraction. Thus, the proposed procedure can encourage organizational behavior researchers to acquire objective information about interpersonal dynamics and efficiently obtain high-level organizational insights with cost-effectiveness.
引用
收藏
页码:68427 / 68440
页数:14
相关论文
共 59 条
[1]   Cross-covariance-based features for speech classification in film audio [J].
Benatan, Matt ;
Ng, Kia .
JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2015, 31 :215-221
[2]   Organizational behavior, strategy, performance, and design in Management Science [J].
Boudreau, JW .
MANAGEMENT SCIENCE, 2004, 50 (11) :1463-1476
[3]   SPEECH RATE REDUCTION AND NASALITY IN NORMAL SPEAKERS [J].
BRANCEWICZ, TM ;
REICH, AR .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1989, 32 (04) :837-848
[4]  
Breiman L., 2001, MACH LEARN, V45, P5, DOI https://doi.org/10.1023/A:1010933404324
[5]  
Cai Chen, 2021, AIAM2021: 2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture, P2486, DOI 10.1145/3495018.3501126
[6]   The Promise and Perils of Wearable Sensors in Organizational Research [J].
Chaffin, Daniel ;
Heidl, Ralph ;
Hollenbeck, John R. ;
Howe, Michael ;
Yu, Andrew ;
Voorhees, Clay ;
Calantone, Roger .
ORGANIZATIONAL RESEARCH METHODS, 2017, 20 (01) :3-31
[7]   STRATEGIES OF CONTROL AND ORGANIZATIONAL BEHAVIOR [J].
CHILD, J .
ADMINISTRATIVE SCIENCE QUARTERLY, 1973, 18 (01) :1-17
[8]   Investigating the Interpersonal Dynamics Between Coaches and Athletes Based on Fundamental Principles of Attachment [J].
Davis, Louise ;
Jowett, Sophia .
JOURNAL OF CLINICAL SPORT PSYCHOLOGY, 2010, 4 (02) :112-132
[9]   Deep learning with small datasets: using autoencoders to address limited datasets in construction management [J].
Delgado, Juan Manuel Davila ;
Oyedele, Lukumon .
APPLIED SOFT COMPUTING, 2021, 112
[10]   Entropy of dialogues creates coherent structures in e-mail traffic [J].
Eckmann, JP ;
Moses, E ;
Sergi, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (40) :14333-14337