Implicit Life Event Discovery From Call Transcripts Using Temporal Input Transformation Network

被引：9

作者：

Ebadi, Nima ^{[1
,2
]}

Lwowski, Brandon ^{[1
,3
]}

Jaloli, Mehrad ^{[1
,2
]}

Rad, Paul ^{[1
,2
,3
]}

机构：

[1] Univ Texas San Antonio, Secure AI & Auton Lab, San Antonio, TX 78249 USA

[2] Univ Texas San Antonio, Elect & Comp Engn Dept, San Antonio, TX 78249 USA

[3] Univ Texas San Antonio, Informat Syst & Secur Dept, San Antonio, TX 78249 USA

来源：

IEEE ACCESS | 2019年 / 7卷

关键词：

Implicit event discovery; call transcripts; deep learning; recurrent neural network; machine learning; natural language processing; text classification; topic modeling; event detection; SPEECH;

D O I：

10.1109/ACCESS.2019.2954884

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Customer-agent conversations (i.e. call transcripts) are invaluable source for companies as they convey direct information from their customers implicit and explicit behaviour. Identifying customer-related events is an important task in customer services which is possible from the call transcripts. However, call centers produces a vast amount of transcripts which makes the manual or semi-manual processing of such raw datasets quite challenging. Furthermore, customer-agent call transcripts tend not to explicitly denote events that might be beneficial to customer services. Albeit being highly researched across multiple domains in the literature, event detection, especially implicit life event detection have not been well examined from call transcripts due to a lack of proper large-scale dataset. In this research, we propose a novel deep learning approach based on latent topic modeling and deep recurrent neural networks with memory units to automatically detect implicit events from a customer's history of call transcripts. These implicit events are detected prior to the report date of that event thereby not containing any explicit topic/feature. We provide a case study on a real-life, large-scale data of more than 800K call transcripts from a large financial services company in the U.S. to examine the practical features and challenges of this problem. The evaluation results demonstrate the potential applicability of our implicit life event detection as it achieves a macro-recall score of 53 (macro-f1 of 47.5) on a highly imbalanced test set, negative samples are 95% of the data. Our model beats the the state-of-the-art text classification benchmarks by macro-f1 score of 5.6 and macro-recall of 8.8 on average, and performs better than the ensemble of all single-document and sequential classification benchmarks albeit being significantly less complex. The comparison results show the importance as well as our model's capability of capturing the mutual information of a sequence of call transcripts in detecting the implicit life events.

引用

页码：172178 / 172189

页数：12

共 62 条

[1]

Abadi M., 2015, TENSORFLOW LARGE SCA, DOI DOI 10.48550/ARXIV.1603.04467

[2]

Agrawal A., 2019, ARXIV190301855

[3] An information-theoretic perspective of tf-idf measures [J].

Aizawa, A .

INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (01) :45-65

[4]

[Anonymous], 2016, DEEP LEARNING

[5]

[Anonymous], 2015, ADV NEURAL INFORM PR

[6]

[Anonymous], 2009, P 18 ACM C INF KNOWL

[7]

[Anonymous], 2001, ADV NEURAL INF PROCE

[8]

[Anonymous], 2016, KDD16 P 22 ACM, DOI DOI 10.1145/2939672.2939785

[9]

Balahur A., 2009, Proceedings of the 1st Workshop on Opinion Mining and Sentiment Analysis WOMSA, V9, P1

[10]

Balahur A, 2008, PROCES LENG NAT, P107

← 1 2 3 4 5 6 7 →