Exploratory text data analysis for quality hypothesis generation

Citations: 28
Authors
Allen, Theodore T. [1]
Sui, Zhenhuan [1,2]
Akbari, Kaveh [1]
Affiliations
[1] Ohio State Univ, Integrated Syst Engn, 1971 Neil Ave,210 Baker Syst, Columbus, OH 43210 USA
[2] ANZ Hong Kong, Hong Kong, Peoples R China
Funding
US National Science Foundation
Keywords
cyber security; exploratory data analysis; graphical data analysis; pattern discovery; quality improvement; text analytics; Twitter analysis
DOI
10.1080/08982112.2018.1481216
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Freestyle text data such as surveys, complaint transcripts, customer ratings, or maintenance squawks can provide critical information for quality engineering. Exploratory text data analysis (ETDA) is proposed here as a special case of exploratory data analysis (EDA) for quality improvement problems with freestyle text data. The ETDA method seeks to extract useful information from the text data to identify hypotheses for additional exploration relating to key inputs or outputs. The proposed four steps of ETDA are: (1) preprocessing of text data, (2) text data analysis and display, (3) salient feature identification, and (4) salient feature interpretation. Five examples illustrate the methods.
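
The four steps named in the abstract can be sketched roughly as follows. This is a minimal illustration only, not the authors' method: the abstract does not say which analysis or display techniques the paper uses, so plain term counts and a text bar chart stand in for them here. The sample records, the stop-word list, the 50% salience threshold, and all function names are assumptions made for the example.

```python
import re
from collections import Counter

# Hypothetical freestyle text records (invented for illustration).
RECORDS = [
    "Brake pedal feels soft after the service visit",
    "Soft brake pedal and noisy brake pads",
    "Radio display is blank after the battery change",
    "Brake noise when stopping, pedal is fine",
]

# Small illustrative stop-word list (an assumption, not the authors' list).
STOP_WORDS = {"the", "and", "is", "after", "when", "a", "an", "of", "to"}


def preprocess(text):
    """Step 1: lowercase, keep alphabetic tokens, drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def term_document_counts(records):
    """Step 2 (analysis): count how many records mention each term."""
    counts = Counter()
    for record in records:
        counts.update(set(preprocess(record)))
    return counts


def pareto_lines(counts, top_n=5):
    """Step 2 (display): text bar chart sorted by descending count."""
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]
    return [f"{term:<10}{'#' * n} {n}" for term, n in ranked]


def salient_terms(counts, min_share, n_records):
    """Step 3: flag terms appearing in at least min_share of records."""
    return sorted(t for t, n in counts.items() if n / n_records >= min_share)


def supporting_records(term, records):
    """Step 4: pull the records behind a salient term for human reading."""
    return [r for r in records if term in preprocess(r)]


counts = term_document_counts(RECORDS)
print("\n".join(pareto_lines(counts)))
for term in salient_terms(counts, min_share=0.5, n_records=len(RECORDS)):
    print(term, "->", supporting_records(term, RECORDS))
```

In this sketch the interpretation step stays with the analyst: the code only surfaces the records behind each frequent term, and a person reading them would form the hypothesis to explore further (for example, that brake pedal feel is worth investigating).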
Pages: 701-712
Page count: 12