Natural Language to Visualization by Neural Machine Translation

Times cited: 55
Authors
Luo, Yuyu [1]
Tang, Nan [2]
Li, Guoliang [1]
Tang, Jiawei [3]
Chai, Chengliang [1]
Qin, Xuedi [1]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci, Beijing, Peoples R China
[2] Hamad Bin Khalifa Univ, QCRI, Ar Rayyan, Qatar
[3] Amer Sch Doha, Doha, Qatar
Funding
China Postdoctoral Science Foundation;
Keywords
Data visualization; Natural languages; Bars; Deep learning; Machine translation; Visualization; Transformers; Natural language interface; data visualization; neural machine translation; chart template;
DOI
10.1109/TVCG.2021.3114848
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Supporting the translation from natural language (NL) queries to visualizations (NL2VIS) can simplify the creation of data visualizations: if successful, anyone can generate visualizations from tabular data using natural language. State-of-the-art NL2VIS approaches (e.g., NL4DV and FlowSense) are based on semantic parsers and heuristic algorithms; they are not end-to-end and are not designed to support (possibly) complex data transformations. Neural machine translation models powered by deep neural networks have made great strides in many machine translation tasks, which suggests that they might be viable for NL2VIS as well. In this paper, we present ncNet, a Transformer-based sequence-to-sequence model for NL2VIS with several novel visualization-aware optimizations, including attention-forcing to optimize the learning process and visualization-aware rendering to produce better visualization results. To enhance the machine's ability to comprehend natural language queries, ncNet is also designed to take an optional chart template (e.g., a pie chart or a scatter plot) as an additional input, where the chart template serves as a constraint that limits what can be visualized. We conducted both a quantitative evaluation and a user study, showing that ncNet achieves good accuracy on the nvBench benchmark and is easy to use.
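The idea of an optional chart template acting as a constraint can be illustrated with a small sketch. This is not ncNet's implementation: the chart-type vocabulary, the function names `allowed_chart_types` and `pick_chart_type`, and the per-type scores are all hypothetical, chosen only to show how a template might narrow the set of outputs a model is allowed to choose from.

```python
from typing import Dict, Optional, Tuple

# Hypothetical chart-type vocabulary; the real model's vocabulary may differ.
CHART_TYPES: Tuple[str, ...] = ("bar", "pie", "line", "scatter")


def allowed_chart_types(chart_template: Optional[str]) -> Tuple[str, ...]:
    """Return the chart types the output may use.

    With no template every type is allowed; with a template only that
    type is allowed, so the template acts as a hard constraint.
    """
    if chart_template is None:
        return CHART_TYPES
    if chart_template not in CHART_TYPES:
        raise ValueError(f"unknown chart template: {chart_template!r}")
    return (chart_template,)


def pick_chart_type(scores: Dict[str, float],
                    chart_template: Optional[str] = None) -> str:
    """Pick the highest-scoring chart type among the allowed ones.

    `scores` stands in for whatever per-type preference a model produces
    (an assumption made for this sketch).
    """
    candidates = allowed_chart_types(chart_template)
    return max(candidates, key=lambda c: scores.get(c, float("-inf")))


# Made-up scores for a query such as "show sales by region".
example_scores = {"bar": 0.6, "pie": 0.3, "line": 0.1}
unconstrained = pick_chart_type(example_scores)
constrained = pick_chart_type(example_scores, chart_template="pie")
```

Without a template the sketch picks the best-scoring type overall; with the `"pie"` template the choice is restricted to pie charts even though another type scores higher.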
Pages: 217-226
Number of pages: 10