CodeLin: An in situ visualization method for understanding data transformation scripts

被引:0
作者
Cai, Xiwen [1 ]
Xiong, Kai [1 ]
Luo, Zhongsu [1 ]
Weng, Di [2 ]
Ye, Shuainan [1 ]
Wu, Yingcai [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou, Peoples R China
[2] Zhejiang Univ, Sch Software Technol, Hangzhou, Peoples R China
关键词
Data transformation; Program visualization; In situ visualization; PROVENANCE;
D O I
10.1016/j.visinf.2025.03.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding data transformation scripts is an essential task for data analysts who write code to process data. However, this can be challenging, especially when encountering unfamiliar scripts. Comments can help users understand data transformation code, but well-written comments are not always present. Visualization methods have been proposed to help analysts understand data transformations, but they generally require a separate view, which may distract users and entail efforts for connecting visualizations and code. In this work, we explore the use of in situ program visualization to help data analysts understand data transformation scripts. We present CodeLin, a new visualization method that combines word-sized glyphs for presenting transformation semantics and a lineage graph for presenting data lineage in an in situ manner. Through a use case, code pattern demonstrations, and a preliminary user study, we demonstrate the effectiveness and usability of CodeLin. We further discuss how visualization can help users understand data transformation code. (c) 2025 The Authors. Published by Elsevier B.V. on behalf of Zhejiang University and Zhejiang University Press Co. Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页数:13
相关论文
共 48 条
[1]  
Aftandilian EE, 2010, SOFTVIS 2010: PROCEEDINGS OF THE 2010 INTERNATIONAL SYMPOSIUM ON SOFTWARE VISUALIZATION, P53, DOI 10.1145/1879211.1879222
[2]   Lineage-driven Fault Injection [J].
Alvaro, Peter ;
Rosen, Joshua ;
Hellerstein, Joseph M. .
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, :331-346
[3]  
Beck F., 2013, P IEEE WORK C SOFTW, P1
[4]  
Beck F, 2013, CONF PROC INT SYMP C, P63, DOI 10.1109/ICPC.2013.6613834
[5]   Evaluation of Filesystem Provenance Visualization Tools [J].
Borkin, Michelle A. ;
Yeh, Chelsea S. ;
Boyd, Madelaine ;
Macko, Peter ;
Gajos, Krzysztof Z. ;
Seltzer, Margo ;
Pfister, Hanspeter .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (12) :2476-2485
[6]   D3: Data-Driven Documents [J].
Bostock, Michael ;
Ogievetsky, Vadim ;
Heer, Jeffrey .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (12) :2301-2309
[7]  
Callahan S.P., 2006, P 2006 ACM SIGMOD IN, P745, DOI 10.1145/1142473.1142574. 2
[8]   SenseMap: Urban Performance Visualization and Analytics Via Semantic Textual Similarity [J].
Chen, Juntong ;
Huang, Qiaoyun ;
Wang, Changbo ;
Li, Chenhui .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (09) :6275-6290
[9]   An open dataset of data lineage graphs for data governance research [J].
Chen, Yunpeng ;
Zhao, Ying ;
Li, Xuanjing ;
Zhang, Jiang ;
Long, Jiang ;
Zhou, Fangfang .
VISUAL INFORMATICS, 2024, 8 (01) :1-5
[10]  
Cheng L., 2024, IEEE Transactions on Visualization and Computer Graphics