Project-Level Encoding for Neural Source Code Summarization of Subroutines

被引：24

作者：

Bansal, Aakash ^{[1
]}

Haque, Sakib ^{[1
]}

McMillan, Collin ^{[1
]}

机构：

[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA

来源：

2021 IEEE/ACM 29TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2021) | 2021年

关键词：

source code summarization; automatic documentation generation; neural networks; PROGRAM COMPREHENSION;

D O I：

10.1109/ICPC52881.2021.00032

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Source code summarization of a subroutine is the task of writing a short, natural language description of that subroutine. The description usually serves in documentation aimed at programmers, where even brief phrase (e.g. "compresses data to a zip file") can help readers rapidly comprehend what a subroutine does without resorting to reading the code itself. Techniques based on neural networks (and encoder-decoder model designs in particular) have established themselves as the state-of-the-art. Yet a problem widely recognized with these models is that they assume the information needed to create a summary is present within the code being summarized itself - an assumption which is at odds with program comprehension literature. Thus a current research frontier lies in the question of encoding source code context into neural models of summarization. In this paper, we present a project-level encoder to improve models of code summarization. By project-level, we mean that we create a vectorized representation of selected code files in a software project, and use that representation to augment the encoder of state-of-the-art neural code summarization techniques. We demonstrate how our encoder improves several existing models, and provide guidelines for maximizing improvement while controlling time and resource costs in model size.

引用

页码：253 / 264

页数：12

共 50 条

[31] A Survey of Automatic Source Code Summarization
Zhang, Chunyan
Wang, Junchao
Zhou, Qinglei
Xu, Ting
Tang, Ke
Gui, Hairen
Liu, Fudong
SYMMETRY-BASEL, 2022, 14 (03):
[32] A Neural-Network based Code Summarization Approach by Using Source Code and its Call Dependencies
Liu, Bohong
Wang, Tao
Zhang, Xunhui
Fan, Qiang
Yin, Gang
Deng, Jinsheng
11TH ASIA-PACIFIC SYMPOSIUM ON INTERNETWARE (INTERNETWARE 2019), 2019,
[33] INTEGRATED MODEL FOR PROJECT-LEVEL MANAGEMENT OF FLEXIBLE PAVEMENTS
RADA, GR
PERL, J
WITCZAK, MW
JOURNAL OF TRANSPORTATION ENGINEERING-ASCE, 1986, 112 (04): : 381 - 399
[34] Code Summarization with Project-Specific Features
Wang, Yu
Liu, Xin
Lu, Xuesong
Zhou, Aoying
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-APPLIED DATA SCIENCE TRACK, PT IX, ECML PKDD 2024, 2024, 14949 : 190 - 206
[35] Innovation at the project-level: the role of organisational innovation capabilities
Molden, Lars Hovdan
Vakulenko, Maryna
Clausen, Tommy Hoyvarde
INTERNATIONAL JOURNAL OF TECHNOLOGY MANAGEMENT, 2025, 97 (01)
[36] Survey of screening procedures for project-level conformity analyses
Houk, J
Claggett, M
ENERGY AND ENVIRONMENTAL CONCERNS 2004, 2004, (1880): : 50 - 58
[37] Towards Modeling Human Attention from Eye Movements for Neural Source Code Summarization
Bansal A.
Sharif B.
McMillan C.
Proceedings of the ACM on Human-Computer Interaction, 2023, 7 (ETRA)
[38] Competitiveness of green and yellow Hydrogen: A project-level analysis
Weissensteiner, Felix
JOURNAL OF CLEANER PRODUCTION, 2025, 496
[39] Florida DOT Project-Level Bridge Management Models
Thompson, Paul D.
Sobanjo, John O.
Kerr, Richard
JOURNAL OF BRIDGE ENGINEERING, 2003, 8 (06) : 345 - 352
[40] Code Structure-Guided Transformer for Source Code Summarization
Gao, Shuzheng
Gao, Cuiyun
He, Yulan
Zeng, Jichuan
Nie, Lunyiu
Xia, Xin
Lyu, Michael
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (01)

← 1 2 3 4 5 →