Application of Seq2Seq Models on Code Correction

Cited by: 6
Authors
Huang, Shan [1]
Zhou, Xiao [2]
Chin, Sang [2, 3, 4]
Affiliations
[1] Boston Univ, Dept Phys, 590 Commonwealth Ave, Boston, MA 02215 USA
[2] Boston Univ, Dept Comp Sci, Boston, MA 02215 USA
[3] MIT, Dept Brain & Cognit Sci, Boston, MA USA
[4] Harvard Univ, Ctr Math Sci & Applicat, Boston, MA 02115 USA
Source
FRONTIERS IN ARTIFICIAL INTELLIGENCE | 2021, Vol. 4
Funding
National Science Foundation (USA)
Keywords
programming language correction; seq2seq architecture; pyramid encoder; attention mechanism; transfer learning;
DOI
10.3389/frai.2021.590215
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We apply several seq2seq models to programming language correction tasks on the Juliet Test Suite for C/C++ and Java of the Software Assurance Reference Datasets, achieving repair rates of 75% (C/C++) and 56% (Java). We introduce a pyramid encoder into these seq2seq models, which significantly improves computational and memory efficiency while achieving repair rates similar to those of their non-pyramid counterparts. Using transfer learning with models pretrained on the Juliet Test Suite, we also carry out an error type classification task on the ITC benchmark examples (only 685 code instances), which points to a novel way of processing small programming language datasets.
Pages: 13