Automatic Function Naming Based on Graph Convolutional Network

doi:10.15888/j.cnki.csa.008042



Authors: 王堃 (WANG Kun), 李征 (LI Zheng), 刘勇 (LIU Yong)
Affiliation: College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
Fund Project: National Natural Science Foundation of China (61902015)



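Abstract (translated from the Chinese original):

Automatic function naming aims to generate a target function name for a given piece of source code, improving code readability and speeding up software development; it is an important research task in software engineering. Existing machine-learning-based techniques mainly encode the source code with sequence models and then generate the function name, but they suffer from the long-range dependency problem and from difficulty in encoding code structure. To better extract the structural and semantic information in programs, this paper proposes TrGCN (a Transformer and GCN based automatic method naming), a neural network model based on the Graph Convolutional Network (GCN). TrGCN uses the self-attention mechanism of the Transformer to alleviate the long-range dependency problem and adopts a Character-word attention mechanism to extract the semantic information of code. TrGCN also introduces a GCN-based AST Encoder, which enriches the information carried by the feature vectors of AST nodes and models the structural information of source code well. In the empirical study, three datasets of different scales are used to evaluate the effectiveness of TrGCN. The experimental results show that TrGCN generates function names better than the currently widely used models code2seq and Sequence-GNNs, improving the F1 score by an average of 5.2% and 2.1%, respectively.

Keywords: deep learning; graph convolutional neural network; code representation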
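To make the idea of a GCN-based AST encoder more concrete, the following is a minimal sketch of a single graph-convolution step over a toy AST. It assumes the standard propagation rule ReLU(D^{-1/2}(A + I)D^{-1/2} H W); the abstract does not state which variant TrGCN uses, so this is an illustration rather than the authors' implementation. The toy tree, the feature vectors, the identity weight matrix, and all function names are hypothetical.

```python
import math


def normalized_adjacency(num_nodes, edges):
    """Build D^{-1/2} (A + I) D^{-1/2} for an undirected graph as nested lists."""
    a = [[0.0] * num_nodes for _ in range(num_nodes)]
    for i in range(num_nodes):
        a[i][i] = 1.0  # self-loop so each node keeps its own features
    for parent, child in edges:
        a[parent][child] = 1.0
        a[child][parent] = 1.0  # treat AST edges as undirected (assumption)
    inv_sqrt_deg = [1.0 / math.sqrt(sum(row)) for row in a]
    return [
        [inv_sqrt_deg[i] * a[i][j] * inv_sqrt_deg[j] for j in range(num_nodes)]
        for i in range(num_nodes)
    ]


def matmul(x, y):
    """Plain nested-list matrix product."""
    return [
        [sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
        for i in range(len(x))
    ]


def gcn_layer(a_hat, features, weights):
    """One propagation step: ReLU(a_hat @ features @ weights)."""
    aggregated = matmul(a_hat, features)   # mix each node with its neighbours
    projected = matmul(aggregated, weights)  # linear projection
    return [[max(0.0, v) for v in row] for row in projected]


# Hypothetical toy AST: node 0 is the method root, nodes 1 and 2 are its children.
edges = [(0, 1), (0, 2)]
features = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
weights = [[1.0, 0.0], [0.0, 1.0]]  # identity, so only the aggregation is visible

a_hat = normalized_adjacency(3, edges)
updated = gcn_layer(a_hat, features, weights)
```

After one step, the root's vector contains a share of its children's features and each child's vector contains a share of the root's, which is the sense in which graph convolution enriches node feature vectors with structural context.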

Abstract:

Automatic method naming, an important task in software engineering, aims to generate a target function name for input source code, enhancing the readability of program code and accelerating software development. Existing automatic method naming approaches based on machine learning mainly encode the source code with sequence models to generate the function name automatically. However, these approaches face the problems of long-term dependency and code structure encoding. To better extract structural and semantic information from programs, we propose an automatic function naming method called TrGCN, based on Transformer and Graph Convolutional Network (GCN). In this method, the self-attention mechanism in Transformer is used to alleviate the long-term dependency problem, and the Character-word attention mechanism is used to extract the semantic information of code. TrGCN introduces a GCN-based AST Encoder that enriches the feature vectors of AST nodes and models the structural information of the source code well. Empirical studies are conducted on three Java datasets. The results show that TrGCN outperforms the widely used approaches code2seq and Sequence-GNNs in automatic method naming: its F1-score is higher than theirs by 5.2% and 2.1% on average, respectively.

Key words: deep learning; Graph Convolutional Network (GCN); code representation
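The abstract reports results as F1-scores but does not say how they are computed. A common convention in method-name prediction is to compare the predicted and reference names at the subtoken level; the sketch below assumes that convention and scores a single prediction. The helper names `split_subtokens` and `subtoken_f1` are hypothetical, and a real evaluation would normally aggregate counts over a whole test set rather than score one name at a time.

```python
import re
from collections import Counter


def split_subtokens(name):
    """Split a camelCase or snake_case identifier into lowercase subtokens."""
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", name)
    return [p.lower() for p in parts]


def subtoken_f1(predicted, reference):
    """Return (precision, recall, f1) over subtokens, ignoring order."""
    pred = Counter(split_subtokens(predicted))
    ref = Counter(split_subtokens(reference))
    overlap = sum((pred & ref).values())  # multiset intersection
    if overlap == 0:
        return 0.0, 0.0, 0.0
    precision = overlap / sum(pred.values())
    recall = overlap / sum(ref.values())
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical example: the model predicts "getName" for a method
# whose true name is "getFileName".
precision, recall, f1 = subtoken_f1("getName", "getFileName")
```

In this example both predicted subtokens appear in the reference, so precision is 1, while only two of the three reference subtokens are recovered, so recall is 2/3 and the F1 score is 0.8.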

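Cite this article

WANG Kun, LI Zheng, LIU Yong. Automatic Function Naming Based on Graph Convolutional Network. 计算机系统应用 (Computer Systems & Applications), 2021, 30(8): 256-265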


History

Received: 2020-11-23
Last revised: 2020-12-22
Published online: 2021-08-03
