JOURNAL ARTICLE
CodeGen-Search: A Code Generation Model Incorporating Similar Sample Information.
Published In: International Journal of Software Engineering & Knowledge Engineering, 2023, v. 33, n. 11/12. P. 1899 1 of 3
Database: Business Source Ultimate 2 of 3
Authored By: Li, HongWei; Kuang, JiangLing; Zhong, MaoSheng; Wang, ZhiXiang; Liu, Gen; Liu, GanLin; Xiao, YingJian 3 of 3
Abstract
Code generation has a positive significance in supporting software development, reducing labor intensity, and improving development efficiency. Some scholars use similar code information to enhance the quality of code generation. However, to improve the efficiency and accuracy of programming in daily development tasks, developers often search for similar samples as references. They get the code's syntactic structure and semantic information from similar samples to assist in programming development. Inspired by this, we argue that similar samples are helpful for code generation. This paper proposes a CodeGen-Search model to improve code generation quality by incorporating similar samples. To fully utilize the information of similar samples, the model adopts the "pre-training + fine-tuning" pattern. The model uses a minimum edit distance algorithm to find some similar samples with natural language (NL), and uses different encoders to extract the features of the NL and the code in similar samples. Experimental results show that our model efficiently improves the quality of the generated code. Compared to the state-of-the-art model, the CodeGen-Search model improves the BLEU by 1.5%, the Rough by 0.8% on the HS dataset, and the StrAcc by 0.5% on the ATIS dataset. [ABSTRACT FROM AUTHOR]
Additional Information
- Source:International Journal of Software Engineering & Knowledge Engineering. 2023/11, Vol. 33, Issue 11/12, p1899
- Document Type:Article
- Subject Area:Computer Science
- Publication Date:2023
- ISSN:0218-1940
- DOI:10.1142/S0218194023500584
- Accession Number:174823477
- Copyright Statement:Copyright of International Journal of Software Engineering & Knowledge Engineering is the property of World Scientific Publishing Company and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Looking to go deeper into this topic? Look for more articles on EBSCOhost.