Improving Text-to-Code Generation with Features of Code Graph on GPT-2

Incheon Paik; Jun-Wei Wang

doi:10.3390/electronics10212706

Electronics (Nov 2021)

Affiliations

Incheon Paik: School of Computer Science and Engineering, The University of Aizu, Fukushima 965-8580, Japan
Jun-Wei Wang: Department of Computer Science and Information Engineering, ChaoYang University of Technology, Taichung 413310, Taiwan

DOI: https://doi.org/10.3390/electronics10212706
Journal volume & issue: Vol. 10, no. 21, p. 2706

Abstract

Code generation, an active application area of deep learning models for text, comprises two tasks: code-to-code and text-to-code. A recent approach, GraphCodeBERT, uses a code graph called data flow and showed a good performance improvement. Its base architecture is bidirectional encoder representations from transformers (BERT), which uses the encoder part of a transformer. In contrast, the generative pre-trained transformer (GPT), another architecture built from multiple transformer layers, uses the decoder part and shows strong performance in the multilayer perceptron model. In this study, we investigate the improvement that code graphs, in several variants, bring to GPT-2, referring to the abstract syntax tree (AST) to collect the features of variables in the code. We focus mainly on GPT-2 with additional code graph features that allow the model to learn the effect of the data flow. The experiments are divided into two parts: fine-tuning the existing GPT-2 model, and pre-training from scratch using code data. When we pre-train a new model from scratch with enough data, the model that uses the code graph produces the better result.
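
To illustrate what collecting variable features from a syntax tree can look like, the following is a minimal sketch, not the authors' extraction pipeline. It assumes Python source as input and uses only the standard-library `ast` module; the function name `data_flow_edges` and the (target, source) edge format are hypothetical choices made for this example. Each edge records that the value of one variable comes from another, which is the kind of relation a data flow graph encodes.

```python
import ast


def data_flow_edges(source: str) -> list[tuple[str, str]]:
    """Return (target, source) pairs: the value of `target` comes from `source`.

    Simplified illustration: only plain assignments are handled.
    """
    edges = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Assign):
            # Variable names read on the right-hand side of the assignment.
            reads = [n.id for n in ast.walk(node.value) if isinstance(n, ast.Name)]
            for target in node.targets:
                # Names actually written on the left-hand side (tuple targets included).
                for t in ast.walk(target):
                    if isinstance(t, ast.Name) and isinstance(t.ctx, ast.Store):
                        edges.extend((t.id, r) for r in reads)
    return edges


example = "a = 1\nb = a + 2\nc = a * b\n"
edges = data_flow_edges(example)
print(edges)  # [('b', 'a'), ('c', 'a'), ('c', 'b')]
```

In a setup like the one the abstract describes, edges of this kind would be serialized and supplied to the model alongside the code tokens; how that is done for GPT-2 is not specified here, so the sketch stops at extraction.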

Published in Electronics

ISSN: 2079-9292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics
Website: http://www.mdpi.com/journal/electronics
