Unified modeling language code generation from diagram images using multimodal large language models

Averi Bates; Ryan Vavricka; Shane Carleton; Ruosi Shao; Chongle Pan

Machine Learning with Applications (Jun 2025)

Unified modeling language code generation from diagram images using multimodal large language models

Averi Bates,
Ryan Vavricka,
Shane Carleton,
Ruosi Shao,
Chongle Pan

Affiliations

Averi Bates: School of Computer Science, University of Oklahoma, 110 W. Boyd St., Norman, OK, US
Ryan Vavricka: School of Computer Science, University of Oklahoma, 110 W. Boyd St., Norman, OK, US
Shane Carleton: Enterprise Architecture, MapLarge, 1201 Peachtree Street NE, Building 400, Suite 1750, Atlanta, GA, US
Ruosi Shao: School of Communication, Florida State University, 4100 University Center, Building C, Tallahassee, FL, US
Chongle Pan: School of Computer Science, University of Oklahoma, 110 W. Boyd St., Norman, OK, US; Corresponding author.

Journal volume & issue: Vol. 20
p. 100660

Abstract

Read online

The Unified Modeling Language is a standardized visual language widely used for modeling and documenting the design of software systems. Although many tools are available that generate UML diagrams from UML code, generating executable UML code from image-based UML diagrams remains challenging. This paper proposes a new approach to generate UML code using a large multimodal language model automatically. Synthetic UML activity and sequence diagram datasets were created to train and test the model. We compared the standard fine-tuning with LoRA techniques to optimize base models. The experiments measured the code generation accuracy across different model sizes and training strategies. These results demonstrated that domain-adapted MM-LLMs perform for UML code generation automation, whereby, at the best model, it achieved BLEU and SSIM of 0.779 and 0.942 on sequence diagrams. This will enable the modernization of legacy systems and decrease the manual effort put into software development workflows.

Published in Machine Learning with Applications

ISSN: 2666-8270 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Science: Science (General): Cybernetics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.journals.elsevier.com/machine-learning-with-applications

About the journal

Abstract

Keywords