Structure-Aware Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Yahao Hu; Yifei Xie; Tianfeng Wang; Man Chen; Zhisong Pan

doi:10.3390/math11204317

Mathematics (Oct 2023)

Structure-Aware Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Yahao Hu,
Yifei Xie,
Tianfeng Wang,
Man Chen,
Zhisong Pan

Affiliations

Yahao Hu: Command and Control Engineering College, Army Engineering University of PLA, Nanjing 210007, China
Yifei Xie: Command and Control Engineering College, Army Engineering University of PLA, Nanjing 210007, China
Tianfeng Wang: Command and Control Engineering College, Army Engineering University of PLA, Nanjing 210007, China
Man Chen: Command and Control Engineering College, Army Engineering University of PLA, Nanjing 210007, China
Zhisong Pan: Command and Control Engineering College, Army Engineering University of PLA, Nanjing 210007, China

DOI: https://doi.org/10.3390/math11204317
Journal volume & issue: Vol. 11, no. 20
p. 4317

Abstract

Read online

With the growing scale of pre-trained language models (PLMs), full parameter fine-tuning becomes prohibitively expensive and practically infeasible. Therefore, parameter-efficient adaptation techniques for PLMs have been proposed to learn through incremental updates of pre-trained weights, such as in low-rank adaptation (LoRA). However, LoRA relies on heuristics to select the modules and layers to which it is applied, and assigns them the same rank. As a consequence, any fine-tuning that ignores the structural information between modules and layers is suboptimal. In this work, we propose structure-aware low-rank adaptation (SaLoRA), which adaptively learns the intrinsic rank of each incremental matrix by removing rank-0 components during training. We conduct comprehensive experiments using pre-trained models of different scales in both task-oriented (GLUE) and task-agnostic (Yelp and GYAFC) settings. The experimental results show that SaLoRA effectively captures the structure-aware intrinsic rank. Moreover, our method consistently outperforms LoRA without significantly compromising training efficiency.

Published in Mathematics

ISSN: 2227-7390 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/mathematics

About the journal

Abstract

Keywords