PME: pruning-based multi-size embedding for recommender systems

Zirui Liu; Qingquan Song; Li Li; Soo-Hyun Choi; Rui Chen; Xia Hu

doi:10.3389/fdata.2023.1195742

Frontiers in Big Data (Jun 2023)

PME: pruning-based multi-size embedding for recommender systems

Zirui Liu,
Qingquan Song,
Li Li,
Soo-Hyun Choi,
Rui Chen,
Xia Hu

Affiliations

Zirui Liu: Computer Science Department, Rice University, Houston, TX, United States
Qingquan Song: Linkedin, Sunnyvale, CA, United States
Li Li: Samsung Electronics America, Mountain View, CA, United States
Soo-Hyun Choi: Samsung Electronics America, Mountain View, CA, United States
Rui Chen: Samsung Electronics America, Mountain View, CA, United States
Xia Hu: Computer Science Department, Rice University, Houston, TX, United States

DOI: https://doi.org/10.3389/fdata.2023.1195742
Journal volume & issue: Vol. 6

Abstract

Read online

Embedding is widely used in recommendation models to learn feature representations. However, the traditional embedding technique that assigns a fixed size to all categorical features may be suboptimal due to the following reasons. In recommendation domain, the majority of categorical features' embeddings can be trained with less capacity without impacting model performance, thereby storing embeddings with equal length may incur unnecessary memory usage. Existing work that tries to allocate customized sizes for each feature usually either simply scales the embedding size with feature's popularity or formulates this size allocation problem as an architecture selection problem. Unfortunately, most of these methods either have large performance drop or incur significant extra time cost for searching proper embedding sizes. In this article, instead of formulating the size allocation problem as an architecture selection problem, we approach the problem from a pruning perspective and propose Pruning-based Multi-size Embedding (PME) framework. During the search phase, we prune the dimensions that have the least impact on model performance in the embedding to reduce its capacity. Then, we show that the customized size of each token can be obtained by transferring the capacity of its pruned embedding with significant less search cost. Experimental results validate that PME can efficiently find proper sizes and hence achieve strong performance while significantly reducing the number of parameters in the embedding layer.

Published in Frontiers in Big Data

ISSN: 2624-909X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://www.frontiersin.org/journals/big-data

About the journal

Abstract

Keywords