Jisuanji kexue (Jan 2023)

Feature Extraction Method for Public Component Libraries Based on Cross-fingerprint Analysis

  • GUO Wei, WU Zehui, WU Qianqiong, LI Xixing

DOI
https://doi.org/10.11896/jsjkx.211100121
Journal volume & issue
Vol. 50, no. 1
pp. 373 – 379

Abstract

Read online

The widespread use of software public component libraries increases the speed of software development while expanding the attack surface of software.Vulnerabilities that exist in public component libraries are widely distributed in software that uses the library files,and the compatibility,stability,and development delays make it difficult to fix such vulnerabilities and the patching period is long.Software component analysis is an important tool to solve such problems,but limited by the problem of ineffective feature selection and difficulties in extracting accurate features from public component libraries,the accuracy of component analysis is not high and generally stays at the level of kind location.In this paper,we propose a public component library feature extraction method based on cross-fingerprint analysis,build a fingerprint library based on 25 000 open source projects on GitHub platform,propose source string role classification,export function fingerprint analysis,binary compilation fingerprint analysis,etc.to extract cross-fingerprints of component libraries,realize the accurate localization of public component libraries,develop a prototype tool LVRecognizer,test and evaluate 516 real softwares,and obtain a accuracy rate of 94.74%.

Keywords