Stacked Autoencoder Based Weak Supervision for Social Image Understanding

Chaoyang Xu; Yuanfei Dai; Renjie Lin; Shiping Wang

doi:10.1109/ACCESS.2019.2898991

IEEE Access (Jan 2019)

Affiliations

Chaoyang Xu: School of Information Engineering, Putian University, Putian, China
Yuanfei Dai: College of Mathematics and Computer Science, Fuzhou University, Fuzhou, China
Renjie Lin: College of Mathematics and Computer Science, Fuzhou University, Fuzhou, China
Shiping Wang: College of Mathematics and Computer Science, Fuzhou University, Fuzhou, China

DOI: https://doi.org/10.1109/ACCESS.2019.2898991
Journal volume & issue: Vol. 7
pp. 21777 – 21786

Abstract

Many recent studies have focused on social image understanding, driven by the growing number of images shared on social networks and online communities. However, previous work often fails to learn an effective feature representation because a large proportion of tags are missing or irrelevant, even though matrix completion techniques are frequently used to address this. Autoencoder models have proven effective at learning latent low-dimensional representations in unsupervised learning. In this paper, we propose a new social image understanding model based on deep autoencoders, which learns latent codes shared between social images and their tags, with the tags serving as supervision information in the deep autoencoders. First, multi-modal features are extracted from social images, providing a comprehensive characterization for image semantic understanding, and the social image understanding problem is cast as the minimization of an optimization objective. Second, multi-layered autoencoders with integrated weak supervision are employed to learn an efficient low-dimensional representation from the multi-view feature sources, bridging the semantic gap between image features and tags by minimizing this objective. Finally, we design a new balanced loss function based on binary cross entropy that accounts for highly sparse inputs to improve optimization performance. Extensive experiments on several real-world social image datasets confirm the effectiveness and robustness of the proposed model compared with state-of-the-art methods.
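
The abstract does not give the form of the balanced loss, so the following is only a minimal sketch of one common way to balance binary cross entropy on sparse tag vectors: the positive and negative terms are averaged separately and then combined with equal weight. The function names `plain_bce` and `balanced_bce`, the equal 0.5 weighting, and the clipping constant `eps` are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def plain_bce(y_true, y_pred, eps=1e-7):
    """Standard binary cross entropy averaged over all entries."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.clip(np.asarray(y_pred, dtype=float), eps, 1.0 - eps)
    loss = -(y_true * np.log(y_pred) + (1.0 - y_true) * np.log(1.0 - y_pred))
    return loss.mean()


def balanced_bce(y_true, y_pred, eps=1e-7):
    """Illustrative balanced variant (an assumption, not the paper's loss).

    The positive and negative terms are each normalized by their own
    entry count, so the few non-zero tags in a sparse vector are not
    swamped by the many zeros.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.clip(np.asarray(y_pred, dtype=float), eps, 1.0 - eps)
    n_pos = y_true.sum()
    n_neg = y_true.size - n_pos
    # max(..., 1.0) guards against division by zero when a class is absent.
    pos_loss = -(y_true * np.log(y_pred)).sum() / max(n_pos, 1.0)
    neg_loss = -((1.0 - y_true) * np.log(1.0 - y_pred)).sum() / max(n_neg, 1.0)
    return 0.5 * (pos_loss + neg_loss)


if __name__ == "__main__":
    # A sparse tag vector: one relevant tag out of ten.
    tags = np.array([1, 0, 0, 0, 0, 0, 0, 0, 0, 0])
    # A prediction that ignores the single positive tag.
    lazy_prediction = np.full(10, 0.1)
    print("plain BCE:   ", plain_bce(tags, lazy_prediction))
    print("balanced BCE:", balanced_bce(tags, lazy_prediction))
```

With a sparse target, a prediction that is close to zero everywhere scores well under the plain average because the zeros dominate it. The balanced variant penalizes the missed positive tag more heavily, which is the general motivation the abstract gives for a balanced loss.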

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
