Combining Convolutional Neural Network and Markov Random Field for Semantic Image Retrieval

Haijiao Xu; Changqin Huang; Xiaodi Huang; Chunyan Xu; Muxiong Huang

doi:10.1155/2018/6153607

Advances in Multimedia (Jan 2018)

Combining Convolutional Neural Network and Markov Random Field for Semantic Image Retrieval

Haijiao Xu,
Changqin Huang,
Xiaodi Huang,
Chunyan Xu,
Muxiong Huang

Affiliations

Haijiao Xu: School of Information Technology in Education, South China Normal University, Guangzhou, China
Changqin Huang: School of Information Technology in Education, South China Normal University, Guangzhou, China
Xiaodi Huang: School of Computing and Mathematics, Charles Sturt University, Albury, NSW, Australia
Chunyan Xu: School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China
Muxiong Huang: School of Information Technology in Education, South China Normal University, Guangzhou, China

DOI: https://doi.org/10.1155/2018/6153607
Journal volume & issue: Vol. 2018

Abstract

Read online

With the rapidly growing number of images over the Internet, efficient scalable semantic image retrieval becomes increasingly important. This paper presents a novel approach for semantic image retrieval by combining Convolutional Neural Network (CNN) and Markov Random Field (MRF). As a key step, image concept detection, that is, automatically recognizing multiple semantic concepts in an unlabeled image, plays an important role in semantic image retrieval. Unlike previous work that uses single-concept classifiers one by one, we detect semantic multiconcept by using a multiconcept scene classifier. In other words, our approach takes multiple concepts as a holistic scene for multiconcept scene learning. Specifically, we first train a CNN as a concept classifier, which further includes two types of classifiers: a single-concept fully connected classifier that is best suited to single-concept detection and a multiconcept scene fully connected classifier that is good for holistic scene detection. Then we propose an MRF-based late fusion approach that is able to effectively learn the semantic correlation between the single-concept classifier and multiconcept scene classifier. Finally, the semantic correlation among the subconcepts of images is cought to further improve detection precision. In order to investigate the feasibility and effectiveness of our proposed approach, we conduct comprehensive experiments on two publicly available image databases. The results show that our proposed approach outperforms several state-of-the-art approaches.

Published in Advances in Multimedia

ISSN: 1687-5680 (Print); 1687-5699 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://onlinelibrary.wiley.com/journal/6048

About the journal