Applied Sciences (May 2019)

A Smart System for Text-Lifelog Generation from Wearable Cameras in Smart Environment Using Concept-Augmented Image Captioning with Modified Beam Search Strategy

  • Viet-Khoa Vo-Ho,
  • Quoc-An Luong,
  • Duy-Tam Nguyen,
  • Mai-Khiem Tran,
  • Minh-Triet Tran

DOI
https://doi.org/10.3390/app9091886
Journal volume & issue
Vol. 9, no. 9
p. 1886

Abstract

Read online

During a lifetime, a person can have many wonderful and memorable moments that he/she wants to keep. With the development of technology, people now can store a massive amount of lifelog information via images, videos or texts. Inspired by this, we develop a system to automatically generate caption from lifelog pictures taken from wearable cameras. Following up on our previous method introduced at the SoICT 2018 conference, we propose two improvements in our captioning method. We trained and tested the model on the baseline MSCOCO datasets and evaluated on different metrics. The results show better performance compared to our previous model and to some other image captioning methods. Our system also shows effectiveness in retrieving relevant data from captions and achieve high rank in ImageCLEF 2018 retrieval challenge.

Keywords