Incorporating Concreteness in Multi-Modal Language Models with Curriculum Learning

Erhan Sezerer; Selma Tekir

doi:10.3390/app11178241

Applied Sciences (Sep 2021)


Affiliations

Erhan Sezerer: Department of Computer Engineering, Izmir Institute of Technology, 35430 Izmir, Turkey
Selma Tekir: Department of Computer Engineering, Izmir Institute of Technology, 35430 Izmir, Turkey

DOI: https://doi.org/10.3390/app11178241
Journal volume & issue: Vol. 11, no. 17, p. 8241

Abstract

Over the last few years, a growing number of studies have incorporated experiential (visual) information by building multi-modal language models and representations. Several studies have shown that language acquisition in humans starts with learning concrete concepts through images and then continues with learning abstract ideas through text. In this work, curriculum learning is used to teach the model concrete and abstract concepts through images and their corresponding captions, with the goal of multi-modal language modeling and representation. We use BERT and ResNet-152 for the text and image modalities, respectively, and combine them using attentive pooling to perform pre-training on a newly constructed dataset collected from Wikimedia Commons based on concrete/abstract words. To evaluate the proposed model, downstream tasks and ablation studies are performed. The contribution of this work is two-fold: a new dataset is constructed from Wikimedia Commons based on concrete/abstract words, and a new multi-modal pre-training approach based on curriculum learning is proposed. The results show that the proposed multi-modal pre-training approach contributes to the success of the model.
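The abstract does not spell out how the curriculum is scheduled. As a rough illustration of the general idea only, the sketch below orders image-caption pairs from most to least concrete and splits them into training stages. The `Sample` class, the `curriculum_stages` helper, the file names, and the concreteness scores are all hypothetical and are not taken from the paper or its dataset.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Sample:
    caption: str
    image_path: str      # hypothetical file name, for illustration only
    concreteness: float  # assumed score: higher means more concrete


def curriculum_stages(samples: List[Sample], num_stages: int) -> List[List[Sample]]:
    """Sort samples from most to least concrete and split them into stages.

    This is one possible scheduling scheme, assumed for illustration;
    the paper may order or group its data differently.
    """
    if num_stages < 1:
        raise ValueError("num_stages must be at least 1")
    if not samples:
        return []
    ordered = sorted(samples, key=lambda s: s.concreteness, reverse=True)
    stage_size = -(-len(ordered) // num_stages)  # ceiling division
    return [ordered[i:i + stage_size] for i in range(0, len(ordered), stage_size)]


# Made-up scores, chosen only to show the ordering.
samples = [
    Sample("an apple on a table", "apple.jpg", 4.9),
    Sample("a symbol of an idea", "idea.jpg", 1.6),
    Sample("a dog in a park", "dog.jpg", 4.8),
    Sample("a statue representing justice", "justice.jpg", 1.5),
]

stages = curriculum_stages(samples, num_stages=2)
for index, stage in enumerate(stages):
    print(index, [s.caption for s in stage])
```

Under these assumptions, a pre-training loop would iterate over `stages` in order, so the model sees the concrete pairs before the abstract ones.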

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
