A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process

Yi Liu; Hsueh-Ping Lu; Ching-Hao Lai

doi:10.1109/ACCESS.2022.3158952

IEEE Access (Jan 2022)

A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process

Yi Liu,
Hsueh-Ping Lu,
Ching-Hao Lai

Affiliations

Yi Liu: ORCiD; Department of Data Science Analysis, Division of Digital Technology, AU Optronics Corporation, Taichung, Taiwan
Hsueh-Ping Lu: ORCiD; Division of Digital Technology, AU Optronics Corporation, Hsinchu, Taiwan
Ching-Hao Lai: ORCiD; Department of Data Science Analysis, Division of Digital Technology, AU Optronics Corporation, Taichung, Taiwan

DOI: https://doi.org/10.1109/ACCESS.2022.3158952
Journal volume & issue: Vol. 10
pp. 33026 – 33036

Abstract

Read online

In Thin-Film Transistor Liquid-Crystal Display (TFT-LCD) manufacturing, conducting a machine learning based system with multiple data types has become actively desired to solve complicated problems. This paper proposes a multi-modal learning approach: TabVisionNet, which is modeled by utilizing the information from both tabular data and image data. A novel attention mechanism called Sequential Decision Attention was integrated into the multi-modal modeling framework that improves the comprehension of the information from two modalities. This cross-modal attention mechanism can capture the complex relationship between modalities then gain better generalization and faster convergence in the training process. Conducting an experiment, the performance of our novel approach was significantly better than single-modal and other multi-modal learning approaches in our real case scenario.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords