Applied Sciences (May 2022)

Machine Learning-Based Automatic Utterance Collection Model for Language Development Screening of Children

  • Jeong-Myeong Choi,
  • Yoon-Kyoung Lee,
  • Jong-Dae Kim,
  • Chan-Young Park,
  • Yu-Seop Kim

DOI
https://doi.org/10.3390/app12094747
Journal volume & issue
Vol. 12, no. 9
p. 4747

Abstract

Read online

To assess a child’s language development, utterance data are required. The approach of recording and transcribing the conversation between the expert and the child is mostly utilized to obtain utterance data. Because data are obtained through one-on-one interactions, this approach is costly. In addition, depending on the expert, subjective dialogue situations may be incorporated. To acquire speech data, we present a machine learning-based phrase generating model. It has the benefit of being able to cope with several children, which reduces costs and allows for the collection of objectified utterance data through consistent conversation settings. Children’s utterances are initially categorized as topic maintenance or topic change, with rule-based replies based on scenarios being formed in the instance of a topic change. When it comes to topic maintenance, it encourages the child to say more by answering with imitative phrases. The strategy we suggest has the potential to reduce the cost of collecting data for evaluating children’s language development while maintaining data collection impartiality.

Keywords