Experimentation for Chatbot Usability Evaluation: A Secondary Study

Ranci Ren; Mireya Zapata; John W. Castro; Oscar Dieste; Silvia T. Acuna

doi:10.1109/ACCESS.2022.3145323

IEEE Access (Jan 2022)

Affiliations

Ranci Ren: Departamento de Ingeniería Informática, Escuela Politécnica Superior, Universidad Autónoma de Madrid, Madrid, Spain
Mireya Zapata: Research Center of Mechatronics and Interactive Systems (MIST), Universidad Tecnológica Indoamérica, Quito, Ecuador
John W. Castro: Departamento de Ingeniería Informática y Ciencias de la Computación, Universidad de Atacama, Copiapó, Chile
Oscar Dieste: Escuela Técnica Superior de Ingenieros Informáticos, Universidad Politécnica de Madrid, Boadilla del Monte, Spain
Silvia T. Acuna: Departamento de Ingeniería Informática, Escuela Politécnica Superior, Universidad Autónoma de Madrid, Madrid, Spain

DOI: https://doi.org/10.1109/ACCESS.2022.3145323
Journal volume & issue: Vol. 10
pp. 12430 – 12464

Abstract

Interest in chatbot development is on the rise. Because usability evaluation is an essential step in chatbot development, the number of experimental studies on chatbot usability has grown as well, which makes a systematic mapping study opportune. We conducted such a study to identify the research questions, characteristics, and metrics used to evaluate the usability of chatbots in experiments; aggregating them makes it possible to identify the state of the art in chatbot usability experimentation. We analyzed more than 700 sources and retrieved 28 primary studies. Most experiments adopted a within-subjects design. However, few experiments provided raw data, and only one of the identified papers was part of a family of experiments. Effectiveness, efficiency, and satisfaction are the usability characteristics used to identify how well users can learn and use chatbots to achieve their goals and how satisfied they are during the interaction. Generally, the experimental results revealed that chatbots have several advantages (e.g., they provide real-time responses and improve ease of use) and some shortcomings (e.g., natural language processing, which is rated as the weakness most in need of improvement). This research offers an overview of chatbot usability experimentation. Interest in this area is very recent: works did not start to be published until 2018. Chatbot usability experiments should be more replicable to improve the reliability and transparency of the experimental results.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
