Automatic computer science domain multiple-choice questions generation based on informative sentences

Farah Maheen; Muhammad Asif; Haseeb Ahmad; Shahbaz Ahmad; Fahad Alturise; Othman Asiry; Yazeed Yasin Ghadi

doi:10.7717/peerj-cs.1010

PeerJ Computer Science (Aug 2022)

Automatic computer science domain multiple-choice questions generation based on informative sentences

Farah Maheen,
Muhammad Asif,
Haseeb Ahmad,
Shahbaz Ahmad,
Fahad Alturise,
Othman Asiry,
Yazeed Yasin Ghadi

Affiliations

Farah Maheen: Department of Computer Science, National Textile University, Faisalabad, Pakistan
Muhammad Asif: Department of Computer Science, National Textile University, Faisalabad, Pakistan
Haseeb Ahmad: Department of Computer Science, National Textile University, Faisalabad, Pakistan
Shahbaz Ahmad: Department of Computer Science, National Textile University, Faisalabad, Pakistan
Fahad Alturise: Department of Computer, College of Science and Arts in Ar Rass, Qassim University, Ar Rass, Qassim, Saudi Arabia
Othman Asiry: Department of Information Technology, College of Computing and Information Technology at Khulais, University of Jeddah, Jeddah, Saudi Arabia
Yazeed Yasin Ghadi: Department of Computer Science/Software Engineering, Al Ain University, Abu Dhabi, UAE

DOI: https://doi.org/10.7717/peerj-cs.1010
Journal volume & issue: Vol. 8
p. e1010

Abstract

Read online Read online

Students require continuous feedback for effective learning. Multiple choice questions (MCQs) are extensively used among various assessment methods to provide such feedback. However, manual MCQ generation is a tedious task that requires significant effort, time, and domain knowledge. Therefore, a system must be present that can automatically generate MCQs from the given text. The automatic generation of MCQs can be carried out by following three sequential steps: extracting informative sentences from the textual data, identifying the key, and determining distractors. The dataset comprising of various topics from the 9th and 11th-grade computer science course books are used in this work. Moreover, TF-IDF, Jaccard similarity, quality phrase mining, K-means, and bidirectional encoder representation from transformers techniques are utilized for automatic MCQs generation. Domain experts validated the generated MCQs with 83%, 77%, and 80% accuracy, key generation, and distractor generation, respectively. The overall MCQ generation achieved 80% accuracy through this system by the experts. Finally, a desktop app was developed that takes the contents in textual form as input, processes it at the backend, and visualizes the generated MCQs on the interface. The presented solution may help teachers, students, and other stakeholders with automatic MCQ generation.

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/

About the journal

Abstract

Keywords