PeerJ Computer Science (Aug 2022)

Automatic computer science domain multiple-choice questions generation based on informative sentences

  • Farah Maheen,
  • Muhammad Asif,
  • Haseeb Ahmad,
  • Shahbaz Ahmad,
  • Fahad Alturise,
  • Othman Asiry,
  • Yazeed Yasin Ghadi

DOI
https://doi.org/10.7717/peerj-cs.1010
Journal volume & issue
Vol. 8
p. e1010

Abstract

Read online Read online

Students require continuous feedback for effective learning. Multiple choice questions (MCQs) are extensively used among various assessment methods to provide such feedback. However, manual MCQ generation is a tedious task that requires significant effort, time, and domain knowledge. Therefore, a system must be present that can automatically generate MCQs from the given text. The automatic generation of MCQs can be carried out by following three sequential steps: extracting informative sentences from the textual data, identifying the key, and determining distractors. The dataset comprising of various topics from the 9th and 11th-grade computer science course books are used in this work. Moreover, TF-IDF, Jaccard similarity, quality phrase mining, K-means, and bidirectional encoder representation from transformers techniques are utilized for automatic MCQs generation. Domain experts validated the generated MCQs with 83%, 77%, and 80% accuracy, key generation, and distractor generation, respectively. The overall MCQ generation achieved 80% accuracy through this system by the experts. Finally, a desktop app was developed that takes the contents in textual form as input, processes it at the backend, and visualizes the generated MCQs on the interface. The presented solution may help teachers, students, and other stakeholders with automatic MCQ generation.

Keywords