Data in Brief (Apr 2022)

Dataset of network simulator related-question posts in stack overflow

  • Yusuf Sulistyo Nugroho,
  • Syful Islam,
  • Dedi Gunawan,
  • Yogiek Indra Kurniawan,
  • Md. Javed Hossain

Journal volume & issue
Vol. 41
p. 107942

Abstract

Read online

Although the use of network simulator (NS) in predicting the behavior of computer networks has increased, the users often face a variety of challenges and share them on Stack Overflow (SO). However, the challenges that users deal with have not been studied. This paper presents an NS discussion dataset extracted from SOTorrent, which consists of 2,322 NS-related question posts spanning 17 features. The process of data collection was conducted in five steps, including filtering initial post dataset using simulator tags, discovering NS-related tags, collecting the tagged posts, extracting the posts title and preprocessing for LDA (Latent Dirichlet Allocation), and finally applying the LDA topic modeling to obtain the NS posts clustered into eight different topic names. We believe that this dataset will help research community in highlighting issues faced by NS users.

Keywords