BMC Research Notes (Feb 2024)

ITC-net-audio-5: an audio streaming dataset for application identification in network traffic classification

  • Mohammad Nikbakht,
  • Mehdi Teimouri

DOI
https://doi.org/10.1186/s13104-024-06718-7
Journal volume & issue
Vol. 17, no. 1
pp. 1 – 3

Abstract

Read online

Abstract Objectives An essential aspect of network traffic classification is application identification. This involves capturing and analyzing the traffic patterns of applications. There are a few publicly available datasets that specifically capture streaming data from network-based applications. Therefore, our objective is to generate an up-to-date dataset with a focus on audio streaming data. This dataset can be a valuable resource for identifying audio streaming applications in the field of network traffic classification. Data description The dataset contains network traffic captured during audio streaming communications on five trending applications: Google Meet, Skype, Telegram, WhatsApp, and SoundCloud. It includes 500 files in PCAP format captured by Wireshark and PCAPdroid tools during voice calls and online music playback. The concurrent utilization of these tools facilitates the avoidance of capturing background traffic.

Keywords