Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models

Hassan Alizadeh; Harald Vranken; Andre Zuquete; Ali Miri

doi:10.1109/ACCESS.2020.2992556

IEEE Access (Jan 2020)

Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models

Hassan Alizadeh,
Harald Vranken,
Andre Zuquete,
Ali Miri

Affiliations

Hassan Alizadeh: Department of Computer Science, Open Universiteit, Heerlen, The Netherlands
Harald Vranken: ORCiD; Department of Computer Science, Open Universiteit, Heerlen, The Netherlands
Andre Zuquete: Instituto de Engenharia Electrónica e Informática de Aveiro (IEETA), University of Aveiro, Aveiro, Portugal
Ali Miri: Department of Computer Science, Ryerson University, Toronto, ON, Canada

DOI: https://doi.org/10.1109/ACCESS.2020.2992556
Journal volume & issue: Vol. 8
pp. 91287 – 91302

Abstract

Read online

We present a novel approach for timely classification and verification of network traffic using Gaussian Mixture Models (GMMs). We generate a separate GMM for each class of applications using component-wise expectation-maximization (CEM) to match the network traffic distribution generated by these applications. We apply our models for both traffic classification, where the goal is to identify the source application from which the traffic originates, by evaluating the maximum posterior probability, and for traffic verification, where the goal is to verify whether the application that claims to be the source of the traffic is as expected, by likelihood testing. Our models use only the first initial packets of truncated flows in order to provide more efficient and timely traffic classification and verification. This allows for triggering timely countermeasures before the end of flows. We demonstrate the effectiveness of our approach by experiments on a public dataset collected from a real network. Our traffic classification approach outperforms other state-of-the-art approaches that are based on machine learning, and achieves up to 97.7% flow classification accuracy when using only 9 first initial packets of flows. We show that 96.6% flow classification accuracy can still be obtained when training the GMMs using only 0.5% of all flows. Our traffic verification approach achieves a minimum Half Total Error Rate (HTER) of 7.65% when using only 6 first initial packets of flows.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords