Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models

We present a novel approach for timely classification and verification of network traffic using Gaussian Mixture Models (GMMs). We generate a separate GMM for each class of applications using component-wise expectation-maximization (CEM) to match the network traffic distribution generated by these a...

Full description

Bibliographic Details
Published in:IEEE Access
Main Authors: Hassan Alizadeh, Harald Vranken, Andre Zuquete, Ali Miri
Format: Article
Language:English
Published: IEEE 2020-01-01
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9086466/
_version_ 1852733614651867136
author Hassan Alizadeh
Harald Vranken
Andre Zuquete
Ali Miri
author_facet Hassan Alizadeh
Harald Vranken
Andre Zuquete
Ali Miri
author_sort Hassan Alizadeh
collection DOAJ
container_title IEEE Access
description We present a novel approach for timely classification and verification of network traffic using Gaussian Mixture Models (GMMs). We generate a separate GMM for each class of applications using component-wise expectation-maximization (CEM) to match the network traffic distribution generated by these applications. We apply our models for both traffic classification, where the goal is to identify the source application from which the traffic originates, by evaluating the maximum posterior probability, and for traffic verification, where the goal is to verify whether the application that claims to be the source of the traffic is as expected, by likelihood testing. Our models use only the first initial packets of truncated flows in order to provide more efficient and timely traffic classification and verification. This allows for triggering timely countermeasures before the end of flows. We demonstrate the effectiveness of our approach by experiments on a public dataset collected from a real network. Our traffic classification approach outperforms other state-of-the-art approaches that are based on machine learning, and achieves up to 97.7% flow classification accuracy when using only 9 first initial packets of flows. We show that 96.6% flow classification accuracy can still be obtained when training the GMMs using only 0.5% of all flows. Our traffic verification approach achieves a minimum Half Total Error Rate (HTER) of 7.65% when using only 6 first initial packets of flows.
format Article
id doaj-art-b368a888a3584e61ba246465327f80ca
institution Directory of Open Access Journals
issn 2169-3536
language English
publishDate 2020-01-01
publisher IEEE
record_format Article
spelling doaj-art-b368a888a3584e61ba246465327f80ca2025-08-19T21:07:33ZengIEEEIEEE Access2169-35362020-01-018912879130210.1109/ACCESS.2020.29925569086466Timely Classification and Verification of Network Traffic Using Gaussian Mixture ModelsHassan Alizadeh0Harald Vranken1https://orcid.org/0000-0003-4541-6475Andre Zuquete2Ali Miri3Department of Computer Science, Open Universiteit, Heerlen, The NetherlandsDepartment of Computer Science, Open Universiteit, Heerlen, The NetherlandsInstituto de Engenharia Electrónica e Informática de Aveiro (IEETA), University of Aveiro, Aveiro, PortugalDepartment of Computer Science, Ryerson University, Toronto, ON, CanadaWe present a novel approach for timely classification and verification of network traffic using Gaussian Mixture Models (GMMs). We generate a separate GMM for each class of applications using component-wise expectation-maximization (CEM) to match the network traffic distribution generated by these applications. We apply our models for both traffic classification, where the goal is to identify the source application from which the traffic originates, by evaluating the maximum posterior probability, and for traffic verification, where the goal is to verify whether the application that claims to be the source of the traffic is as expected, by likelihood testing. Our models use only the first initial packets of truncated flows in order to provide more efficient and timely traffic classification and verification. This allows for triggering timely countermeasures before the end of flows. We demonstrate the effectiveness of our approach by experiments on a public dataset collected from a real network. Our traffic classification approach outperforms other state-of-the-art approaches that are based on machine learning, and achieves up to 97.7% flow classification accuracy when using only 9 first initial packets of flows. We show that 96.6% flow classification accuracy can still be obtained when training the GMMs using only 0.5% of all flows. Our traffic verification approach achieves a minimum Half Total Error Rate (HTER) of 7.65% when using only 6 first initial packets of flows.https://ieeexplore.ieee.org/document/9086466/Gaussian mixture model (GMM)traffic classificationtraffic anomaly detection
spellingShingle Hassan Alizadeh
Harald Vranken
Andre Zuquete
Ali Miri
Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models
Gaussian mixture model (GMM)
traffic classification
traffic anomaly detection
title Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models
title_full Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models
title_fullStr Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models
title_full_unstemmed Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models
title_short Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models
title_sort timely classification and verification of network traffic using gaussian mixture models
topic Gaussian mixture model (GMM)
traffic classification
traffic anomaly detection
url https://ieeexplore.ieee.org/document/9086466/
work_keys_str_mv AT hassanalizadeh timelyclassificationandverificationofnetworktrafficusinggaussianmixturemodels
AT haraldvranken timelyclassificationandverificationofnetworktrafficusinggaussianmixturemodels
AT andrezuquete timelyclassificationandverificationofnetworktrafficusinggaussianmixturemodels
AT alimiri timelyclassificationandverificationofnetworktrafficusinggaussianmixturemodels