Few-Shot Website Fingerprinting Attack with Data Augmentation

This work introduces a novel data augmentation method for few-shot website fingerprinting (WF) attack where only a handful of training samples per website are available for deep learning model optimization. Moving beyond earlier WF methods relying on manually-engineered feature representations, more...

Full description

Bibliographic Details
Main Authors: Mantun Chen, Yongjun Wang, Zhiquan Qin, Xiatian Zhu
Format: Article
Language:English
Published: Hindawi-Wiley 2021-01-01
Series:Security and Communication Networks
Online Access:http://dx.doi.org/10.1155/2021/2840289
id doaj-cf215ebe49884e49be0e5f75d7d2c4d3
record_format Article
spelling doaj-cf215ebe49884e49be0e5f75d7d2c4d32021-09-27T00:52:03ZengHindawi-WileySecurity and Communication Networks1939-01222021-01-01202110.1155/2021/2840289Few-Shot Website Fingerprinting Attack with Data AugmentationMantun Chen0Yongjun Wang1Zhiquan Qin2Xiatian Zhu3College of ComputerCollege of ComputerCollege of ComputerUniversity of SurreyThis work introduces a novel data augmentation method for few-shot website fingerprinting (WF) attack where only a handful of training samples per website are available for deep learning model optimization. Moving beyond earlier WF methods relying on manually-engineered feature representations, more advanced deep learning alternatives demonstrate that learning feature representations automatically from training data is superior. Nonetheless, this advantage is subject to an unrealistic assumption that there exist many training samples per website, which otherwise will disappear. To address this, we introduce a model-agnostic, efficient, and harmonious data augmentation (HDA) method that can improve deep WF attacking methods significantly. HDA involves both intrasample and intersample data transformations that can be used in a harmonious manner to expand a tiny training dataset to an arbitrarily large collection, therefore effectively and explicitly addressing the intrinsic data scarcity problem. We conducted expensive experiments to validate our HDA for boosting state-of-the-art deep learning WF attack models in both closed-world and open-world attacking scenarios, at absence and presence of strong defense. For instance, in the more challenging and realistic evaluation scenario with WTF-PAD-based defense, our HDA method surpasses the previous state-of-the-art results by nearly 3% in classification accuracy in the 20-shot learning case. An earlier version of this work Chen et al. (2021) has been presented as preprint in ArXiv (https://arxiv.org/abs/2101.10063).http://dx.doi.org/10.1155/2021/2840289
collection DOAJ
language English
format Article
sources DOAJ
author Mantun Chen
Yongjun Wang
Zhiquan Qin
Xiatian Zhu
spellingShingle Mantun Chen
Yongjun Wang
Zhiquan Qin
Xiatian Zhu
Few-Shot Website Fingerprinting Attack with Data Augmentation
Security and Communication Networks
author_facet Mantun Chen
Yongjun Wang
Zhiquan Qin
Xiatian Zhu
author_sort Mantun Chen
title Few-Shot Website Fingerprinting Attack with Data Augmentation
title_short Few-Shot Website Fingerprinting Attack with Data Augmentation
title_full Few-Shot Website Fingerprinting Attack with Data Augmentation
title_fullStr Few-Shot Website Fingerprinting Attack with Data Augmentation
title_full_unstemmed Few-Shot Website Fingerprinting Attack with Data Augmentation
title_sort few-shot website fingerprinting attack with data augmentation
publisher Hindawi-Wiley
series Security and Communication Networks
issn 1939-0122
publishDate 2021-01-01
description This work introduces a novel data augmentation method for few-shot website fingerprinting (WF) attack where only a handful of training samples per website are available for deep learning model optimization. Moving beyond earlier WF methods relying on manually-engineered feature representations, more advanced deep learning alternatives demonstrate that learning feature representations automatically from training data is superior. Nonetheless, this advantage is subject to an unrealistic assumption that there exist many training samples per website, which otherwise will disappear. To address this, we introduce a model-agnostic, efficient, and harmonious data augmentation (HDA) method that can improve deep WF attacking methods significantly. HDA involves both intrasample and intersample data transformations that can be used in a harmonious manner to expand a tiny training dataset to an arbitrarily large collection, therefore effectively and explicitly addressing the intrinsic data scarcity problem. We conducted expensive experiments to validate our HDA for boosting state-of-the-art deep learning WF attack models in both closed-world and open-world attacking scenarios, at absence and presence of strong defense. For instance, in the more challenging and realistic evaluation scenario with WTF-PAD-based defense, our HDA method surpasses the previous state-of-the-art results by nearly 3% in classification accuracy in the 20-shot learning case. An earlier version of this work Chen et al. (2021) has been presented as preprint in ArXiv (https://arxiv.org/abs/2101.10063).
url http://dx.doi.org/10.1155/2021/2840289
work_keys_str_mv AT mantunchen fewshotwebsitefingerprintingattackwithdataaugmentation
AT yongjunwang fewshotwebsitefingerprintingattackwithdataaugmentation
AT zhiquanqin fewshotwebsitefingerprintingattackwithdataaugmentation
AT xiatianzhu fewshotwebsitefingerprintingattackwithdataaugmentation
_version_ 1716867505852514304