Few-Shot Website Fingerprinting Attack with Data Augmentation
This work introduces a novel data augmentation method for few-shot website fingerprinting (WF) attack where only a handful of training samples per website are available for deep learning model optimization. Moving beyond earlier WF methods relying on manually-engineered feature representations, more...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi-Wiley
2021-01-01
|
Series: | Security and Communication Networks |
Online Access: | http://dx.doi.org/10.1155/2021/2840289 |
id |
doaj-cf215ebe49884e49be0e5f75d7d2c4d3 |
---|---|
record_format |
Article |
spelling |
doaj-cf215ebe49884e49be0e5f75d7d2c4d32021-09-27T00:52:03ZengHindawi-WileySecurity and Communication Networks1939-01222021-01-01202110.1155/2021/2840289Few-Shot Website Fingerprinting Attack with Data AugmentationMantun Chen0Yongjun Wang1Zhiquan Qin2Xiatian Zhu3College of ComputerCollege of ComputerCollege of ComputerUniversity of SurreyThis work introduces a novel data augmentation method for few-shot website fingerprinting (WF) attack where only a handful of training samples per website are available for deep learning model optimization. Moving beyond earlier WF methods relying on manually-engineered feature representations, more advanced deep learning alternatives demonstrate that learning feature representations automatically from training data is superior. Nonetheless, this advantage is subject to an unrealistic assumption that there exist many training samples per website, which otherwise will disappear. To address this, we introduce a model-agnostic, efficient, and harmonious data augmentation (HDA) method that can improve deep WF attacking methods significantly. HDA involves both intrasample and intersample data transformations that can be used in a harmonious manner to expand a tiny training dataset to an arbitrarily large collection, therefore effectively and explicitly addressing the intrinsic data scarcity problem. We conducted expensive experiments to validate our HDA for boosting state-of-the-art deep learning WF attack models in both closed-world and open-world attacking scenarios, at absence and presence of strong defense. For instance, in the more challenging and realistic evaluation scenario with WTF-PAD-based defense, our HDA method surpasses the previous state-of-the-art results by nearly 3% in classification accuracy in the 20-shot learning case. An earlier version of this work Chen et al. (2021) has been presented as preprint in ArXiv (https://arxiv.org/abs/2101.10063).http://dx.doi.org/10.1155/2021/2840289 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mantun Chen Yongjun Wang Zhiquan Qin Xiatian Zhu |
spellingShingle |
Mantun Chen Yongjun Wang Zhiquan Qin Xiatian Zhu Few-Shot Website Fingerprinting Attack with Data Augmentation Security and Communication Networks |
author_facet |
Mantun Chen Yongjun Wang Zhiquan Qin Xiatian Zhu |
author_sort |
Mantun Chen |
title |
Few-Shot Website Fingerprinting Attack with Data Augmentation |
title_short |
Few-Shot Website Fingerprinting Attack with Data Augmentation |
title_full |
Few-Shot Website Fingerprinting Attack with Data Augmentation |
title_fullStr |
Few-Shot Website Fingerprinting Attack with Data Augmentation |
title_full_unstemmed |
Few-Shot Website Fingerprinting Attack with Data Augmentation |
title_sort |
few-shot website fingerprinting attack with data augmentation |
publisher |
Hindawi-Wiley |
series |
Security and Communication Networks |
issn |
1939-0122 |
publishDate |
2021-01-01 |
description |
This work introduces a novel data augmentation method for few-shot website fingerprinting (WF) attack where only a handful of training samples per website are available for deep learning model optimization. Moving beyond earlier WF methods relying on manually-engineered feature representations, more advanced deep learning alternatives demonstrate that learning feature representations automatically from training data is superior. Nonetheless, this advantage is subject to an unrealistic assumption that there exist many training samples per website, which otherwise will disappear. To address this, we introduce a model-agnostic, efficient, and harmonious data augmentation (HDA) method that can improve deep WF attacking methods significantly. HDA involves both intrasample and intersample data transformations that can be used in a harmonious manner to expand a tiny training dataset to an arbitrarily large collection, therefore effectively and explicitly addressing the intrinsic data scarcity problem. We conducted expensive experiments to validate our HDA for boosting state-of-the-art deep learning WF attack models in both closed-world and open-world attacking scenarios, at absence and presence of strong defense. For instance, in the more challenging and realistic evaluation scenario with WTF-PAD-based defense, our HDA method surpasses the previous state-of-the-art results by nearly 3% in classification accuracy in the 20-shot learning case. An earlier version of this work Chen et al. (2021) has been presented as preprint in ArXiv (https://arxiv.org/abs/2101.10063). |
url |
http://dx.doi.org/10.1155/2021/2840289 |
work_keys_str_mv |
AT mantunchen fewshotwebsitefingerprintingattackwithdataaugmentation AT yongjunwang fewshotwebsitefingerprintingattackwithdataaugmentation AT zhiquanqin fewshotwebsitefingerprintingattackwithdataaugmentation AT xiatianzhu fewshotwebsitefingerprintingattackwithdataaugmentation |
_version_ |
1716867505852514304 |