Few-Shot Website Fingerprinting Attack with Data Augmentation

This work introduces a novel data augmentation method for few-shot website fingerprinting (WF) attack where only a handful of training samples per website are available for deep learning model optimization. Moving beyond earlier WF methods relying on manually-engineered feature representations, more...

Full description

Bibliographic Details
Main Authors:	Mantun Chen, Yongjun Wang, Zhiquan Qin, Xiatian Zhu
Format:	Article
Language:	English
Published:	Hindawi-Wiley 2021-01-01
Series:	Security and Communication Networks
Online Access:	http://dx.doi.org/10.1155/2021/2840289

id	doaj-cf215ebe49884e49be0e5f75d7d2c4d3
record_format	Article
spelling	doaj-cf215ebe49884e49be0e5f75d7d2c4d32021-09-27T00:52:03ZengHindawi-WileySecurity and Communication Networks1939-01222021-01-01202110.1155/2021/2840289Few-Shot Website Fingerprinting Attack with Data AugmentationMantun Chen0Yongjun Wang1Zhiquan Qin2Xiatian Zhu3College of ComputerCollege of ComputerCollege of ComputerUniversity of SurreyThis work introduces a novel data augmentation method for few-shot website fingerprinting (WF) attack where only a handful of training samples per website are available for deep learning model optimization. Moving beyond earlier WF methods relying on manually-engineered feature representations, more advanced deep learning alternatives demonstrate that learning feature representations automatically from training data is superior. Nonetheless, this advantage is subject to an unrealistic assumption that there exist many training samples per website, which otherwise will disappear. To address this, we introduce a model-agnostic, efficient, and harmonious data augmentation (HDA) method that can improve deep WF attacking methods significantly. HDA involves both intrasample and intersample data transformations that can be used in a harmonious manner to expand a tiny training dataset to an arbitrarily large collection, therefore effectively and explicitly addressing the intrinsic data scarcity problem. We conducted expensive experiments to validate our HDA for boosting state-of-the-art deep learning WF attack models in both closed-world and open-world attacking scenarios, at absence and presence of strong defense. For instance, in the more challenging and realistic evaluation scenario with WTF-PAD-based defense, our HDA method surpasses the previous state-of-the-art results by nearly 3% in classification accuracy in the 20-shot learning case. An earlier version of this work Chen et al. (2021) has been presented as preprint in ArXiv (https://arxiv.org/abs/2101.10063).http://dx.doi.org/10.1155/2021/2840289
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Mantun Chen Yongjun Wang Zhiquan Qin Xiatian Zhu
spellingShingle	Mantun Chen Yongjun Wang Zhiquan Qin Xiatian Zhu Few-Shot Website Fingerprinting Attack with Data Augmentation Security and Communication Networks
author_facet	Mantun Chen Yongjun Wang Zhiquan Qin Xiatian Zhu
author_sort	Mantun Chen
title	Few-Shot Website Fingerprinting Attack with Data Augmentation
title_short	Few-Shot Website Fingerprinting Attack with Data Augmentation
title_full	Few-Shot Website Fingerprinting Attack with Data Augmentation
title_fullStr	Few-Shot Website Fingerprinting Attack with Data Augmentation
title_full_unstemmed	Few-Shot Website Fingerprinting Attack with Data Augmentation
title_sort	few-shot website fingerprinting attack with data augmentation
publisher	Hindawi-Wiley
series	Security and Communication Networks
issn	1939-0122
publishDate	2021-01-01
description	This work introduces a novel data augmentation method for few-shot website fingerprinting (WF) attack where only a handful of training samples per website are available for deep learning model optimization. Moving beyond earlier WF methods relying on manually-engineered feature representations, more advanced deep learning alternatives demonstrate that learning feature representations automatically from training data is superior. Nonetheless, this advantage is subject to an unrealistic assumption that there exist many training samples per website, which otherwise will disappear. To address this, we introduce a model-agnostic, efficient, and harmonious data augmentation (HDA) method that can improve deep WF attacking methods significantly. HDA involves both intrasample and intersample data transformations that can be used in a harmonious manner to expand a tiny training dataset to an arbitrarily large collection, therefore effectively and explicitly addressing the intrinsic data scarcity problem. We conducted expensive experiments to validate our HDA for boosting state-of-the-art deep learning WF attack models in both closed-world and open-world attacking scenarios, at absence and presence of strong defense. For instance, in the more challenging and realistic evaluation scenario with WTF-PAD-based defense, our HDA method surpasses the previous state-of-the-art results by nearly 3% in classification accuracy in the 20-shot learning case. An earlier version of this work Chen et al. (2021) has been presented as preprint in ArXiv (https://arxiv.org/abs/2101.10063).
url	http://dx.doi.org/10.1155/2021/2840289
work_keys_str_mv	AT mantunchen fewshotwebsitefingerprintingattackwithdataaugmentation AT yongjunwang fewshotwebsitefingerprintingattackwithdataaugmentation AT zhiquanqin fewshotwebsitefingerprintingattackwithdataaugmentation AT xiatianzhu fewshotwebsitefingerprintingattackwithdataaugmentation
_version_	1716867505852514304

Few-Shot Website Fingerprinting Attack with Data Augmentation

Similar Items