Randomized Purifier Based on Low Adversarial Transferability for Adversarial Defense

Deep neural networks are generally very vulnerable to adversarial attacks. In order to defend against adversarial attacks in classifiers, Adversarial Purification (AP) was developed to neutralize adversarial perturbations using a generative model at the input stage. AP has an advantage in that it ca...

詳細記述

書誌詳細
出版年:	IEEE Access
主要な著者:	Sangjin Park, Yoojin Jung, Byung Cheol Song
フォーマット:	論文
言語:	英語
出版事項:	IEEE 2024-01-01
主題:	Adversarial attack adversarial defense computer vision deep learning image classification security
オンライン･アクセス:	https://ieeexplore.ieee.org/document/10630788/

_version_	1849714679110172672
author	Sangjin Park Yoojin Jung Byung Cheol Song
author_facet	Sangjin Park Yoojin Jung Byung Cheol Song
author_sort	Sangjin Park
collection	DOAJ
container_title	IEEE Access
description	Deep neural networks are generally very vulnerable to adversarial attacks. In order to defend against adversarial attacks in classifiers, Adversarial Purification (AP) was developed to neutralize adversarial perturbations using a generative model at the input stage. AP has an advantage in that it can defend against various attacks without the additional training of a classifier. Recently, AP techniques using energy-based models or diffusion models have achieved meaningful robustness with a randomized defense based on a stochastic process. However, since they require a great number of diffusion steps or sampling steps in purifying attacked images, their computational cost is burdensome. To significantly reduce the computational cost while maintaining the performance of the randomized defense of AP, this paper proposes a novel randomized generative model called Randomized Purifier Based on Low Adversarial Transferability (RP-LAT). First, in order to select the components to be useful for randomization, we analyze the adversarial transferability according to the model components from the AP point of view. Then, based on this analysis, we replace the existing layers with a combination of components with low transferability, and randomly select the components of each layer during the forward pass. Experimental results prove that RP-LAT is computationally efficient and achieves state-of-the-art performance in terms of robustness against various types of attacks.
format	Article
id	doaj-art-30bcbd208fbd48fca32db7b3dae85f4d
institution	Directory of Open Access Journals
issn	2169-3536
language	English
publishDate	2024-01-01
publisher	IEEE
record_format	Article
spelling	doaj-art-30bcbd208fbd48fca32db7b3dae85f4d2025-08-20T01:55:39ZengIEEEIEEE Access2169-35362024-01-011210969010970110.1109/ACCESS.2024.344090910630788Randomized Purifier Based on Low Adversarial Transferability for Adversarial DefenseSangjin Park0Yoojin Jung1Byung Cheol Song2https://orcid.org/0000-0001-8742-3433Department of Electrical and Computer Engineering, Inha University, Incheon, Republic of KoreaDepartment of Electrical and Computer Engineering, Inha University, Incheon, Republic of KoreaDepartment of Electrical and Computer Engineering, Inha University, Incheon, Republic of KoreaDeep neural networks are generally very vulnerable to adversarial attacks. In order to defend against adversarial attacks in classifiers, Adversarial Purification (AP) was developed to neutralize adversarial perturbations using a generative model at the input stage. AP has an advantage in that it can defend against various attacks without the additional training of a classifier. Recently, AP techniques using energy-based models or diffusion models have achieved meaningful robustness with a randomized defense based on a stochastic process. However, since they require a great number of diffusion steps or sampling steps in purifying attacked images, their computational cost is burdensome. To significantly reduce the computational cost while maintaining the performance of the randomized defense of AP, this paper proposes a novel randomized generative model called Randomized Purifier Based on Low Adversarial Transferability (RP-LAT). First, in order to select the components to be useful for randomization, we analyze the adversarial transferability according to the model components from the AP point of view. Then, based on this analysis, we replace the existing layers with a combination of components with low transferability, and randomly select the components of each layer during the forward pass. Experimental results prove that RP-LAT is computationally efficient and achieves state-of-the-art performance in terms of robustness against various types of attacks.https://ieeexplore.ieee.org/document/10630788/Adversarial attackadversarial defensecomputer visiondeep learningimage classificationsecurity
spellingShingle	Sangjin Park Yoojin Jung Byung Cheol Song Randomized Purifier Based on Low Adversarial Transferability for Adversarial Defense Adversarial attack adversarial defense computer vision deep learning image classification security
title	Randomized Purifier Based on Low Adversarial Transferability for Adversarial Defense
title_full	Randomized Purifier Based on Low Adversarial Transferability for Adversarial Defense
title_fullStr	Randomized Purifier Based on Low Adversarial Transferability for Adversarial Defense
title_full_unstemmed	Randomized Purifier Based on Low Adversarial Transferability for Adversarial Defense
title_short	Randomized Purifier Based on Low Adversarial Transferability for Adversarial Defense
title_sort	randomized purifier based on low adversarial transferability for adversarial defense
topic	Adversarial attack adversarial defense computer vision deep learning image classification security
url	https://ieeexplore.ieee.org/document/10630788/
work_keys_str_mv	AT sangjinpark randomizedpurifierbasedonlowadversarialtransferabilityforadversarialdefense AT yoojinjung randomizedpurifierbasedonlowadversarialtransferabilityforadversarialdefense AT byungcheolsong randomizedpurifierbasedonlowadversarialtransferabilityforadversarialdefense

Randomized Purifier Based on Low Adversarial Transferability for Adversarial Defense

類似資料