DLP: towards active defense against backdoor attacks with decoupled learning process

Abstract Deep learning models are well known to be susceptible to backdoor attack, where the attacker only needs to provide a tampered dataset on which the triggers are injected. Models trained on the dataset will passively implant the backdoor, and triggers on the input can mislead the models durin...

وصف كامل

التفاصيل البيبلوغرافية
الحاوية / القاعدة:	Cybersecurity
المؤلفون الرئيسيون:	Zonghao Ying, Bin Wu
التنسيق:	مقال
اللغة:	الإنجليزية
منشور في:	SpringerOpen 2023-05-01
الموضوعات:	Deep learning Backdoor attack Active defense
الوصول للمادة أونلاين:	https://doi.org/10.1186/s42400-023-00141-4

_version_	1851852078536720384
author	Zonghao Ying Bin Wu
author_facet	Zonghao Ying Bin Wu
author_sort	Zonghao Ying
collection	DOAJ
container_title	Cybersecurity
description	Abstract Deep learning models are well known to be susceptible to backdoor attack, where the attacker only needs to provide a tampered dataset on which the triggers are injected. Models trained on the dataset will passively implant the backdoor, and triggers on the input can mislead the models during testing. Our study shows that the model shows different learning behaviors in clean and poisoned subsets during training. Based on this observation, we propose a general training pipeline to defend against backdoor attacks actively. Benign models can be trained from the unreliable dataset by decoupling the learning process into three stages, i.e., supervised learning, active unlearning, and active semi-supervised fine-tuning. The effectiveness of our approach has been shown in numerous experiments across various backdoor attacks and datasets.
format	Article
id	doaj-art-87dc5eeea7ea4ca8a96f79b30a1e07eb
institution	Directory of Open Access Journals
issn	2523-3246
language	English
publishDate	2023-05-01
publisher	SpringerOpen
record_format	Article
spelling	doaj-art-87dc5eeea7ea4ca8a96f79b30a1e07eb2025-08-19T22:24:10ZengSpringerOpenCybersecurity2523-32462023-05-016111310.1186/s42400-023-00141-4DLP: towards active defense against backdoor attacks with decoupled learning processZonghao Ying0Bin Wu1State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of SciencesState Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of SciencesAbstract Deep learning models are well known to be susceptible to backdoor attack, where the attacker only needs to provide a tampered dataset on which the triggers are injected. Models trained on the dataset will passively implant the backdoor, and triggers on the input can mislead the models during testing. Our study shows that the model shows different learning behaviors in clean and poisoned subsets during training. Based on this observation, we propose a general training pipeline to defend against backdoor attacks actively. Benign models can be trained from the unreliable dataset by decoupling the learning process into three stages, i.e., supervised learning, active unlearning, and active semi-supervised fine-tuning. The effectiveness of our approach has been shown in numerous experiments across various backdoor attacks and datasets.https://doi.org/10.1186/s42400-023-00141-4Deep learningBackdoor attackActive defense
spellingShingle	Zonghao Ying Bin Wu DLP: towards active defense against backdoor attacks with decoupled learning process Deep learning Backdoor attack Active defense
title	DLP: towards active defense against backdoor attacks with decoupled learning process
title_full	DLP: towards active defense against backdoor attacks with decoupled learning process
title_fullStr	DLP: towards active defense against backdoor attacks with decoupled learning process
title_full_unstemmed	DLP: towards active defense against backdoor attacks with decoupled learning process
title_short	DLP: towards active defense against backdoor attacks with decoupled learning process
title_sort	dlp towards active defense against backdoor attacks with decoupled learning process
topic	Deep learning Backdoor attack Active defense
url	https://doi.org/10.1186/s42400-023-00141-4
work_keys_str_mv	AT zonghaoying dlptowardsactivedefenseagainstbackdoorattackswithdecoupledlearningprocess AT binwu dlptowardsactivedefenseagainstbackdoorattackswithdecoupledlearningprocess

DLP: towards active defense against backdoor attacks with decoupled learning process

مواد مشابهة