DLP: towards active defense against backdoor attacks with decoupled learning process

Abstract Deep learning models are well known to be susceptible to backdoor attack, where the attacker only needs to provide a tampered dataset on which the triggers are injected. Models trained on the dataset will passively implant the backdoor, and triggers on the input can mislead the models durin...

وصف كامل

التفاصيل البيبلوغرافية
الحاوية / القاعدة:Cybersecurity
المؤلفون الرئيسيون: Zonghao Ying, Bin Wu
التنسيق: مقال
اللغة:الإنجليزية
منشور في: SpringerOpen 2023-05-01
الموضوعات:
الوصول للمادة أونلاين:https://doi.org/10.1186/s42400-023-00141-4
_version_ 1851852078536720384
author Zonghao Ying
Bin Wu
author_facet Zonghao Ying
Bin Wu
author_sort Zonghao Ying
collection DOAJ
container_title Cybersecurity
description Abstract Deep learning models are well known to be susceptible to backdoor attack, where the attacker only needs to provide a tampered dataset on which the triggers are injected. Models trained on the dataset will passively implant the backdoor, and triggers on the input can mislead the models during testing. Our study shows that the model shows different learning behaviors in clean and poisoned subsets during training. Based on this observation, we propose a general training pipeline to defend against backdoor attacks actively. Benign models can be trained from the unreliable dataset by decoupling the learning process into three stages, i.e., supervised learning, active unlearning, and active semi-supervised fine-tuning. The effectiveness of our approach has been shown in numerous experiments across various backdoor attacks and datasets.
format Article
id doaj-art-87dc5eeea7ea4ca8a96f79b30a1e07eb
institution Directory of Open Access Journals
issn 2523-3246
language English
publishDate 2023-05-01
publisher SpringerOpen
record_format Article
spelling doaj-art-87dc5eeea7ea4ca8a96f79b30a1e07eb2025-08-19T22:24:10ZengSpringerOpenCybersecurity2523-32462023-05-016111310.1186/s42400-023-00141-4DLP: towards active defense against backdoor attacks with decoupled learning processZonghao Ying0Bin Wu1State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of SciencesState Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of SciencesAbstract Deep learning models are well known to be susceptible to backdoor attack, where the attacker only needs to provide a tampered dataset on which the triggers are injected. Models trained on the dataset will passively implant the backdoor, and triggers on the input can mislead the models during testing. Our study shows that the model shows different learning behaviors in clean and poisoned subsets during training. Based on this observation, we propose a general training pipeline to defend against backdoor attacks actively. Benign models can be trained from the unreliable dataset by decoupling the learning process into three stages, i.e., supervised learning, active unlearning, and active semi-supervised fine-tuning. The effectiveness of our approach has been shown in numerous experiments across various backdoor attacks and datasets.https://doi.org/10.1186/s42400-023-00141-4Deep learningBackdoor attackActive defense
spellingShingle Zonghao Ying
Bin Wu
DLP: towards active defense against backdoor attacks with decoupled learning process
Deep learning
Backdoor attack
Active defense
title DLP: towards active defense against backdoor attacks with decoupled learning process
title_full DLP: towards active defense against backdoor attacks with decoupled learning process
title_fullStr DLP: towards active defense against backdoor attacks with decoupled learning process
title_full_unstemmed DLP: towards active defense against backdoor attacks with decoupled learning process
title_short DLP: towards active defense against backdoor attacks with decoupled learning process
title_sort dlp towards active defense against backdoor attacks with decoupled learning process
topic Deep learning
Backdoor attack
Active defense
url https://doi.org/10.1186/s42400-023-00141-4
work_keys_str_mv AT zonghaoying dlptowardsactivedefenseagainstbackdoorattackswithdecoupledlearningprocess
AT binwu dlptowardsactivedefenseagainstbackdoorattackswithdecoupledlearningprocess