DLP: towards active defense against backdoor attacks with decoupled learning process
Abstract Deep learning models are well known to be susceptible to backdoor attack, where the attacker only needs to provide a tampered dataset on which the triggers are injected. Models trained on the dataset will passively implant the backdoor, and triggers on the input can mislead the models durin...
| الحاوية / القاعدة: | Cybersecurity |
|---|---|
| المؤلفون الرئيسيون: | , |
| التنسيق: | مقال |
| اللغة: | الإنجليزية |
| منشور في: |
SpringerOpen
2023-05-01
|
| الموضوعات: | |
| الوصول للمادة أونلاين: | https://doi.org/10.1186/s42400-023-00141-4 |
| _version_ | 1851852078536720384 |
|---|---|
| author | Zonghao Ying Bin Wu |
| author_facet | Zonghao Ying Bin Wu |
| author_sort | Zonghao Ying |
| collection | DOAJ |
| container_title | Cybersecurity |
| description | Abstract Deep learning models are well known to be susceptible to backdoor attack, where the attacker only needs to provide a tampered dataset on which the triggers are injected. Models trained on the dataset will passively implant the backdoor, and triggers on the input can mislead the models during testing. Our study shows that the model shows different learning behaviors in clean and poisoned subsets during training. Based on this observation, we propose a general training pipeline to defend against backdoor attacks actively. Benign models can be trained from the unreliable dataset by decoupling the learning process into three stages, i.e., supervised learning, active unlearning, and active semi-supervised fine-tuning. The effectiveness of our approach has been shown in numerous experiments across various backdoor attacks and datasets. |
| format | Article |
| id | doaj-art-87dc5eeea7ea4ca8a96f79b30a1e07eb |
| institution | Directory of Open Access Journals |
| issn | 2523-3246 |
| language | English |
| publishDate | 2023-05-01 |
| publisher | SpringerOpen |
| record_format | Article |
| spelling | doaj-art-87dc5eeea7ea4ca8a96f79b30a1e07eb2025-08-19T22:24:10ZengSpringerOpenCybersecurity2523-32462023-05-016111310.1186/s42400-023-00141-4DLP: towards active defense against backdoor attacks with decoupled learning processZonghao Ying0Bin Wu1State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of SciencesState Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of SciencesAbstract Deep learning models are well known to be susceptible to backdoor attack, where the attacker only needs to provide a tampered dataset on which the triggers are injected. Models trained on the dataset will passively implant the backdoor, and triggers on the input can mislead the models during testing. Our study shows that the model shows different learning behaviors in clean and poisoned subsets during training. Based on this observation, we propose a general training pipeline to defend against backdoor attacks actively. Benign models can be trained from the unreliable dataset by decoupling the learning process into three stages, i.e., supervised learning, active unlearning, and active semi-supervised fine-tuning. The effectiveness of our approach has been shown in numerous experiments across various backdoor attacks and datasets.https://doi.org/10.1186/s42400-023-00141-4Deep learningBackdoor attackActive defense |
| spellingShingle | Zonghao Ying Bin Wu DLP: towards active defense against backdoor attacks with decoupled learning process Deep learning Backdoor attack Active defense |
| title | DLP: towards active defense against backdoor attacks with decoupled learning process |
| title_full | DLP: towards active defense against backdoor attacks with decoupled learning process |
| title_fullStr | DLP: towards active defense against backdoor attacks with decoupled learning process |
| title_full_unstemmed | DLP: towards active defense against backdoor attacks with decoupled learning process |
| title_short | DLP: towards active defense against backdoor attacks with decoupled learning process |
| title_sort | dlp towards active defense against backdoor attacks with decoupled learning process |
| topic | Deep learning Backdoor attack Active defense |
| url | https://doi.org/10.1186/s42400-023-00141-4 |
| work_keys_str_mv | AT zonghaoying dlptowardsactivedefenseagainstbackdoorattackswithdecoupledlearningprocess AT binwu dlptowardsactivedefenseagainstbackdoorattackswithdecoupledlearningprocess |
