TY - GEN
T1 - MSDT
T2 - 6th IEEE International Conference on Universal Village, UV 2022
AU - Roh, Jaechul
AU - Cheng, Minhao
AU - Fang, Yajun
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Pre-trained language models allowed us to process downstream tasks with the help of fine-tuning, which aids the model to achieve fairly high accuracy in various Natural Language Processing (NLP) tasks. Such easily-downloaded language models from various websites empowered the public users as well as some major institutions to give a momentum to their real-life application. However, it was recently proven that models become extremely vulnerable when they are backdoor attacked with trigger-inserted poisoned datasets by malicious users. The attackers then redistribute the victim models to the public to attract other users to use them, where the models tend to misclassify when certain triggers are detected within the training sample. In this paper, we will introduce a novel improved textual backdoor defense method, named MSDT, that outperforms the current existing defensive algorithms in specific datasets. The experimental results illustrate that our method can be effective and constructive in terms of defending against backdoor attack in text domain.
AB - Pre-trained language models allowed us to process downstream tasks with the help of fine-tuning, which aids the model to achieve fairly high accuracy in various Natural Language Processing (NLP) tasks. Such easily-downloaded language models from various websites empowered the public users as well as some major institutions to give a momentum to their real-life application. However, it was recently proven that models become extremely vulnerable when they are backdoor attacked with trigger-inserted poisoned datasets by malicious users. The attackers then redistribute the victim models to the public to attract other users to use them, where the models tend to misclassify when certain triggers are detected within the training sample. In this paper, we will introduce a novel improved textual backdoor defense method, named MSDT, that outperforms the current existing defensive algorithms in specific datasets. The experimental results illustrate that our method can be effective and constructive in terms of defending against backdoor attack in text domain.
UR - http://www.scopus.com/inward/record.url?scp=85167779169&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85167779169&partnerID=8YFLogxK
U2 - 10.1109/UV56588.2022.10185524
DO - 10.1109/UV56588.2022.10185524
M3 - Conference contribution
AN - SCOPUS:85167779169
T3 - 6th IEEE International Conference on Universal Village, UV 2022
BT - 6th IEEE International Conference on Universal Village, UV 2022
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 22 October 2022 through 25 October 2022
ER -