Pre-trained Encoders in Self-Supervised Learning Improve Secure and Privacy-preserving Supervised Learning

  • Hongbin Liu
  • , Wenjie Qu
  • , Jinyuan Jia
  • , Neil Zhenqiang Gong

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

Classifiers in supervised learning have various security and privacy issues, e.g., 1) data poisoning attacks, backdoor attacks, and adversarial exampleson the security side as well as 2) inference attacksto the training data on the privacy side. Various secure and privacy-preserving supervised learning algorithms with formal guarantees have been proposed to address these issues. However, they suffer from various limitations such as accuracy loss, small certified security guarantees, and/or inefficiency. Self-supervised learning pre-trains encoders using unlabeled data. Given a pre-trained encoder as a feature extractor, supervised learning can train a simple yet accurate classifier using a small amount of labeled training data. In this work, we perform the first systematic, principled measurement study to understand whether and when a pre-trained encoder can address the limitations of secure or privacy-preserving supervised learning algorithms. Our key findings are that a pre-trained encoder substantially improves 1) both accuracy under no attacks and certified security guarantees against data poisoning and backdoor attacks of state-of-the-art secure learning algorithms (i.e., bagging and KNN), 2) certified security guarantees of randomized smoothing against adversarial examples without sacrificing its accuracy under no attacks, 3) accuracy of differentially private classifiers.

Original languageEnglish (US)
Title of host publicationProceedings - 45th IEEE Symposium on Security and Privacy Workshops, SPW 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages144-156
Number of pages13
ISBN (Electronic)9798350354874
DOIs
StatePublished - 2024
Event45th IEEE Symposium on Security and Privacy Workshops, SPW 2024 - San Francisco, United States
Duration: May 23 2024 → …

Publication series

NameProceedings - 45th IEEE Symposium on Security and Privacy Workshops, SPW 2024

Conference

Conference45th IEEE Symposium on Security and Privacy Workshops, SPW 2024
Country/TerritoryUnited States
CitySan Francisco
Period5/23/24 → …

All Science Journal Classification (ASJC) codes

  • Communication
  • Artificial Intelligence
  • Computer Networks and Communications
  • Information Systems
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'Pre-trained Encoders in Self-Supervised Learning Improve Secure and Privacy-preserving Supervised Learning'. Together they form a unique fingerprint.

Cite this