Surrogate-guided adversarial attacks: enabling white-box methods in black-box scenarios

  • Dimitrios Christos Asimopoulos
  • , Panagiotis Radoglou-Grammatikis
  • , Panagiotis Fouliras
  • , Konstandinos Panitsidis
  • , Georgios Efstathopoulos
  • , Thomas Lagkas
  • , Vasileios Argyriou
  • , Igor Kotsiuba
  • , Panagiotis Sarigiannidis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Adversarial attacks pose significant threats to machine learning models, with white-box attacks such as Fast Gradient Sign Method (FGSM), Projected Gradient Descent (PGD), and Basic Iterative Method (BIM) achieving high success rates when model gradients are accessible. However, in real-world scenarios, direct access to model internals is often restricted, necessitating black-box attack strategies that typically suffer from lower effectiveness. In this work, we propose a novel approach to transform white-box attacks into black-box attacks by leveraging state-of-the-art surrogate models, including MultiLayer Perceptrons (MLP) and XGBoost (XGB). Our method involves training a surrogate model to mimic the decision boundaries of an inaccessible target model using pseudo-labeling, thereby enabling the application of gradient-based white-box attacks in a black-box setting. We systematically compare our approach against conventional black-box attacks, such as Zero Order Optimization (ZOO), evaluating their effectiveness in terms of attack success rates, transferability, and computational efficiency. The results demonstrate that surrogate-assisted attacks perform as good as standard black-box methods, bridging the performance gap between white-box and black-box adversarial attacks. This study highlights the power of surrogate models in enhancing adversarial transferability and provides insights into the robustness of different machine learning architectures against adversarial threats.

Original languageEnglish
Title of host publicationProceedings of the 2025 IEEE International Conference on Cyber Security and Resilience, CSR 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages950-956
Number of pages7
ISBN (Electronic)9798331535919
ISBN (Print)9798331535926
DOIs
Publication statusPublished - 26 Aug 2025
Event5th IEEE International Conference on Cyber Security and Resilience, CSR 2025 - Chania, Greece
Duration: 4 Aug 20256 Aug 2025

Publication series

NameProceedings of the 2025 IEEE International Conference on Cyber Security and Resilience, CSR 2025

Conference

Conference5th IEEE International Conference on Cyber Security and Resilience, CSR 2025
Country/TerritoryGreece
CityChania
Period4/08/256/08/25

Keywords

  • Adversarial attacks
  • Black-box
  • evasion
  • surrogate-model
  • transferability
  • white-box

Fingerprint

Dive into the research topics of 'Surrogate-guided adversarial attacks: enabling white-box methods in black-box scenarios'. Together they form a unique fingerprint.

Cite this