paper: https://arxiv.org/pdf/2410.02240
Unrestricted adversarial attacks typically manipulate the semantic content of an image (e.g., color or texture) to create adversarial examples that are both effective and photorealistic. Recent works have utilized the diffusion inversion process to map images into a latent space, where high-level semantics are manipulated by introducing perturbations. However, these approaches often result in substantial semantic distortions in the denoised output and suffer from low efficiency. In this study, we propose a novel framework called Semantic-Consistent Unrestricted Adversarial Attacks (SCA), which employs an inversion method to extract edit-friendly noise maps and utilizes a Multimodal Large Language Model (MLLM) to provide semantic guidance throughout the process. Conditioned on the rich semantic information provided by the MLLM, we perform each step of the DDPM denoising process using a series of edit-friendly noise maps, and leverage DPM-Solver++ to accelerate this process, enabling efficient sampling with semantic consistency. Compared to existing methods, our framework enables the efficient generation of adversarial examples that exhibit minimal discernible semantic changes. Consequently, we introduce, for the first time, Semantic-Consistent Adversarial Examples (SCAE). Extensive experiments and visualizations demonstrate the high efficiency of SCA, which is on average 12 times faster than state-of-the-art attacks.
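As a rough, self-contained illustration of what denoising with edit-friendly noise maps looks like, here is a minimal PyTorch sketch. The noise schedule, image size, and `DummyEpsModel` are stand-in assumptions so the snippet runs on its own; the actual method uses a pretrained diffusion model, and the DPM-Solver++ acceleration is omitted for brevity.

```python
import torch

# Toy stand-in for the pretrained epsilon-prediction diffusion model;
# any network with signature eps = model(x_t, t) could be dropped in.
class DummyEpsModel(torch.nn.Module):
    def forward(self, x, t):
        return torch.zeros_like(x)

T = 50
betas = torch.linspace(1e-4, 0.02, T)
alphas = 1.0 - betas
alpha_bars = torch.cumprod(alphas, dim=0)

def posterior_mean(x_t, t, model):
    # Standard DDPM posterior mean, written via the noise prediction.
    eps = model(x_t, torch.tensor(t))
    return (x_t - betas[t] / (1 - alpha_bars[t]).sqrt() * eps) / alphas[t].sqrt()

@torch.no_grad()
def semantic_fixation_inversion(x0, model):
    # Sample each x_t independently from q(x_t | x0), so consecutive
    # latents are decoupled; this is what "imprints" the image onto
    # the noise maps (edit-friendly inversion, greatly simplified).
    xs = [x0]
    for t in range(T):
        eps = torch.randn_like(x0)
        xs.append(alpha_bars[t].sqrt() * x0 + (1 - alpha_bars[t]).sqrt() * eps)
    # Solve each reverse step for the noise map z_t taking x_t -> x_{t-1}.
    zs = []
    for t in range(T, 0, -1):
        sigma = betas[t - 1].sqrt()
        zs.append((xs[t - 1] - posterior_mean(xs[t], t - 1, model)) / sigma)
    return xs[T], zs

def denoise_with_maps(x_T, zs, model):
    # Reverse DDPM pass reusing the fixed noise maps; with an unperturbed
    # latent this reproduces the source image. Kept differentiable so the
    # attack stage can backpropagate through it.
    x = x_T
    for i, t in enumerate(range(T, 0, -1)):
        x = posterior_mean(x, t - 1, model) + betas[t - 1].sqrt() * zs[i]
    return x

model = DummyEpsModel()
x0 = torch.randn(1, 3, 32, 32)  # stand-in "image"
x_T, zs = semantic_fixation_inversion(x0, model)
print(torch.allclose(denoise_with_maps(x_T, zs, model), x0, atol=1e-4))
```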
The core idea of Semantic-Consistent Unrestricted Adversarial Attack is to enhance semantic control throughout the entire generation process of unrestricted adversarial examples. This is achieved by introducing a novel inversion method that "imprints" the image more strongly onto the noise maps. In addition, the powerful semantic guidance provided by MLLMs restricts the direction of perturbations in the latent space, steering the clean image toward adversarial changes that remain imperceptible and natural. Specifically, our method consists of two parts: Semantic Fixation Inversion and Semantically Guided Perturbation. We first map the clean image into a latent space through Semantic Fixation Inversion, and then iteratively optimize the adversarial objective in that latent space under semantic guidance, shifting the image content toward deceiving the target model until the attack succeeds.
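Below is a hedged sketch of what the Semantically Guided Perturbation loop could look like, reusing `denoise_with_maps` and the variables from the snippet above. The `classifier`, the loss weight `lam`, and the `semantic_loss` placeholder are illustrative assumptions, not the paper's exact MLLM-based objective.

```python
import torch
import torch.nn.functional as F

def semantic_loss(x_adv):
    # Hypothetical placeholder for the MLLM guidance term: a real
    # implementation would penalize divergence between x_adv and the
    # MLLM's description of the clean image (e.g., an embedding
    # distance). Returning 0 keeps the sketch self-contained.
    return x_adv.new_zeros(())

def semantically_guided_attack(x_T, zs, model, classifier, label,
                               steps=30, lr=1e-2, lam=1.0):
    # Optimize a perturbation of the inverted latent; the fixed noise
    # maps zs anchor the denoised output to the clean image's content.
    delta = torch.zeros_like(x_T, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)
    for _ in range(steps):
        x_adv = denoise_with_maps(x_T + delta, zs, model)  # differentiable
        logits = classifier(x_adv)
        if logits.argmax(dim=-1).ne(label).all():
            break  # every sample in the batch is misclassified
        # Maximize the classifier's loss on the true label (untargeted)
        # while penalizing semantic drift from the clean image.
        loss = -F.cross_entropy(logits, label) + lam * semantic_loss(x_adv)
        opt.zero_grad()
        loss.backward()
        opt.step()
    return denoise_with_maps(x_T + delta, zs, model).detach()

# Toy end-to-end usage with a linear classifier, continuing the
# variables (x_T, zs, model) from the previous snippet.
classifier = torch.nn.Sequential(torch.nn.Flatten(),
                                 torch.nn.Linear(3 * 32 * 32, 10))
x_adv = semantically_guided_attack(x_T, zs, model, classifier,
                                   torch.tensor([3]))
```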
You need to download the test image dataset and the target model checkpoints yourself.