딥페이크 생성 방지를 위한 확산 모델 맞춤형 방어 기법 연구

Yeongmin Ko
Pusan National University

Abstract

Remarkable advancements in latent diffusion models (LDMs) allow photo-realistic image generation from simple text prompts. However, their misuse by malicious actors has raised ethical concerns, particularly regarding facial forgery and privacy violations. Therefore, recent studies have proposed anti-customization methods to deal with these problems proactively. Unfortunately, their focus on protective performance often results in noticeable perturbations, making them impractical for real-world applications. Moreover, these methods struggle to remain effective against common image transformations such as JPEG compression and Gaussian blur. In this paper, we propose a novel anti-customization algorithm, dubbed AEGIS, that generates highly imperceptible perturbations while remaining robust to various image transformations. We first generate perturbation intensity maps using face-parsing and superpixel-based random perturbation scaling, which spatially control the magnitude of the perturbation updates during the noise optimization process. This allows us to arrange adversarial perturbations in perceptually insensitive regions while considering the optimal tradeoff between defense effectiveness and visual imperceptibility. Furthermore, we integrate the Expectation Over Transformation (EOT) to improve robustness against various image transformations. Extensive experiments on CelebA-HQ and VGGFace2 demonstrate that our method significantly reduces the visibility of perturbations while maintaining defense performance comparable to that of existing approaches.

Background

TBA

Motivation

TBA

Key Idea

  1. TBD

Figure 1: TBD