跳到主要导航 跳到搜索 跳到主要内容

ID-Booth: Identity-consistent Face Generation with Diffusion Models

  • Darian Tomasevic
  • , Fadi Boutros
  • , Chenhao Lin
  • , Naser Damer
  • , Vitomir Struc
  • , Peter Peer
  • University of Ljubljana
  • Fraunhofer Institute for Computer Graphics Research
  • Technische Universität Darmstadt

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

Recent advances in generative modeling have enabled the generation of high-quality synthetic data that is applicable in a variety of domains, including face recognition. Here, state-of-the-art generative models typically rely on conditioning and fine-tuning of powerful pretrained diffusion models to facilitate the synthesis of realistic images of a desired identity. Yet, these models often do not consider the identity of subjects during training, leading to poor consistency between generated and intended identities. In contrast, methods that employ identity-based training objectives tend to overfit on various aspects of the identity, and in turn, lower the diversity of images that can be generated. To address these issues, we present in this paper a novel generative diffusion-based framework, called ID-Booth. ID-Booth consists of a denoising network responsible for data generation, a variational auto-encoder for mapping images to and from a lower-dimensional latent space and a text encoder that allows for prompt-based control over the generation procedure. The framework utilizes a novel triplet identity training objective and enables identity-consistent image generation while retaining the synthesis capabilities of pretrained diffusion models. Experiments with a state-of-the-art latent diffusion model and diverse prompts reveal that our method facilitates better intra-identity consistency and inter-identity separability than competing methods, while achieving higher image diversity. In turn, the produced data allows for effective augmentation of small-scale datasets and training of betterperforming recognition models in a privacy-preserving manner. The source code for the ID-Booth framework is publicly available at https://github.com/dariant/ID-Booth.

源语言英语
主期刊名2025 IEEE 19th International Conference on Automatic Face and Gesture Recognition, FG 2025
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331553418
DOI
出版状态已出版 - 2025
活动19th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2025 - Tampa, 美国
期限: 26 5月 202530 5月 2025

出版系列

姓名2025 IEEE 19th International Conference on Automatic Face and Gesture Recognition, FG 2025

会议

会议19th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2025
国家/地区美国
Tampa
时期26/05/2530/05/25

学术指纹

探究 'ID-Booth: Identity-consistent Face Generation with Diffusion Models' 的科研主题。它们共同构成独一无二的指纹。

引用此