TY - GEN
T1 - An Improved Deep Convolutional Generative Adversarial Network for Anime-Style Character Image Painting
AU - Hua, Zhong
AU - Jie, Lin
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The application of Deep Convolutional Generative Adversarial Networks (DCGANs) has gained significant popularity in the domain of anime-style image painting. However, the model often suffers from the imbalanced training process and other certain limitations, such as the presence of blurred regions and feature misalignment. In our work, an enhanced DCGAN is proposed to tackle these challenges. First of all, we introduce a scoring mechanism in each convolutional layer of the discriminator to enhance its ability to capture multi-scale features for accurate rating. From another perspective, we integrate the self-attention mechanism and modify the sizes of convolutional kernels in the generator, resulting in greater capacity to generate details. To ensure training stability, we employ the WGAN-GP loss, which also improves the generation diversity. Furthermore, in order to provide a fair and thorough evaluation of the generated outputs, we employ a comprehensive human-machine scoring method to evaluate the outcomes. The experimental results demonstrate that our improved DCGAN exhibits stronger feature extraction capabilities and effectively enhances the quality of generated images.
AB - The application of Deep Convolutional Generative Adversarial Networks (DCGANs) has gained significant popularity in the domain of anime-style image painting. However, the model often suffers from the imbalanced training process and other certain limitations, such as the presence of blurred regions and feature misalignment. In our work, an enhanced DCGAN is proposed to tackle these challenges. First of all, we introduce a scoring mechanism in each convolutional layer of the discriminator to enhance its ability to capture multi-scale features for accurate rating. From another perspective, we integrate the self-attention mechanism and modify the sizes of convolutional kernels in the generator, resulting in greater capacity to generate details. To ensure training stability, we employ the WGAN-GP loss, which also improves the generation diversity. Furthermore, in order to provide a fair and thorough evaluation of the generated outputs, we employ a comprehensive human-machine scoring method to evaluate the outcomes. The experimental results demonstrate that our improved DCGAN exhibits stronger feature extraction capabilities and effectively enhances the quality of generated images.
KW - Deep Convolution Generative Adversarial Networks
KW - anime-style image painting
KW - comprehensive rating
KW - self-attention mechanism
UR - https://www.scopus.com/pages/publications/85201260171
U2 - 10.1109/ICCCS61882.2024.10603201
DO - 10.1109/ICCCS61882.2024.10603201
M3 - 会议稿件
AN - SCOPUS:85201260171
T3 - 2024 9th International Conference on Computer and Communication Systems, ICCCS 2024
SP - 102
EP - 106
BT - 2024 9th International Conference on Computer and Communication Systems, ICCCS 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on Computer and Communication Systems, ICCCS 2024
Y2 - 19 April 2024 through 22 April 2024
ER -