基于生成对抗网络的服装图像生成研究进展

现代纺织技术 ›› 2023, Vol. 31 ›› Issue (2): 36-.

基于生成对抗网络的服装图像生成研究进展

浙江理工大学,a.服装学院；b.服装数字化技术浙江省工程实验室，杭州310018

收稿日期:2022-03-24 出版日期:2023-03-10 网络出版日期:2023-03-20
作者简介:施倩(1997—)，女，河南信阳人，硕士研究生，主要从事数字服装、计算机视觉方面的研究。
基金资助:
浙江省一般软科学研究计划(2022C35099)；浙江省丝绸与文化艺术研究中心培育项目(ZSFCRC20204PY)

Research progress of clothing image generation based on Generative Adversarial Networks

a.School of Fashion Design & Engineering; b.Zhejiang Provincial Engineering Laboratory of Fashion Digital Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China

Received:2022-03-24 Published:2023-03-10 Online:2023-03-20

摘要/Abstract

摘要： 生成式人工智能正逐步运用在服装零售、电子商务、趋势预测、虚拟现实以及增强现实等服装产业技术中，并广泛覆盖相关品类的服务与产品。深度学习领域，图像生成模型主要包括深度信念网络(DBN)、变分自编码器(VAE)、生成对抗网络(GAN)。文章围绕GAN研究前沿对其变体发展进行分类梳理，将应用于服装领域最为广泛的条件生成对抗网络(CGAN)在Text to Image、Image to Image、Image to Video中的相关研究成果加以介绍，并分析其优化历程、优缺点以及适用类型；列举GAN在服装图像生成中的具体应用，包括服装横幅广告自动生成、个性化服装推荐与生成、服装与图案设计、虚拟试衣；最后针对服装图像生成研究挑战作出总结。研究认为，未来可在多模态生成模型研发、大规模时尚服装数据集构建、服装生成客观评估指标的3个方向展开研究。

关键词: 深度学习, 生成对抗网络, 服装生成, 智能广告设计, 虚拟试衣

Abstract:

The depth generation model mainly includes Deep Belief Network (DBN), Variational SelfEncoder (VAE) and Generative Adversarial Network (GAN). GAN, as a popular indepth learning framework in recent years, constructs two networks G and D which are mutually antagonistic and game, so that they can achieve Nash equilibrium through continuous iterative training and then realize the automatic generation of images. GAN can be applied in many fields, including semisupervised learning, sequence data generation, image processing, domain adaptation, etc. The image processing field can be subdivided into multiple scenes, such as image generation, image superresolution, image style transformation, object transformation and object detection. The most widely used and successful part of GAN in image processing is image generation. Clothing image generation based on 5G, big data, depth learning and other technologies can effectively promote the digital development process of apparel ecommerce.
Conditional Generative Adversarial Network (CGAN) implements constraints on sample generation by adding constraint condition Y to GAN. CGAN can direct the generator to synthesize data in a directional way against expected samples. Therefore, CGAN is an effective model to realize automatic generation of clothing images that meet specific needs. During the training, the generator learns to generate realistic samples matching the labels of the training data set, and the discriminator learns to match the correct labels for the identified real samples. At present, the public clothing data sets that can be applied to clothing image generation mainly include FashionMNIST, Deep Fashion, Fashion AI, etc. According to the data morphology classification of the input model and the output model, the main forms of CGAN implementation in the clothing field are TexttoImage, ImagetoImage and ImagetoVideo. The three data synthesis forms respectively contain various derived GAN models, which correspond to different clothing generation application scenarios. TexttoImage aims to generate the required clothing image based on the given description text, which can be specifically applied to a given model change, clothing texture rendering, character pose and clothing attribute generation, clothing category and background classification; ImagetoImage is mostly applied in clothing design, such as clothing pattern design, clothing image conversion (from sketch to cartoon clothing, model to dress), style transfer, virtual fitting, fashion trend forecast, etc.; ImagetoVideo is usually applied to facial expression video frame prediction, anonymous model video generation, virtual fitting and other scenes.
In recent years, research on GANapplied clothing image generation industry is mostly distributed in the field of ecommerce, including automatic generation of clothing banner advertisements, personalized clothing recommendation system, clothing and pattern design, virtual fitting towards video presentation, etc., which greatly enables the upgrading of related digital clothing industry. However, the industry is still facing the problems of single utility of generation model, narrow application of clothing data set and lack of objective and unified criteria for generation evaluation. The research in the future will focus on the research and development of integrated multimodal generation model, the collection of largescale clothing data sets, and the development of objective criteria for clothing image generation and evaluation.

Key words: deep learning, GAN, clothing generation, intelligent ad design, virtual fitting.

中图分类号:

TN941.26

施倩, 罗戎蕾. 基于生成对抗网络的服装图像生成研究进展[J]. 现代纺织技术, 2023, 31(2): 36-.

SHI Qian, LUO Ronglei. Research progress of clothing image generation based on Generative Adversarial Networks[J]. Advanced Textile Technology, 2023, 31(2): 36-.

参考文献

［1］HINTON G E, OSINDERO S, TEH Y W. A fast learning algorithm for deep belief nets［J］. Neural computation, 2006, 18(7): 1527-1554.

［2］DIEDERIK P K, MAX W. Auto-encoding variational bayes . ArXiv,2013:1312.6114.

［3］GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial nets［C］ Proceedings of the 27th International Conference on Neural Information Processing Systems. Montreal, Canada, 2014：2672-2680.

［4］PARK T, LIU M Y, WANG T C, et al. Semantic image synthesis with spatially-adaptive normalization［C］Proceedings of the IEEE CVF Conference on Computer Vision and Pattern Recognition. Long Beach, CA, USA, 2019: 2337-2346.

［5］KAMADA S, ICHIMURA T, HARA A, et al. Adaptive structure learning method of deep belief network using neuron generation-annihilation and layer generation［J］. Neural Computing and Applications, 2019, 31(11): 8035-8049.

［6］HINTON G E, SALAKHUTDINOV R R. Reducing the dimensionality of data with neural networks［J］. Science, 2006, 313(5786): 504-507.

［7］SUSSKIND J M, HINTON G E, MOVELLAN J R, et al. Generating facial expressions with deep belief nets［J］. Affective Computing, Emotion Modelling, Synthesis and Recognition, 2008, 2008(5): 421-440.

［8］OSINDERO S, HINTON G E. Modeling image patches with a directed hierarchy of Markov random fields［C］Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, 2007: 1121-1128.

［9］TORRALBA A, FERGUS R, WEISS Y. Small codes and large image databases for recognition［C］Conference on Computer Vision and Pattern Recognition. IEEE, 2008: 1-8.

［10］VAHDAT A, KAUTZ J. Nvae: A deep hierarchical variational autoencoder . ArXiv,2020:2007.03898.

[1]	薛萧昱, 何佳臻, 王敏. 三维虚拟试衣技术在服装设计与性能评价中的应用进展[J]. 现代纺织技术, 2023, 31(2): 12-.
[2]	曾华福, 杨杰, 李林红. 基于改进ShuffleNet v1的服装图像分类算法[J]. 现代纺织技术, 2023, 31(2): 23-.
[3]	阮梦玉, 李敏, 何儒汉, 姚迅. 基于改进RefineDet的织物疵点检测[J]. 现代纺织技术, 2022, 30(5): 12-20.
[4]	仇惠玲, 刘雅玲. 线上虚拟试衣平台服务质量主题挖掘及指标构建[J]. 现代纺织技术, 2022, 30(4): 222-229.
[5]	从明芳, 李子印, 卢鸯, 韩高锋, 谢凌佳, 王启真. 基于Mask R-CNN深度学习的羊绒羊毛纤维识别技术[J]. 现代纺织技术, 2022, 30(2): 36-40.
[6]	杨佑国，徐平华，徐明慧，丁雪梅，王伊雯. 基于虚拟试衣的着装应力分布效应评价[J]. 现代纺织技术, 2021, 0(6): 106-112.
[7]	汪坤,史伟民,李建强,彭来湖. 基于深度学习的织物印花分割算法研究[J]. 现代纺织技术, 2021, 0(3): 45-50.
[8]	夏馨，王利君，王洁. 不同姿势下直筒裙造型与面料性能关系研究[J]. 现代纺织技术, 2015, 0(03): 31-36.