现代纺织技术

• •    

基于扩散模型的多阶段一致可控虚拟试衣

  

  • 网络出版日期:2025-04-16

Multi-stage consistent and controllable virtual try-on based on diffusion models

  • Online:2025-04-16

摘要: 为解决现有扩散模型在虚拟试衣任务中难以保持服装纹理和色彩一致性的问题,提出了一种新的多阶段虚拟试衣方法。该方法包括两个阶段:在第一阶段,设计了一个基于多尺度特征注意力模块的服装形变网络,以抑制参考服装在形变过程中出现的纹理不自然现象,确保服装纹理和色彩的准确传递。在第二阶段,通过结合ControlNet和边缘与角落区域掩码,构建了一个局部修复网络,该网络可以在保持服装纹理细节的同时,专注于修复服装与人体的边界区域。结果表明:服装形变网络能够保持服装主干区域的色彩和纹理特征,局部修复网络能够填补人体和服装边界区域的纹理,从而提高虚拟试衣的真实感。相比现有虚拟试衣方法,该多阶段虚拟试衣方法在视觉一致性方面表现出优越的效果,提升了虚拟试衣体验。

关键词: 虚拟试衣, 扩散模型, 一致性, ControlNet

Abstract: "With the rapid advancement of image generation technology, diffusion models have emerged as a dominant approach in in the field of image synthesis and demonstrated significant potential across various application scenarios, with virtual try-on being one of its crucial application directions. Compared to traditional generative models, diffusion models excel in producing higher-quality images. However, due to their inherent randomness, it is challenging to ensure consistency between the generated try-on clothing and the reference clothing. Currently, despite extensive efforts to enhance consistency by incorporating conditional control into diffusion models, but they still face challenges such as accurately preserving clothing details and optimizing the integration quality between clothing and the human body. To address these issues, this paper proposes a multi-stage virtual try-on framework aimed at improving clothing consistency and enhancing the integration effect between clothing and the human body. The proposed method consists of two primary stages. In the first stage, a clothing deformation network integrated with a multi-scale feature attention module is designed to reduce texture distortion during the clothing deformation process, improve the stability of clothing deformation, and enhance the ability to preserve detailed textures. In the second stage, a local refinement network based on ControlNet is introduced to perform detailed refinement on the boundary regions between clothing and the human body. By incorporating dense pose mapping and clothing reference images as additional control signals, this network can effectively reconstruct clothing details, achieving a more natural integration of clothing with the human body. Experimental results demonstrate that the proposed method offers significant advantages in maintaining clothing consistency while improving the realism of the try-on images. Ablation experiments and comparative analysis with current mainstream virtual try-on frameworks further validate the superiority of this approach in preserving clothing details and optimizing the clothing deformation effect. The results indicate that this method has achieved significant improvements in enhancing the quality of virtual try-on, but there are still certain limitations. To streamline the workflow and improve overall try-on quality, future research will explore end-to-end training strategies and conduct training and testing on larger, more diverse datasets to enhance the model's generalization capability. By further optimizing the method and addressing existing challenges, this study aims to provide more reliable technical support for intelligent clothing design and digital clothing applications. The proposed method effectively tackles issues related to clothing deformation and detail preservation, significantly elevating the visual experience of virtual try-on and providing users with more realistic and reliable try-on effects. These improvements open up new possibilities for applications in intelligent clothing design, online shopping platforms, and digital clothing displays."

Key words: "virtual try-on, diffusion model, consistency, ControlNet

中图分类号: