现代纺织技术 ›› 2025, Vol. 33 ›› Issue (01): 58-64.

• • 上一篇    下一篇

融合注意力机制与改进ResNet50的服装图像属性预测方法

  

  1. 1. 常州纺织服装职业技术学院, a.智能制造学院;b.智能纺织与材料学院, 江苏常州 213164; 2. 江苏省碳纤维先进材料智能制造工程技术研究开发中心, 江苏常州 213164
  • 出版日期:2025-01-10 网络出版日期:2025-02-18

A clothing image attribute prediction method integrating attention mechanism and improved ResNet50

  1. 1. School of Intelligent Manufacturing; b. School of Intelligent Textiles and Materials, Changzhou Vocational Institute of Textile and Garment, Changzhou 213164, China; 2. Jiangsu Research Center of Intelligent Manufacturing Technology for Carbon Fiber and Advanced Material, Changzhou 213164, China
  • Published:2025-01-10 Online:2025-02-18

摘要: 为了解决人工标注服装图像属性效率低下的问题,提出了一种融合注意力机制与改进ResNet50的服装图像属性预测方法。文章首先对传统多标签分类方法中的模型进行了改进,改进后的方法能更充分利用任务之间的相关性,并减少数据稀缺问题带来的影响;接着文章引入CBAM注意力机制,用于捕捉服装属性上的细节特征。结果表明:在未引入注意力机制的情况下,基于改进ResNet50的方法在多项评价指标上均优于传统多标签分类方法,准确率提高了25.96%,与ResNet34、EfficientNet_V2、VGG16模型相比,ResNet50模型在服装图像属性预测方面整体表现更佳;引入CBAM注意力机制后,基于改进ResNet50的方法的准确率再提高了1.72%。文章所提的融合注意力机制与改进ResNet50的服装图像属性预测方法,能够有效预测服装图像属性,为实现服装图像属性的自动化标注提供了新的思路。

关键词: 服装图像, 属性预测, 注意力机制, ResNet50, 深度学习

Abstract: In recent years, with the popularity of online shopping, a large number of clothing images have emerged on the Internet. How to automatically extract key information from these massive clothing images has become a hot topic in current research. Through analyzing and identifying the relevant attributes of these clothing images and combining them with information such as price, sales volume and user comments, intelligent recommendations and trend predictions can be further achieved. This not only helps merchants grasp market demand in advance and formulate more accurate marketing strategies and business decisions but also provides designers with valuable creative inspiration. However, labeling the attributes of a large number of clothing images is also a tedious and costly task for online clothing sellers. Therefore, researching the classification and prediction of clothing image attributes has important practical significance and application value. 
To improve the prediction accuracy of clothing image attributes and to address the inefficiency of manual labeling of clothing image attributes, this paper proposes a clothing image attribute prediction method integrating the attention mechanism and improved ResNet50. This method improves the network structure of the ResNet50 model to adapt to the clothing multi-attribute prediction task and introduces the attention mechanism into the improved ResNet50 model to capture the detailed features of clothing attributes to improve the prediction accuracy. The method not only applies the improved deep learning algorithm to clothing attribute prediction, but also verifies the effectiveness of the method in clothing attribute prediction. It can effectively improve the accuracy of clothing image attribute prediction and identify attribute categories with superior prediction outcomes, providing new ideas for realizing the automatic labeling of clothing image attributes. The experimental results show that in the absence of the attention mechanism, the method based on the improved ResNet50 outperforms the traditional multi-label classification method in terms of accuracy, precision, recall, and F1 score, with the accuracy increasing by 25.96%. On the whole, the ResNet50 model performs better than the ResNet34, EfficientNet_V2, and VGG16 models in terms of accuracy, precision, recall, and F1 score. Compared with the method without the introduction of the CBAM attention mechanism, the ResNet50 method enhanced with CBAM improves the accuracy by 1.72%. In the prediction of each attribute category, the pattern, sleeve type, and style performed well, while the accuracy of the collar type is only 0.684, which is not good. In addition, the fabric and tightness show high accuracy but low recall. In future research, higher quality datasets can be built for training, and certain clothing attribute categories can also be studied separately to improve the prediction accuracy of the model. 

Key words: clothing images, attribute prediction, attention mechanism, ResNet50, deep learning

中图分类号: