Generation of Captions Highlighting the Differences between a Clothing Image Pair with Attribute Prediction

Abe, Kohei; Yokoyama, Soichiro; Yamashita, Tomohisa; Kawamura, Hidenori

Home // INTELLI 2024, The Thirteenth International Conference on Intelligent Systems and Applications // View article

Generation of Captions Highlighting the Differences between a Clothing Image Pair with Attribute Prediction

Authors:
Kohei Abe
Soichiro Yokoyama
Tomohisa Yamashita
Hidenori Kawamura

Keywords: deep learning, image captioning, consumer support, information provision.

Abstract:
Detailed information for comparisons between products is necessary in consumers’ product purchasing process, especially during the information search and choice evaluation phases. However, conventional product descriptions, which are the main source of information, tend to focus only on the product in question, and thus do not adequately express the differences between products. To solve this problem, garments are treated as target products, and a caption-generation method that emphasizes the differences between pairs of garment images using a deep-learning model for image caption-generation is proposed and its effectiveness verified. The proposed method selects and outputs captions that express differences in features from a set of captions generated for input-garment image pairs. Subject experiments confirmed that the proposed method accurately represented the feature differences between garments and provided useful information for consumers to compare garments. In particular, the proposed method is highly effective for garment pairs with similar features.

Pages: 7 to 16

Copyright: Copyright (c) IARIA, 2024

Publication date: March 10, 2024

Published in: conference

ISSN: 2308-4065

ISBN: 978-1-68558-132-9

Location: Athens, Greece

Dates: from March 10, 2024 to March 14, 2024