Home // CONTENT 2023, The Fifteenth International Conference on Creative Content Technologies // View article


Using Text Queries to Look Up Unlabeled Images: A Command-Line Search Tool Based on CLIP

Authors:
Yurij Mikhalevich

Keywords: image search, image indexing, photo management, computer vision, natural language processing

Abstract:
This paper presents a practical, scalable implementation of an image search engine using OpenAI’s Contrastive Language-Image Pre-Training (CLIP) model. The method pro- vides a convenient Command-Line Interface (CLI) and introduces a cache layer powered by SQLite 3 relational database management system (RDBMS) that facilitates efficient repetitive image searches within extensive image databases using natural language queries. The method’s effectiveness was evaluated on ImageNet-1k and CIFAR-100 datasets, yielding a 31.17% top-1 accuracy on the ImageNet-1k train set and 55.15% top-1 accuracy on the CIFAR-100 test set. The scalability study showed that indexing time scales linearly with the number of images, and image search time increases only slightly; for example, on an Apple M1 Max CPU, indexing over a million images took 26.36 times more than indexing 50,000 images, while querying the larger image set took just 2.75 times longer. This approach is particularly relevant for industries managing vast volumes of visual data, such as media and entertainment, security, and healthcare.

Pages: 7 to 11

Copyright: Copyright (c) IARIA, 2023

Publication date: June 26, 2023

Published in: conference

ISSN: 2308-4162

ISBN: 978-1-68558-048-3

Location: Nice, France

Dates: from June 26, 2023 to June 30, 2023