Retrieve images from a text or image query using OpenAI's pretrained CLIP model.
- Text as query
- Image as query
CLIP (Contrastive Language-Image Pre-Training) is a neural network trained on a variety of (image, text) pairs. It can map images and text into the same latent space, so that they can be compared using a similarity measure.
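The comparison step can be sketched in plain Python. The vectors below are toy stand-ins for real CLIP embeddings (which would come from the model's image and text encoders); only the similarity computation itself is shown.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy stand-ins for CLIP embeddings of one text query and two images.
text_query = [0.1, 0.9, 0.2]
image_a = [0.1, 0.8, 0.3]   # close to the query in the latent space
image_b = [0.9, 0.1, 0.1]   # far from the query in the latent space

print(cosine_similarity(text_query, image_a) > cosine_similarity(text_query, image_b))  # True
```

Because CLIP places images and text in the same latent space, the same function ranks images against either kind of query.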
Extending the work in this repository, I built a simple image search engine that accepts both text and images as queries. Each image in the collection is encoded into a feature vector and stored in an index keyed by image ID:
image_id: {"url": "https://abc.com/xyz", "feature_vector": [0.1, 0.3, ..., 0.2]}
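A rough sketch of the index definition behind records like the one above. The field names follow that record; the `knn_vector` type and the 512-entry dimension are assumptions, based on the kNN plugin available on Amazon Elasticsearch Service and the embedding size of CLIP's ViT-B/32 model.

```python
# Assumed index mapping for the record layout shown above.
# "knn_vector" is the vector field type of the kNN plugin shipped with
# Amazon Elasticsearch Service; 512 matches CLIP ViT-B/32 embeddings.
index_body = {
    "settings": {"index": {"knn": True}},
    "mappings": {
        "properties": {
            "url": {"type": "keyword"},
            "feature_vector": {"type": "knn_vector", "dimension": 512},
        }
    },
}

# One document, keyed externally by image_id, matching the layout above.
doc = {"url": "https://abc.com/xyz", "feature_vector": [0.1] * 512}
```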
I used the lite version of the Unsplash dataset, which contains 25,000 images. The k-nearest-neighbor search is powered by Amazon Elasticsearch Service. The query service is deployed as an AWS Lambda function behind an API gateway, and the frontend is built with Streamlit.
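The Lambda side can be sketched as below. The encoding and kNN search are stubbed out as hypothetical helpers; only the request shape (`query` plus `input_type`, as used by the curl example further down) comes from this document.

```python
import json

def lambda_handler(event, context=None):
    """Sketch of the Lambda entry point behind the API gateway.
    The real handler would encode the query with CLIP and run a kNN
    search against Elasticsearch; both steps are stubbed out here."""
    body = event if isinstance(event, dict) else json.loads(event)
    query = body.get("query")
    input_type = body.get("input_type")
    if input_type not in ("text", "image"):
        return {"statusCode": 400, "body": "input_type must be 'text' or 'image'"}
    # vector = encode(query, input_type)   # hypothetical: CLIP text/image encoder
    # results = knn_search(vector)         # hypothetical: Elasticsearch kNN query
    results = []  # placeholder
    return {"statusCode": 200, "body": json.dumps({"query": query, "results": results})}
```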
pip install -e . --no-cache-dir
python scripts/download_unsplash.py --image_width=480 --threads_count=32
This will download and extract a zip file containing metadata about the photos in the dataset, then use the photos' URLs to download the actual images to unsplash-dataset/photos. The download may fail for a few images (see this issue). Since CLIP downsamples images to 224 x 224 anyway, you can reduce the width of the downloaded images to save storage space. You can also increase the threads_count parameter to speed up the download.
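The two tunables above can be sketched as follows. The `w=` URL parameter is an assumption about how the script asks Unsplash's CDN for downscaled images, and `fetch` stands in for whatever wrapper actually saves each file.

```python
from concurrent.futures import ThreadPoolExecutor

def resized_url(url, width):
    """Request a downscaled image by appending a width parameter
    (an assumption about how --image_width shrinks downloads)."""
    sep = "&" if "?" in url else "?"
    return f"{url}{sep}w={width}"

def download_all(urls, fetch, width=480, threads_count=32):
    """Fetch images in parallel. `fetch` is a callable (e.g. a
    requests.get wrapper that saves the file and returns True/False)."""
    targets = [resized_url(u, width) for u in urls]
    with ThreadPoolExecutor(max_workers=threads_count) as pool:
        return list(pool.map(fetch, targets))
```

Downloading is I/O-bound, so more threads generally helps until the network or the CDN becomes the bottleneck.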
python scripts/ingest_data.py
The script will download the pretrained CLIP model and process the images in batches, using a GPU if one is available.
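The batching can be sketched as below; the ingest loop in the comments uses hypothetical helper names, since the script's internals are not shown here.

```python
def batched(items, batch_size):
    """Yield successive fixed-size batches (the last may be shorter)."""
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]

# The ingest loop then looks roughly like (hypothetical helpers):
#   device = "cuda" if torch.cuda.is_available() else "cpu"
#   for batch in batched(image_paths, 256):
#       vectors = encode_images(batch, device)  # CLIP forward pass
#       bulk_index(vectors)                     # write to Elasticsearch
print(list(batched([1, 2, 3, 4, 5], 2)))  # [[1, 2], [3, 4], [5]]
```

Batching keeps GPU memory bounded and lets the Elasticsearch writes use the bulk API instead of one request per image.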
Build the Docker image for AWS Lambda.
docker build --build-arg AWS_ACCESS_KEY_ID=YOUR_AWS_ACCESS_KEY_ID \
    --build-arg AWS_SECRET_ACCESS_KEY=YOUR_AWS_SECRET_ACCESS_KEY \
    --tag clip-image-search \
    --file server/Dockerfile .
Run the Docker image as a container.
docker run -p 9000:8080 -it --rm clip-image-search
Test the container with a POST request.
curl -XPOST "http://localhost:9000/2015-03-31/functions/function/invocations" -d '{"query": "two dogs", "input_type": "text"}'
streamlit run streamlit_app.py