vector database food similarity Download - vector database food similarity Source code download

vector database food similarity

Other source code

1.0.0

Download

Vector database for recommandation

Use case of similarity recommendations for food products with Chroma vector database based on Open Food Facts.

How it works

Schema

1. Data mining and preparation (data-mining.js)

Around 10,000 products are retrieved using the Open Food Facts API but some products lack information, so after processing, we have around 4,300 products. Each product has a unique identifier, name, image URL, and the percentage of ingredients. There are over 1,500 indexed vectors, with each vector representing the percentage of an ingredient in the total food product (such as sugar, oil, water, etc.).
To initiate the generation of datasets, run the following command:

npm run data-mining

2. Migration on vector database (migration.js)

Migration will export products datasets (products.json) to local Chroma database.
To launch the migration, run:

npm run migration

3. Query the database (query.js)

To perform a query to the vector database, you need to generate vectors from product data.
First, we call the Open Food Facts product API. Then, we generate vectors for embedding (likeliest_recipes.json) and use them to make a request to the database.
By default, when a query is performed on the Chroma database, similar products are determined using the Squared L2 Norm vector norm:

L2 Norm

Others vector norms are also available such as the Inner product or Cosine similarity, more details on Chroma and Hnswlib.

Installation

Install required packages using npm
```
npm ci
```
To run the Chroma database in a Docker container, expose it on port 8080
```
docker run -p 8000:8000 chromadb/chroma:0.4.21
```
Query the database using the unique product code and display the 3 closest matches in the console.
The unique product code can be found in the Open Food Facts URL (example with 3017620429484: https://world.openfoodfacts.org/product/3017620429484/nutella-hazelnut-spread-ferrero)
```
node ./query.js product=3017620429484
```