
So this could be used to find similar images in a database?

For now I've tested a naive method where I convert all images to 64x64 black and white and use a simple Levenshtein distance. It's not efficient, but for not-too-large datasets it works.
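A minimal sketch of that naive method (the resize step assumes Pillow; the edit distance is plain Python). Note the DP is O(n·m), which gets expensive on 4096-character signatures:

```python
def levenshtein(a: str, b: str) -> int:
    # Classic dynamic-programming edit distance, O(len(a) * len(b)).
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]

def image_signature(img) -> str:
    # Downscale to 64x64 1-bit black and white and flatten to a string;
    # img is assumed to be a PIL.Image (Pillow is an assumption here).
    small = img.convert("1").resize((64, 64))
    return "".join("1" if p else "0" for p in small.getdata())
```

Since these signatures are all the same length, a plain Hamming distance (count of differing positions) gives similar results far more cheaply than Levenshtein.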

I guess I could just add the image histogram.

I'm still curious how TinEye works.



It's not only about visual similarity, but also about the semantics of the images (for example, all the dog photos should be close to each other, regardless of colours or scene).

The simplest way to do that is probably to use a pretrained neural network (like ResNet) to convert the images into embeddings, then index the embeddings and use them for search.
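A minimal sketch of the index-and-search step, assuming the embeddings have already been extracted with a pretrained network such as ResNet (the extraction itself is omitted; everything below is plain NumPy):

```python
import numpy as np

def build_index(embeddings: np.ndarray) -> np.ndarray:
    # L2-normalise once so cosine similarity reduces to a dot product.
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    return embeddings / np.clip(norms, 1e-12, None)

def search(index: np.ndarray, query: np.ndarray, k: int = 5) -> list:
    # Rank all images by cosine similarity to the query embedding.
    q = query / max(np.linalg.norm(query), 1e-12)
    scores = index @ q
    return np.argsort(-scores)[:k].tolist()
```

This is a brute-force scan; for large collections an approximate-nearest-neighbour index (e.g. Faiss) would replace the full argsort.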

I'm sharing my article describing how to implement it: https://medium.com/p/5515270d27e3


This is really great. Thanks for sharing.


Have you looked at CLIP? You can use that to create a vector embedding for an image that includes semantic information (what's actually in the image - animals, colours, etc) - those could then be used with pgvector to find similar images.
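A sketch of the pgvector side, assuming CLIP embeddings (512 dimensions for the ViT-B/32 variant — an assumption here). These functions just build the SQL; `<=>` is pgvector's cosine-distance operator, so ascending order means most similar first:

```python
def create_table_sql(dim: int = 512) -> str:
    # Requires the pgvector extension: CREATE EXTENSION vector;
    return (f"CREATE TABLE images (id bigserial PRIMARY KEY, "
            f"path text, embedding vector({dim}));")

def knn_sql(k: int = 10) -> str:
    # %s is the parameter placeholder for the query embedding
    # (e.g. via psycopg); lower cosine distance = more similar.
    return (f"SELECT id, path, embedding <=> %s AS distance "
            f"FROM images ORDER BY distance LIMIT {k};")
```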

CLIP is how search engines like https://lexica.art/?q=6dc768e2-7a7c-494d-9a39-fd8f27e69248 work.


Using CLIP would also let you search with text sentences. And it can be used to accelerate the embedding-similarity sorting step. Like https://mazzzystar.github.io/2022/12/29/Run-CLIP-on-iPhone-t...
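The speedup comes from precomputing the image embeddings once; because CLIP maps text and images into one space, each new text query is then just a single matrix multiply. A sketch, assuming the embeddings are already L2-normalised CLIP outputs:

```python
import numpy as np

def cache_embeddings(path: str, image_embs: np.ndarray) -> None:
    # Precompute image embeddings once and store them; text queries
    # then never need to touch the image encoder again.
    np.save(path, image_embs)

def text_search(path: str, text_emb: np.ndarray, k: int = 5) -> list:
    image_embs = np.load(path)
    # Text and image embeddings share one space, so a single matrix
    # multiply scores every image against the sentence.
    scores = image_embs @ text_emb
    return np.argsort(-scores)[:k].tolist()
```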



