In developmentPythonPyTorchPostgreSQL+7 more

ClairVision - Scalable AI Image Curation & Vector Search Engine

A distributed, backend-first AI image curation system that streams remote images, builds vector embeddings, enables similarity graph clustering, and powers instant face + visual retrieval across events.

Timeline

Oct 2025 – Present

Role

Backend & ML Systems Architecture

Team

3

Status

In development

Source Code

Technology Stack

Python

PyTorch

PostgreSQL

Async Queues / Workers

FAISS

CLIP ViT-L/14

FaceNet (InceptionResnetV1)

Node.js

NextJS

WebGL

Key Challenges

Streaming Large Remote Image Datasets
Scalable Vector Indexing & Similarity Graph Construction
Async Pipeline Orchestration
Low-Latency On-Demand High-Res Streaming
Cross-Event Identity Matching

Key Learnings

Vector Database Architecture (pgvector)
Similarity Graph Modeling
Async Worker Queue Design
High-Throughput Ingestion Pipelines
Backend-Driven ML System Design

Overview

Last Updated - 02 March 2026

ClairVision is a backend-first AI system designed to automate large-scale image curation and visual search. The system is architected as a scalable ingestion and retrieval engine capable of indexing thousands of remote images without permanently storing originals.

Images are streamed through asynchronous workers, embeddings are generated using CLIP and FaceNet, and vectors are stored in PostgreSQL via pgvector. A similarity graph is then constructed to enable instant clustering, duplicate grouping, and cross-event identity matching.

The objective is to transform raw event photography into a structured, searchable visual database powered by vector similarity.

Key Features

Distributed Image Ingestion Pipeline

Streaming-Based Processing: Indexes 3000+ remote images without storing raw originals.
Async Worker Architecture: Queue-driven embedding extraction and metadata generation.
Fault-Tolerant Pipeline: Retry-safe ingestion and background processing via Redis-backed workers.
Scalable Design: Ready for horizontal scaling across larger datasets.

Vector Search & Similarity Graph

CLIP ViT-L/14 Embeddings for semantic image similarity.
FaceNet (InceptionResnetV1) for identity-based embeddings.
pgvector + FAISS Indexing for efficient cosine similarity search.
Similarity Graph Construction enabling near-duplicate clustering and visual grouping.

High-Performance Retrieval System

Low-Latency Vector Queries via indexed nearest-neighbour search.
Signed On-Demand Streaming for high-resolution previews.
Cross-Event Identity Matching across independent image collections.
Instant Visual Clustering powered by similarity thresholds.

Interactive Visualization Layer

WebGL-Powered 3D Mesh UI
Embedding cluster exploration
Real-time backend similarity queries
Graph-based visual navigation

Architecture Overview

Remote image ingestion
Async worker queue processing
Embedding extraction (CLIP + FaceNet)
Vector storage in PostgreSQL (pgvector)
Similarity graph generation
FAISS / cosine similarity querying
API-driven search and streaming

Development Progress

Current Status

Fully operational ingestion + embedding pipeline
pgvector-based vector indexing implemented
Redis-backed async worker system running
Backend similarity search APIs live
Cross-event identity matching enabled

Completed

3000+ image indexing via streaming ingestion
Similarity graph construction
Visual + identity search endpoints
Signed streaming system for previews
Backend-first ML architecture

Next Up

Horizontal scaling for 10k+ images
Distributed worker orchestration
Hybrid vector search (semantic + identity fusion)
Dockerized deployment + CI/CD
Public demo interface

Previous Project

UniPool - University Carpooling Platform

Next Project

Real-Time Bot Detection System (Walmart Sparkathon 2025)

Related Projects

Event Check-In App

Completed

A full-stack real-time event engagement platform where users can browse upcoming events, check in, and see live attendee updates. Built with React Native (Expo), Node.js, GraphQL, Prisma, PostgreSQL, and Socket.io.

React NativeExpoTypeScript+10