Timeline-VLM

Upload images of visual artifacts and the model estimates when they were created by projecting their visual embeddings onto a learned temporal manifold.

📄 Paper · 💻 GitHub

Embedding backbone

CLIP ViT-B/32 EVA01-CLIP-g-14

Visualization

Plotly Matplotlib

📂 Upload images

Drop or click to upload

🗂️ On the timeline

Temporal embedding space

📸 Examples

Click to add to timeline

How it works

Text prompts ("an artifact from the year XXXX") are encoded with CLIP to build a temporal embedding space
Kernel PCA (cosine kernel) reduces these to 3-D, revealing a temporal manifold
A Bezier curve is fitted through the manifold as a smooth temporal axis
Your image is encoded and projected onto that curve, and its position gives the estimated year