Teletext is a TV broadcast system that delivers highly concise, space-constrained news texts. That conciseness turns out to be a real advantage for semantic search and retrieval: dense information, little fluff, and consistent structure over decades.
The project runs fully offline and uses embeddings + Postgres/pgvector to explore, search, and query teletext news from the last 25 years with modern LLM techniques.