Speaker: Edo Liberty
Title: The Rise of the Vector Database
Abstract: Modern Machine Learning (ML) represents everything as vectors, from documents, to videos, to user behavior. This representation makes it possible to accurately search, retrieve, rank, and classify different items by similarity and relevance. Running real-time applications that rely on large numbers of such high dimensional vectors requires a dedicated data infrastructure called a Vector Database. In this talk we will discuss the need for such infrastructure, the algorithmic and engineering challenges in building a vector database, and open problems we still have no adequate solutions for. Time permits, I will introduce Pinecone, the first serverless vector database.