Unlocking the Power of Unstructured Data

Chang She on the power of Lance, a columnar data format for AI/ML.


Chang She is CEO and co-founder of LanceDB, an open-source database designed for multimodal AI applications, offering scalable vector search, streaming training data, and interactive exploration of large AI datasets. In this episode, we discuss Lance, an open-source columnar data format that tackles the unique challenges posed by modern AI and machine learning workloads. Engineered specifically for efficiency, Lance addresses limitations of existing formats like Parquet and ORC by optimizing the storage and retrieval of large, complex data types, including images, videos, and vector embeddings.


