Site icon The Data Exchange

Data quality is key to great AI products and services

The Data Exchange Podcast: Abe Gong on building tools that help teams improve and maintain data quality.


SubscribeApple • Android • Spotify • Stitcher • Google • RSS

In this episode of the Data Exchange, I speak with Abe Gong, CEO and co-founder at Superconductive,  a startup founded by the team behind the Great Expectations (GE) open source project. GE is one of a growing number of tools aimed at improving data quality through tools for validation and testing. Other projects in this area include TensorFlow DV, assertr, dataframe-rules-engine, deequ, data-describe, and Apache Griffin.

Download the 2021 Trends in Data and AI Report, and learn emerging trends in Data, Machine Learning, and AI.

In this episode Abe provided an overview of the GE open source project and its growing community of users and contributors. We also discussed a variety of topics in data engineering and DataOps including data quality, pipelines, and data integration. Abe described what led them to start the GE project:

A few years ago, Ihab Ilyas and I wrote a post describing the emergence of machine learning tools for data quality. We highlighted the academic project HoloClean, a probabilistic cleaning framework for ML-based automatic error detection and repair. In addition to machine learning, today we’re seeing the rise of exciting new tools that use knowledge graphs and category theory.

Subscribe to our Newsletter:
We also publish a popular newsletter where we share highlights from recent episodes, trends in AI / machine learning / data, and a collection of recommendations.

Related content and resources:

FREE Report


[Image by Sipotek-Visual-Inspection-Machine from Wikimedia.]

Exit mobile version