Improving performance and scalability of data science libraries

The Data Exchange Podcast: Wes McKinney on the importance of having a shared infrastructure for data science.


Subscribe: iTunes, Android, Spotify, Stitcher, Google, and RSS.

In this episode of the Data Exchange I speak with Wes McKinney, Director of Ursa Labs and an Apache Arrow PMC Member. Wes is the creator of pandas, one of the most widely used Python libraries for data science. He is also the author of the best-selling book, “Python for Data Analysis” – a book that has become essential reading for both aspiring and experienced data scientists.

Ray Summit has been postponed until the Fall. In the meantime, enjoy a series of free monthly virtual conferences. The next one is scheduled for June 10th: The Road to AutoML – The Challenges of Hyperparameter Tuning. Go to anyscale.com/events to register.

Our conversation focused on data science tools and other topics including:

  • Two open source projects Wes has long been associated with: pandas and Apache Arrow.
  • The need for a shared infrastructure for data science.
  • Ursa Labs: its mission and structure.

Subscribe to our Newsletter:
We also publish a popular newsletter where we share highlights from recent episodes, trends in AI / machine learning / data, and a collection of recommendations.


[Image: Irrigating the desert by Dean Wampler, used with permission.]