Fast Data Access in R and Python with Apache Arrow
Apache Arrow is a cross-language development platform for in-memory analytics. In this tutorial, I'll show how you can use Arrow in Python and R, both separately and together, to speed up data analysis on datasets that are bigger than memory. We'll cover the fundamentals of Arrow in Python and R, then explore in depth Arrow's Dataset feature, which supports fast, efficient querying of large, multi-file datasets. Finally, we'll discuss Flight, an Arrow-native client-server framework for transporting data, and show how to set up a server and query it.
Workshop Overview and Author Bio
Getting Started
Fast Data Access in R and Python with Apache Arrow
Tutorial Slides
Neal Richardson, PhD