Feather — A Fast On-Disk Format for R and Python Data Frames
R and Python are two widely used tools or languages by the data analyst and Scientists. So, it will be great if there is any way to exchange data between these two. Here comes “Feather” — A fast, lightweight, language agnostic and easy-to-use binary file format for storing data frames. It is language agnostic!
“Feather” provides binary columnar serialization for data frames designed to make efficient reading and writing of data frames. It uses Apache Arrow columnar memory specification to represent binary data on disk.
Installation in Python:
conda install -c conda-forge feather-format OR pip install feather-format
Installation in R (Installation of feather goes well from R version ≥ 3.3.0):
install.packages(pkgs=’feather’,dependencies = T) OR devtools::install_github(“wesm/feather/R”)
Now, let's explore through small example codes…
In step1, we will create a pandas dataframe and write the content to disk in feather format.
In step2, we will read the same content in R
Feather: fast, introperable binary data frame storage for Python, R, and more powered by Apache Arrow - wesm/feather