Arrow Optimization
Arrow Optimization uses Apache Arrow for columnar data transfers in the following APIs:
- pyspark.sql.DataFrame.toPandas
- pyspark.sql.SparkSession.createDataFrame (when called with a pandas DataFrame or a NumPy ndarray)
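The two calls listed above can be sketched as a small round trip. This is a minimal illustration, not part of the original section: the helper name arrow_round_trip is hypothetical, and the configuration key spark.sql.execution.arrow.pyspark.enabled is taken from the PySpark Arrow user guide rather than from this page, so check it against the Spark version in use.

```python
from typing import Any

# Configuration key documented in the PySpark Arrow user guide (an assumption
# here, since this section does not name it).
ARROW_CONF = "spark.sql.execution.arrow.pyspark.enabled"


def arrow_round_trip(spark: Any, pdf: Any) -> Any:
    """Hypothetical helper: send a pandas DataFrame to Spark and back.

    `spark` is expected to be a pyspark.sql.SparkSession and `pdf` a
    pandas DataFrame (a NumPy ndarray is also accepted by createDataFrame).
    """
    # Ask Spark to use Arrow for the conversions below.
    spark.conf.set(ARROW_CONF, "true")
    # pandas -> Spark: SparkSession.createDataFrame
    sdf = spark.createDataFrame(pdf)
    # Spark -> pandas: DataFrame.toPandas
    return sdf.toPandas()
```

With a real session, the helper might be called as arrow_round_trip(SparkSession.builder.getOrCreate(), pd.DataFrame({"id": [1, 2, 3]})), which requires pyspark, pandas, and pyarrow to be installed.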
The following data types are unsupported: ArrayType of TimestampType.