Skip to content


DataFrame is a Python class with PandasMapOpsMixin and PandasConversionMixin mixins.

DataFrame lives in pyspark.sql.dataframe module (together with DataFrameNaFunctions and DataFrameStatFunctions).

from pyspark.sql.dataframe import DataFrame

Creating Instance

DataFrame takes the following to be created:


groupBy(self, *cols)

groupBy requests the _jdf to groupBy and creates a GroupedData with it.

Last update: 2021-03-03