DataFrame¶
DataFrame
is a Python class with PandasMapOpsMixin and PandasConversionMixin mixins.
DataFrame
lives in pyspark.sql.dataframe
module (together with DataFrameNaFunctions
and DataFrameStatFunctions
).
from pyspark.sql.dataframe import DataFrame
Creating Instance¶
DataFrame
takes the following to be created:
- jdf
- SQLContext
groupBy¶
groupBy(self, *cols)
groupBy
requests the _jdf to groupBy
and creates a GroupedData with it.
Last update: 2021-03-03