PySpark MLlib

PySpark MLlib is the Python module for working with Spark MLlib and its DataFrame-based machine learning pipelines.

from pyspark.ml import *