PySpark on YARN in self-contained environments Author: https://github.com/seanorama Note: This was tested on HDP 3.1. It may not work with other Spark/YARN distributions. Reference: https://community.cloudera.com/t5/Community-Articles/Using-VirtualEnv-with-PySpark/ta-p/245905 https://community.cloudera.com/t5/Community-Articles/Running-PySpark-with-Conda-Env/ta-p/247551