pysparkEasy Playground

Python API for Apache Spark distributed data processing

Getting started with pysparkRun locally
Install
pip install pyspark
Python CodeRun locally
Expected Output
# Expected output shown below
# (Run locally with: pyspark)

pyspark is a third-party package. Python API for Apache Spark distributed data processing. Install with: pip install pyspark

Challenge

Try modifying the code above to explore different behaviors. Can you extend the example to handle a new use case?