Pilot Flying J Interview Question

How would you handle distributed data processing in PySpark?