Wykonując testy, warto mieć pod ręką coś małego, co powinno zawsze zadziałać. Oto moja propozycja:
import findspark
findspark.init()
findspark.find()
from pyspark.sql import SparkSession
# Create a SparkSession object
spark = SparkSession.builder.appName("CreateDataFrame").getOrCreate()
# Use the SparkSession object to create a DataFrame
df_day_of_week = spark.createDataFrame([(0, "Sunday"), (1, "Monday"),
(2, "Tuesday"), (3, "Wednesday"),
(4, "Thursday"), (5, "Friday"),
(6, "Saturday")],
["day_of_week_num", "day_of_week"])
# Show the DataFrame
df_day_of_week.show()
Snippet pochodzi z https://stackoverflow.com/questions/76743484/configuration-of-pyspark-py4jjavaerror