I have a dataset of 3 items. I call a function on each item using map() but the function is never called.
object MyProgram { val events = Seq("A","B","C") def main(args: Array[String]): Unit = { val spark = SparkSession .builder .appName("MyApp") .config("spark.master", "local") .getOrCreate() import spark.implicits._ val eventsDS = events.toDS() System.out.println("Before") val tempDS = eventsDS.rdd.map(x => doSomething(x)) System.out.println("After") } def doSomething(event: String) : Unit = { System.out.println("Do Something!") } } Output:
Before
After
eventsDS.rdd.collect.map(x => doSomething(x))