Below is the content of my CSV file:

    A1,B1,C1
    A2,B2,C2,D1
    A3,B3,C3,D2,E1
    A4,B4,C4,D3
    A5,B5,C5,,E2

So there are up to 5 columns in the file, but the first row has only 3 values.
I read it using the following command:

    val csvDF : DataFrame = spark.read
      .option("header", "false")
      .option("delimiter", ",")
      .option("inferSchema", "false")
      .csv("file.csv")

And the following is what I get from csvDF.show():

    +---+---+---+
    |_c0|_c1|_c2|
    +---+---+---+
    | A1| B1| C1|
    | A2| B2| C2|
    | A3| B3| C3|
    | A4| B4| C4|
    | A5| B5| C5|
    +---+---+---+

Only the first 3 columns are read, and the values in the 4th and 5th columns are dropped. How can I read all the data in all the columns?
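
One possible direction, sketched here under assumptions rather than as a confirmed fix: without a header or a schema, Spark appears to take the column count from the first line, so passing an explicit schema as wide as the widest row may help. The sketch below assumes the maximum width is known in advance to be 5, reuses the default-style column names `_c0` to `_c4`, and uses a hypothetical object name `ReadRaggedCsv` and a local master purely for illustration.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.types.{StringType, StructField, StructType}

// Hypothetical wrapper object; in spark-shell the body of main can be pasted directly.
object ReadRaggedCsv {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("ragged-csv")
      .master("local[*]") // assumption: local run, adjust for a cluster
      .getOrCreate()

    // Assumption: the widest row in file.csv has 5 fields.
    val numCols = 5

    // Build a schema of 5 nullable string columns named _c0 .. _c4.
    val schema = StructType(
      (0 until numCols).map(i => StructField(s"_c$i", StringType, nullable = true))
    )

    // Same read as in the question, but with the explicit schema supplied
    // instead of letting Spark derive the columns from the first line.
    val csvDF: DataFrame = spark.read
      .option("header", "false")
      .option("delimiter", ",")
      .schema(schema)
      .csv("file.csv")

    csvDF.show()

    spark.stop()
  }
}
```

With the default parse mode (PERMISSIVE), rows shorter than the schema should come back with null in the missing trailing columns. If the maximum width is not known up front, it would have to be computed first, for example by reading the file as text and counting delimiters per line.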