Fetching data via JDBC using pyspark
https://spark.apache.org/docs/latest/sql-data-sources-jdbc.html
Data used - academia.stackexchange.com.7z from https://archive.org/details/stackexchange
Data from multiple tables are there in academia.stackexchange.com folder. I used PostHistory.xml.