Monthly Archive: November 2018
JDBC in Spark SQL
Apache Spark has very powerful built-in API for gathering data from a relational database. Effectiveness and efficiency, following the usual Spark approach, is managed in a transparent way....
Machine Learning: Logistic Regression using Apache Spark
In this blog post, I’ll help you get started using Apache Spark’s spark.ml Logistic Regression for predicting whether or not someone makes more or less than $50,000. Classification Classification...