Simple Spark ml pipeline

Mediative recently hosted a Apache Spark Montreal Meetup’s project night where some of us decided to create a simple ML pipeline. To spare the installation of Spark, we used the Databricks community edition. Since the goal was to see if we could make it work, we wanted to use data that we knew was correlated. But to make the project a little more fun, we decided to explore something else than the usual data sets so we went for the Dow Jones and Nasdaq. »