Valerii Veseliak – Introduction to scalable Machine learning pipelines with Apache Spark


Apache Spark is a famous framework for working with Big Data. In this presentation, we will cover some background about the main concepts of Apache Spark and Machine Learning. During the presentation, we will review common machine learning and statistical algorithms that are implemented in Spark. Also, we will talk about how to use Spark to build scalable machine learning pipelines with Spark MLLib.

Leave a Reply

Your email address will not be published. Required fields are marked *