Spark Roadmap

How to learn Spark?

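Apache Spark is a lightning-fast cluster computing framework designed for fast computation.

It builds on the ideas of Hadoop MapReduce and extends that model to more kinds of computation, such as interactive queries and stream processing. It runs on its own engine rather than on top of MapReduce.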

Today, many companies, such as Visa, Microsoft, and Cisco, use Spark.

Now, let’s get started with our roadmap.

Prerequisites: Before starting with Spark, it is advised to have some basic knowledge of Scala programming, database concepts, and Linux.

Step1:

Get familiar with Spark. Install it on your system.

Step2:

Learn about RDDs (Resilient Distributed Datasets) and how they make interactive operations faster than on MapReduce.
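
To make this step concrete, here is a minimal sketch of RDD operations. It uses PySpark, Spark's Python API, purely as an assumption for illustration; the same calls exist in Scala. The helper names tokenize and word_counts are made up for this example, and sc is assumed to be an existing SparkContext, such as the one the pyspark shell provides.

```python
def tokenize(line):
    """Split a line into lowercase words (plain Python, no Spark needed)."""
    return line.lower().split()


def word_counts(sc, lines):
    """Count words using RDD transformations followed by one action.

    `sc` is assumed to be an existing pyspark.SparkContext.
    """
    rdd = sc.parallelize(lines)                      # build an RDD from a local list
    pairs = rdd.flatMap(tokenize).map(lambda w: (w, 1))   # lazy transformations
    counts = pairs.reduceByKey(lambda a, b: a + b)   # still lazy, nothing has run yet
    return dict(counts.collect())                    # action: triggers the actual job
```

In the pyspark shell, calling word_counts(sc, ["spark is fast", "spark is fun"]) should give a count of 2 for both "spark" and "is". The point to notice is that flatMap, map, and reduceByKey only describe the work; nothing is computed until the collect action is called.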

Step3:

Understand Spark core programming and how to deploy a Spark application.
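
As a rough idea of what a deployable application looks like, here is a small sketch of a standalone PySpark script that counts the lines in a file. The file name line_count.py, the app name "LineCount", and the helper parse_input_path are assumptions made for this example, not part of any official template.

```python
import sys


def parse_input_path(argv):
    """Return the input path from argv, or None if it is missing."""
    return argv[1] if len(argv) == 2 else None


def main(argv):
    path = parse_input_path(argv)
    if path is None:
        print("usage: spark-submit line_count.py <input-file>")
        return 1

    # Imported here so the argument check works even without Spark installed.
    from pyspark import SparkConf, SparkContext

    conf = SparkConf().setAppName("LineCount")   # master is supplied by spark-submit
    sc = SparkContext(conf=conf)
    try:
        total = sc.textFile(path).count()        # action: reads the file, counts lines
        print(f"{path}: {total} lines")
    finally:
        sc.stop()                                # release the application's resources
    return 0


if __name__ == "__main__":
    main(sys.argv)
```

Saved as line_count.py, it could be launched locally with spark-submit --master local[2] line_count.py input.txt. Leaving the master out of the code is a deliberate choice here: the same script can then be submitted to a cluster just by changing the --master option.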

Step4:

Understand shared variables such as broadcast variables and accumulators.
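
The sketch below shows how the two fit together, again in PySpark as an assumption for illustration. A broadcast variable ships a read-only lookup table to the executors, and an accumulator counts the codes that could not be resolved. The function names and the country-code table are hypothetical, and sc is assumed to be an existing SparkContext.

```python
def lookup_country(code, table):
    """Map a country code to its name, or None if unknown (plain Python)."""
    return table.get(code)


def resolve_codes(sc, codes, table):
    """Resolve codes on the cluster and count the unknown ones.

    `sc` is assumed to be an existing pyspark.SparkContext.
    """
    lookup = sc.broadcast(table)   # read-only value cached on each executor
    unknown = sc.accumulator(0)    # counter that tasks can only add to

    def resolve(code):
        name = lookup_country(code, lookup.value)
        if name is None:
            unknown.add(1)         # tasks add; only the driver reads the total
        return name

    names = sc.parallelize(codes).map(resolve).collect()   # action runs the tasks
    return names, unknown.value    # read the accumulator after the action finishes
```

One caveat worth knowing: an accumulator updated inside a transformation like map can be incremented more than once if Spark re-runs a task, so treat such counts as approximate diagnostics rather than exact results.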

Step5:

Whoa! We are more than halfway done with Spark. Now is the time to go a little deeper into some very important programming concepts. Start with learning object-oriented programming (OOP), regular expressions (RegEx), network programming, and CGI programming.

Step6:

We are done with Spark. Keep exploring to get better at it.