Submitted by sneha on Sun, 06/04/2017 - 18:59
Apache Spark is an open-source cluster-computing framework. Apart from Spark core, we will also learn about its components such as Spark SQL, Spark Streaming, Mlib and GraphX. Though Spark can be used with Hadoop, it can also be used without. So for most notes, knowlwdge of Hadoop is not required.
Submitted by heartin on Wed, 02/01/2017 - 23:23
Get started learning about Hadoop and its ecosystem components through simple theory and Hands on exercises.
Submitted by heartin on Wed, 02/01/2017 - 21:27
Here I will include notes on Big Data and Data Science concepts in general. There will be separate books on specific technologies like Hadoop.