Tag: action

Spark DataFrame
By: Date: August 7, 2017 Categories: Apache Spark Tags: , , , , ,

Apache Spark DataFrame So, lets recall RDD(Resilient Distributed Datasets)? It is an immutable distributed collection of objects, it is an Interface. OK! we have also seen how to apply transformations in previous post. They are amazing! as they give us all the flexibility to deal with almost any kind of data; unstructured, semi structured and structured…

Read More →
Apache Spark
By: Date: July 26, 2017 Categories: Apache Spark Tags: , , , , , ,

If you haven’t read the previous article about MapReduce, I’d highly recommend reading it because that will set a good foundation to appreciate Sparks existence. Apache Spark – Introduction I want to get to the practical exercises quickly and I think there are enough resources on the internet to explain theoretical view of the framework….

Read More →