Friday, 18 January 2019

Data Storage Objects in Spark

Data Storage Objects in Spark:
Spark Core : RDD (Unstructured files)
SparkSQL : DataFrame, DataSet (Semi, Structured files)
SparkStreaming : DStream (Streaming Applications)
Spark MLLib : Vectors
Spark GraphX : Graph Objects

No comments:

Post a Comment

Flume - Simple Demo

// create a folder in hdfs : $ hdfs dfs -mkdir /user/flumeExa // Create a shell script which generates : Hadoop in real world <n>...