Data Storage Objects in Spark:
Spark Core : RDD (unstructured data)
Spark SQL : DataFrame, Dataset (semi-structured and structured data)
Spark Streaming : DStream (streaming applications)
Spark MLlib : Vectors
Spark GraphX : Graph objects
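
To make the list above a little more concrete, here is a minimal sketch that builds an RDD, a Dataset, a DataFrame and an MLlib vector in one small program. It is only an illustration under a few assumptions: the `Emp` case class, the `parse` helper, the sample records and the `local[*]` master are all hypothetical and not taken from any real job. DStream and GraphX objects are left out to keep the example short.

```scala
import org.apache.spark.ml.linalg.{Vector, Vectors}
import org.apache.spark.rdd.RDD
import org.apache.spark.sql.{DataFrame, Dataset, SparkSession}

// Hypothetical record type, used only for this illustration.
case class Emp(ename: String, sal: Int)

object StorageObjectsSketch {

  // Hypothetical helper: turns a "name,salary" line into an Emp.
  def parse(line: String): Emp = {
    val parts = line.split(",")
    Emp(parts(0).trim, parts(1).trim.toInt)
  }

  def main(args: Array[String]): Unit = {
    // Assumption: run locally; a real cluster would set the master elsewhere.
    val spark = SparkSession.builder()
      .appName("StorageObjectsSketch")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Spark Core: an RDD holding raw, unstructured text lines.
    val lines: RDD[String] =
      spark.sparkContext.parallelize(Seq("SCOTT,3000", "KING,5000"))

    // Spark SQL: a typed Dataset built from the parsed RDD.
    val emps: Dataset[Emp] = lines.map(parse).toDS()

    // Spark SQL: a DataFrame is the untyped view (Dataset[Row]) of the same data.
    val df: DataFrame = emps.toDF()

    // Spark MLlib: a dense vector of numeric features.
    val salaries: Vector = Vectors.dense(emps.collect().map(_.sal.toDouble))

    df.printSchema()
    println(s"rows = ${lines.count()}, vector size = ${salaries.size}")

    spark.stop()
  }
}
```

The same records move from an RDD (no schema) to a Dataset and DataFrame (with a schema), which is the main practical difference between the Spark Core and Spark SQL objects.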