Ooziee -Aru
-----------
Pre-requisites as follows:
create .sql scripts for this demo:
cat > ddl.sql
create database sakthi;
use sakthi;
create table sakthi2018 (id int, name string);
^C
hdfs dfs -mkdir /user/cloudera/ooziedemo
hdfs dfs -put ddl.sql /user/cloudera/ooziedemo/ddl.sql
cat > dml.sql
insert into sakthi2018(id,name) values(101,'sara');
insert into sakthi2018(id,name) values(102,'mara');
^c
hdfs dfs -put dml.sql /user/cloudera/ooziedemo/dml.sql
copy hive-site.xml to local Desktop:
sudo cp /etc/hive/conf/hive-site.xml /home/cloudera/DeskTop/hive-site.xml
hdfs dfs -put hive-site.xml /user/cloudera/ooziedemo/hive-site.xml
Start Hue in Firefox:
click on Hue link
cloudera : cloudera // username and password
Click on Query -> Scheduler -> WorkFlow
Click on Actions which is hidden behind ^Documents (scroll up to see this within right side pan if it is not visible)
Drag and Drop Hive Action there
select ddl.sql (script)
and hive-site.xml (hive xml)
select dml.sql (script)
and hive-site.xml (hive xml)
click on Jobs - Workflow - Graph
Drag and drop Action
Run,Suspend,Re-run etc
Showing posts with label oozie. Show all posts
Showing posts with label oozie. Show all posts
Monday, 10 December 2018
Subscribe to:
Comments (Atom)
Flume - Simple Demo
// create a folder in hdfs : $ hdfs dfs -mkdir /user/flumeExa // Create a shell script which generates : Hadoop in real world <n>...
-
How to fetch Spark Application Id programmaticall while running the Spark Job? scala> spark.sparkContext.applicationId res124: String = l...
-
// Lead Example // Lead means Next row's salary value spark.sql("SELECT id, fname,lname, designation, technology,salary, LEAD(sal...
-
from pyspark.sql import SparkSession spark = SparkSession.builder.appName("LondonCrimes").getOrCreate() data = spark.read.format(...