Friday, 11 January 2019
Lines and Word Count of a Given File using Scala
package com.spark.scala.learning

import scala.io.Source

object LinesWordsCount {
  def main(args: Array[String]): Unit = {
    // Read the file once, keep its lines in memory, and always close the handle
    val source = Source.fromFile("D:\\iEd\\sample.txt")
    val lines = try source.getLines().toList finally source.close()

    // Number of lines in the file
    val countL = lines.size

    // Split each line on whitespace and ignore empty tokens;
    // split("") would break the line into characters, not words
    val countW = lines.map(_.split("\\s+").count(_.nonEmpty)).sum

    println("The Number of Lines : " + countL)
    println("The Number of words : " + countW)
  }
}
The Number of Lines : 4
The Number of words : 91
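The same counting can also be written as a small reusable helper that works on any iterator of lines, which makes it easy to try out without a file on disk. This is only a sketch: the names LineWordCounter, count and countFile are hypothetical and not part of the original post, and a "word" is assumed to be any whitespace-separated token.

```scala
import scala.io.Source

object LineWordCounter {
  // Hypothetical helper: returns (lineCount, wordCount) in a single pass over the lines.
  def count(lines: Iterator[String]): (Int, Int) =
    lines.foldLeft((0, 0)) { case ((l, w), line) =>
      // Assumption: words are whitespace-separated tokens; blank lines add zero words.
      (l + 1, w + line.split("\\s+").count(_.nonEmpty))
    }

  // Hypothetical wrapper: opens the file at `path`, counts, and always closes it.
  def countFile(path: String): (Int, Int) = {
    val source = Source.fromFile(path)
    try count(source.getLines()) finally source.close()
  }
}
```

For example, LineWordCounter.count(Iterator("hello world", "one two three")) returns (2, 5), and LineWordCounter.countFile("D:\\iEd\\sample.txt") would apply the same logic to the sample file used above.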