Managing Spark Partitions with Coalesce and Repartition
Tag: Spark
Pivot and Unpivot a Spark DataFrame
Pivot: Turn rows into columns. Unpivot: Turn columns into rows.
Spark SQL functions lit() and typedLit()
SPARK – ADDING LITERAL OR CONSTANT TO DATAFRAME USING LIT & TYPEDLIT FUNCTIONS
Spark Performance Tuning
Simple Tips and Tricks to Improve the Performance of your Spark Applications
Big Data File Formats
AVRO vs PARQUET vs ORC
A comparison between RDD, DataFrame and Dataset in Spark
RDD VS DataFrame VS Dataset
groupByKey vs reduceByKey vs aggregateByKey in Apache Spark/Scala
groupByKey() vs reduceByKey() vs aggregateByKey()
Null, null, Nil, Nothing, None, and Unit in Scala
Scala Terminologies: Null, null, Nil, Nothing, None, Unit.
Spark Word Count Example Using Hadoop as File Store
Spark Word Count Program
