RDD recap Spark SQL library Architecture of Spark SQL Comparison with Pig and Hive Pipeline DataFrames Definition of a DataFrames API DataFrames Operations DataFrames features Data cleansing Diagram for logical plan container Plan Optimization & Execution Catalyst Analyzer Catalyst Optimizer Generating Physical Plan Code Generation Extensions