PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training PySpark Dataframe Tutorial
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training Today’s Training Topics ❖ Need for Dataframes ❖ What are Dataframes ❖ Dataframes Features ❖ Sources of Dataframes ❖ Demo
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training Why do we need Dataframes? Processing Structured And Semi-Structured Data
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training Why do we need Dataframes? Processing Structured And Semi-Structured Data Handling Petabytes of Data
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training Why do we need Dataframes? Processing Structured And Semi-Structured Data Handling Petabytes of Data Wide Range of Data Formats and Sources
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training Why do we need Dataframes? Processing Structured And Semi-Structured Data Handling Petabytes of Data Wide Range of Data Formats and Sources Support for Multiple Languages
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training What are Dataframes? 2d labelled Data Structure Similar to SQL
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training Features of Dataframes Distributed Lazy EVALs Immutable
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training Creating a Dataframe(Sources) Dataframe
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training Important Classes • pyspark.sql.SQLContext • pyspark.sql.DataFrame • pyspark.sql.Column • pyspark.sql.Row • pyspark.sql.GroupedData • pyspark.sql.DataFrameNaFunctions • pyspark.sql.DataFrameStatFunctions • pyspark.sql.functions • pyspark.sql.types • pyspark.sql.Window
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training CREATING DATAFRAMES DEMO
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training FIFA WORLD CUP – USE CASEFIFA world Cup Use Case
PYSPARK CERTIFICATION TRAINING www.edureka.co/pyspark-certification-training SUPERHEROS Use Case
PySpark Dataframes Tutorial | Introduction to PySpark Dataframes API | PySpark Training | Edureka

PySpark Dataframes Tutorial | Introduction to PySpark Dataframes API | PySpark Training | Edureka

  • 1.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training PySpark Dataframe Tutorial
  • 2.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training Today’s Training Topics ❖ Need for Dataframes ❖ What are Dataframes ❖ Dataframes Features ❖ Sources of Dataframes ❖ Demo
  • 3.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training Why do we need Dataframes? Processing Structured And Semi-Structured Data
  • 4.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training Why do we need Dataframes? Processing Structured And Semi-Structured Data Handling Petabytes of Data
  • 5.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training Why do we need Dataframes? Processing Structured And Semi-Structured Data Handling Petabytes of Data Wide Range of Data Formats and Sources
  • 6.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training Why do we need Dataframes? Processing Structured And Semi-Structured Data Handling Petabytes of Data Wide Range of Data Formats and Sources Support for Multiple Languages
  • 7.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training What are Dataframes? 2d labelled Data Structure Similar to SQL
  • 8.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training Features of Dataframes Distributed Lazy EVALs Immutable
  • 9.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training Creating a Dataframe(Sources) Dataframe
  • 10.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training Important Classes • pyspark.sql.SQLContext • pyspark.sql.DataFrame • pyspark.sql.Column • pyspark.sql.Row • pyspark.sql.GroupedData • pyspark.sql.DataFrameNaFunctions • pyspark.sql.DataFrameStatFunctions • pyspark.sql.functions • pyspark.sql.types • pyspark.sql.Window
  • 11.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training CREATING DATAFRAMES DEMO
  • 12.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training FIFA WORLD CUP – USE CASEFIFA world Cup Use Case
  • 13.
    PYSPARK CERTIFICATION TRAININGwww.edureka.co/pyspark-certification-training SUPERHEROS Use Case