Databricks and PySpark

20 modules

English, Hindi

Lifetime access

Overview

Learn how to use Databricks and PySpark to process big data and uncover insights. This course covers the basics of distributed computing, cluster management, and data processing with PySpark. You will use Databricks to build a data pipeline and PySpark to transform and analyze data.

Key Highlights:

  • Gain a working knowledge of PySpark and Databricks
  • Create effective data pipelines
  • Use PySpark to transform and analyze large datasets
  • Learn best practices for working with big data

What you will learn:

  • Introduction to Databricks and PySpark
    Understand the basics of distributed computing and cluster management with Databricks and PySpark.
  • Data Processing with PySpark
    Learn how to use PySpark to process and transform data, including SQL, DataFrames, and Spark MLlib.
  • Building Data Pipelines with Databricks
    Learn how to build a data pipeline using Databricks, with Delta Lake and AWS S3 for storage.
  • Advanced Data Analytics with PySpark
    Build more advanced analytics models using PySpark, including time series analysis, recommender systems, and graph processing.

Modules

Introduction to Databricks | How to Set Up an Account |

How to read CSV file in PySpark | Databricks Tutorial |

How to Rename columns in DataFrame using PySpark | Databricks Tutorial |

How to ADD New Columns in DataFrame using PySpark | Databricks Tutorial |

How to filter a DataFrame using PySpark | Databricks Tutorial |

How to Sort a DataFrame in PySpark | Databricks Tutorial |

How to remove Duplicates in DataFrame using PySpark | Databricks Tutorial |

How to use groupBy in DataFrame using PySpark | Databricks Tutorial |

How to write into CSV | Databricks Tutorial |

How to merge two DataFrames using PySpark | Databricks Tutorial |

How to use WHEN Otherwise in PySpark | Databricks Tutorial |

How to join two DataFrames in PySpark | Databricks Tutorial |

How to use Window Functions in PySpark | Databricks Tutorial |

Why use the repartition Method in PySpark | Databricks Tutorial |

How to write a DataFrame with Partitions using partitionBy in PySpark | Databricks Tutorial |

How to create UDF in PySpark | Databricks Tutorial |

How to do casting of Columns in PySpark | Databricks Tutorial |

How to handle NULLs in PySpark | Databricks Tutorial |

How to pivot a DataFrame in PySpark | Databricks Tutorial |

Different read modes when reading a file into a DataFrame using PySpark | Databricks Tutorial |

Free
