|
| 1 | +# Databricks notebook source |
| 2 | +# MAGIC %md-sandbox |
| 3 | +# MAGIC |
| 4 | +# MAGIC <div style="text-align: center; line-height: 0; padding-top: 9px;"> |
| 5 | +# MAGIC <img src="https://databricks.com/wp-content/uploads/2018/03/db-academy-rgb-1200px.png" alt="Databricks Learning" style="width: 600px"> |
| 6 | +# MAGIC </div> |
| 7 | + |
| 8 | +# COMMAND ---------- |
| 9 | + |
| 10 | +# MAGIC %md |
| 11 | +# MAGIC # Apache Spark Programming with Databricks |
| 12 | +# MAGIC ##### Course Duration: 4 Half-Days |
| 13 | +# MAGIC |
| 14 | +# MAGIC This course uses a case study driven approach to explore the fundamentals of Spark Programming with Databricks, including Spark architecture, the DataFrame API, query optimization, and Structured Streaming. You will start by defining Databricks and Spark, recognize their major components, and explore datasets for the case study using the Databricks environment. After ingesting data from various file formats, you will process and analyze datasets by applying a variety of DataFrame transformations, Column expressions, and built-in functions. Next, you will visualize and apply Spark architecture concepts in example scenarios. This will prepare you to explore the Spark UI and how caching, query optimization, and partitioning affect performance. Lastly, you will execute streaming queries to process and aggregate streaming data and learn about Delta Lake. |
| 15 | +# MAGIC |
| 16 | +# MAGIC ## Target Audience |
| 17 | +# MAGIC This introductory-level course is suitable for anyone who wants to learn the fundamentals of programming in Spark, including SQL analysts, data engineers, data scientists, machine learning engineers, and data architects. |
| 18 | +# MAGIC |
| 19 | +# MAGIC ## Requirements |
| 20 | +# MAGIC - Familiarity with basic SQL concepts: select, filter, groupby, join, etc |
| 21 | +# MAGIC - Beginner programming experience with Python: syntax, conditions, loops, functions |
| 22 | +# MAGIC |
| 23 | +# MAGIC ## Course Objectives |
| 24 | +# MAGIC Upon completion of this course, students should be able to meet the following objectives: |
| 25 | +# MAGIC - Identify core features of Spark and Databricks |
| 26 | +# MAGIC - Describe how DataFrames are created and evaluated in Spark |
| 27 | +# MAGIC - Apply the DataFrame API to process and analyze data |
| 28 | +# MAGIC - Demonstrate how Spark is optimized and executed on a cluster |
| 29 | +# MAGIC - Apply Delta and Structured Streaming to process streaming data |
| 30 | +# MAGIC |
| 31 | +# MAGIC ##### Related Certifications |
| 32 | +# MAGIC - [Databricks Certified Associate Developer for Apache Spark 3.0](https://academy.databricks.com/exam/databricks-certified-associate-developer) |
| 33 | + |
| 34 | +# COMMAND ---------- |
| 35 | + |
| 36 | +# MAGIC %md-sandbox |
| 37 | +# MAGIC © 2022 Databricks, Inc. All rights reserved.<br/> |
| 38 | +# MAGIC Apache, Apache Spark, Spark and the Spark logo are trademarks of the <a href="https://www.apache.org/">Apache Software Foundation</a>.<br/> |
| 39 | +# MAGIC <br/> |
| 40 | +# MAGIC <a href="https://databricks.com/privacy-policy">Privacy Policy</a> | <a href="https://databricks.com/terms-of-use">Terms of Use</a> | <a href="https://help.databricks.com/">Support</a> |
0 commit comments