Skip to content

Commit 6975cf2

Browse files
SireInsectusSireInsectus
authored andcommitted
Publishing 1.7
1 parent ba4419b commit 6975cf2

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

50 files changed

+11645
-0
lines changed
Lines changed: 40 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,40 @@
1+
# Databricks notebook source
2+
# MAGIC %md-sandbox
3+
# MAGIC
4+
# MAGIC <div style="text-align: center; line-height: 0; padding-top: 9px;">
5+
# MAGIC <img src="https://databricks.com/wp-content/uploads/2018/03/db-academy-rgb-1200px.png" alt="Databricks Learning" style="width: 600px">
6+
# MAGIC </div>
7+
8+
# COMMAND ----------
9+
10+
# MAGIC %md
11+
# MAGIC # Apache Spark Programming with Databricks
12+
# MAGIC ##### Course Duration: 4 Half-Days
13+
# MAGIC
14+
# MAGIC This course uses a case study driven approach to explore the fundamentals of Spark Programming with Databricks, including Spark architecture, the DataFrame API, query optimization, and Structured Streaming. You will start by defining Databricks and Spark, recognize their major components, and explore datasets for the case study using the Databricks environment. After ingesting data from various file formats, you will process and analyze datasets by applying a variety of DataFrame transformations, Column expressions, and built-in functions. Next, you will visualize and apply Spark architecture concepts in example scenarios. This will prepare you to explore the Spark UI and how caching, query optimization, and partitioning affect performance. Lastly, you will execute streaming queries to process and aggregate streaming data and learn about Delta Lake.
15+
# MAGIC
16+
# MAGIC ## Target Audience
17+
# MAGIC This introductory-level course is suitable for anyone who wants to learn the fundamentals of programming in Spark, including SQL analysts, data engineers, data scientists, machine learning engineers, and data architects.
18+
# MAGIC
19+
# MAGIC ## Requirements
20+
# MAGIC - Familiarity with basic SQL concepts: select, filter, groupby, join, etc
21+
# MAGIC - Beginner programming experience with Python: syntax, conditions, loops, functions
22+
# MAGIC
23+
# MAGIC ## Course Objectives
24+
# MAGIC Upon completion of this course, students should be able to meet the following objectives:
25+
# MAGIC - Identify core features of Spark and Databricks
26+
# MAGIC - Describe how DataFrames are created and evaluated in Spark
27+
# MAGIC - Apply the DataFrame API to process and analyze data
28+
# MAGIC - Demonstrate how Spark is optimized and executed on a cluster
29+
# MAGIC - Apply Delta and Structured Streaming to process streaming data
30+
# MAGIC
31+
# MAGIC ##### Related Certifications
32+
# MAGIC - [Databricks Certified Associate Developer for Apache Spark 3.0](https://academy.databricks.com/exam/databricks-certified-associate-developer)
33+
34+
# COMMAND ----------
35+
36+
# MAGIC %md-sandbox
37+
# MAGIC &copy; 2022 Databricks, Inc. All rights reserved.<br/>
38+
# MAGIC Apache, Apache Spark, Spark and the Spark logo are trademarks of the <a href="https://www.apache.org/">Apache Software Foundation</a>.<br/>
39+
# MAGIC <br/>
40+
# MAGIC <a href="https://databricks.com/privacy-policy">Privacy Policy</a> | <a href="https://databricks.com/terms-of-use">Terms of Use</a> | <a href="https://help.databricks.com/">Support</a>

0 commit comments

Comments
 (0)