Learn Apache Spark Step by Step (Follow the Sequence)
1. Getting started with Apache Spark
https://lnkd.in/gFRpe3-D
2. A quick introduction to the Spark API
https://lnkd.in/g8Y3tdhX
3. Overview of Spark – RDD, accumulators, broadcast variable
https://lnkd.in/g7fepuFF
4. Spark SQL, Datasets, and DataFrames:
https://lnkd.in/g3iZp7zk
5. PySpark – Processing data with Spark in Python
https://lnkd.in/gBnh6PAi
6. Processing data with SQL on the command line
https://lnkd.in/ggnxDaUu
7. Cluster Overview
https://lnkd.in/guCQnJnv
8. Packaging and deploying applications
https://lnkd.in/gUZpi2P9
9. Customize Spark via its configuration system
https://lnkd.in/gZh8Vkmv
10. Monitoring – Track the behavior of your applications
https://lnkd.in/grpGKFuP
11. Best practices to optimize performance and memory use
https://lnkd.in/gTRYBDQu
Credits – Spark Official Docs
#bigdata #dataengineering #apachespark #dataanalytics
Leave a comment