Apache Spark with Python – Big Data with PySpark and Spark
Posted on: February 05, 2018
Online Course Details
Learn Apache Spark and Python through 12+ hands-on examples of analyzing big data with PySpark and Spark.
Created by Level Up Expert Program, Pedro Magalhães Bernardo, Tao W., James Lee
What Will I Learn?
- Get an overview of the architecture of Apache Spark.
- Develop Apache Spark 2.0 applications using RDD transformations and actions and Spark SQL.
- Work with Apache Spark's primary abstraction, resilient distributed datasets (RDDs), to process and analyze large data sets.
- Analyze structured and semi-structured data using DataFrames, and develop a thorough understanding of Spark SQL.
- Apply advanced techniques to optimize and tune Apache Spark jobs by partitioning, caching, and persisting RDDs.
- Scale up Spark applications on a Hadoop YARN cluster through Amazon's Elastic MapReduce service.
- Share information across different nodes on an Apache Spark cluster using broadcast variables and accumulators.
- Write Spark applications using the Python API, PySpark.