Advanced Data Analytics with PySpark
This class introduces participants to the Apache Spark platform, the Spark Shell and Spark SQL for big data processing applications. In addition to the Spark platform, participants will learn fundamental tools in the pandas library and gain experience with data visualization using seaborn.
Audience
This course is suitable for: Business Analysts who want a scalable platform for solving SQL-centric problems.
Prerequisites
Students should have knowledge of SQL, familiarity with Python (or the ability to learn the basics of a new language)
- Introduction to Apache Spark
- The Spark Shell
- Introduction to Spark SQL
- Introduction to pandas
- Data Visualization with seaborn
Is there a discount available for current students?
UMBC students and alumni, as well as students who have previously taken a public training course with UMBC Training Centers are eligible for a 10% discount, capped at $250. Please provide a copy of your UMBC student ID or an unofficial transcript or the name of the UMBC Training Centers course you have completed. Asynchronous courses are excluded from this offer.
What is the cancellation and refund policy?
Student will receive a refund of paid registration fees only if UMBC Training Centers receives a notice of cancellation at least 10 business days prior to the class start date for classes or the exam date for exams.