This 3-day course is equally applicable to data engineers, data scientists, analysts, architects, software engineers, and technical managers interested in a thorough, hands-on overview of Apache Spark.
The course covers the fundamentals of Apache Spark, including Spark’s architecture and internals, the core APIs for using Spark, SQL and other high-level data access tools, and Spark’s streaming capabilities and machine learning APIs. The class is a mixture of lecture and hands-on labs.
Each topic includes lecture content along with hands-on labs in the Databricks notebook environment. Students may keep the notebooks and continue to use them with the free Databricks Community Edition offering after the class ends; all examples are guaranteed to run in that environment.
After taking this class, students will be able to:
This course is intended for data engineers and data scientists interested in the most current technologies, analysts and BI professionals with basic coding skills, and developers looking to specialize in big data. The course material is written in Python, but you don’t have to be an expert to follow and understand it. The course is also a good fit for IT managers who want a better understanding of Apache Spark and the capabilities it can deliver.
Databricks’ vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache® Spark™, a powerful open source data processing engine built for sophisticated analytics, ease of use, and speed. Databricks is the largest contributor to the open source Apache Spark project, providing 10x more code than any other company. The company has also trained over 20,000 users on Apache Spark and has the largest number of customers deploying Spark to date. Databricks provides a just-in-time data platform to simplify data integration, real-time experimentation, and robust deployment of production applications.
If you have questions about the course or would like to register, feel free to contact us here:
|9:00-10:30|Morning session 1|
|10:45-12:00|Morning session 2|
|13:00-14:15|Afternoon session 1|
|14:30-16:00|Afternoon session 2|
Zoltán Tóth is Principal Instructor at Databricks, the company founded by the original creators of Apache Spark. He has delivered dozens of Spark courses for companies and at major conferences worldwide, such as Strata and Spark Summit. He is also a contributor to Databricks’s Official Spark Courseware, with a special focus on Machine Learning topics. Prior to teaching Apache Spark, Zoltán worked with big data architectures and distributed systems as a Senior Engineer at RapidMiner and an Engineering Manager at Prezi.