The Spark of Big Data: An Introduction to Apache Spark
Pasha Finkelshteyn
Get ready to level up your big data processing skills! Join us for an introductory talk on Apache Spark, the distributed computing system used by tech giants like Netflix and Amazon. We'll cover PySpark DataFrames and how to use them. Whether you're a Python developer new to big data or looking to explore new technologies, this talk is for you. You'll gain foundational knowledge of Apache Spark and its capabilities, and learn how to use the DataFrame and SQL APIs to process large amounts of data efficiently. Don't miss this opportunity to up your big data game!
Pasha Finkelshteyn
Affiliation: JetBrains
Pasha Finkelshteyn is a developer advocate for data engineering at JetBrains with more than a decade of experience in the industry. He has a passion for making big data processing accessible to all. He spent most of his career working with the JVM, then switched to data engineering, where he discovered the power of Python.