Learn from Spark veteran Alex Pierce how to manage the challenges of maintaining the performance and usability of your Spark jobs. Apache Spark gives enterprises more sophisticated ways to leverage big data than Hadoop. However, the volume of data analyzed and processed through the framework is massive and continues to push the boundaries of the engine. This webinar draws on experience across dozens of production deployments and explores best practices for managing Apache Spark performance. Learn how to avoid common mistakes and improve the usability, supportability, and performance of Spark. Topics include serialization, partition sizes, executor resource sizing, and DAG management.
Posted: Dec 22, 2020
Published: Dec 22, 2020
Type: Replay

This resource is no longer available.