Benchmarking Apache Spark on a Single Node Machine



https://databricks.com/blog/2018/05/03/benchmarking-apache-spark-on-a-single-node-machine.html (The Databricks Blog)

In this blog, we will demonstrate the merits of single-node computation using PySpark and share our observations. Through experimentation, we'll show why you may want to use PySpark instead of Pandas for large datasets that exceed a single-node machine's memory.



Original link: Benchmarking Apache Spark on a Single Node Machine