Amazon EMR - Is Spark dataframe.cache() stored in core nodes only, or also on task nodes?

228 Views Asked by itaifrenkel At 15 June 2018 at 07:22

I'm running on AWS EMR Spark (v2) and calculated a large dataframe. When the dataframe.cache() (which is lazy) kicks-in would it be performed on the task nodes, or would the data frame be moved around to the core nodes for caching?

Original Q&A

Amazon EMR - Is Spark dataframe.cache() stored in core nodes only, or also on task nodes?

There are 0 best solutions below

Related Questions in APACHE-SPARK

Related Questions in APACHE-SPARK-SQL

Related Questions in EMR

Trending Questions

Popular # Hahtags

Popular Questions