Planning to use Dataproc cluster with Dataproc Metastore.
Questions:
- when we delete Dataproc clusters, would Dataproc Metastore continue to remain available?
- Can two different Dataproc clusters in same region, use same Dataproc Metastore?
- Can two different Dataproc clusters on different region, use same Dataproc Metastore?
Posting this as a community wiki:
As what Dagang said, the answer is to use an external metastore that persists outside of the cluster lifecycle. You can do this in two ways:
Using Dataproc Metastore Service
Using Cloud SQL proxy