I was recently introduced to Spark SQL. We use Spark 2.4. I found out that Spark SQL queries support the following hints for join strategies:
- BROADCAST hint
- MERGE hint
- SHUFFLE_HASH hint
Unfortunately, I have not found any online material that discusses these hints and their application scenarios in detail. I would like to learn when to use each hint in a join to improve query performance.
Can anyone explain with some examples? Any help is appreciated. Thanks.
There are some posts on this; I hope they help you: Spark SQL Joins Sort-Merge Join
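To make the three hints concrete, here is a minimal sketch of how they look inside a query. The tables `orders` and `customers`, their columns, and the helper names `join_with_hint` and `explain_all` are all made up for illustration. One caveat, as far as I know: the MERGE and SHUFFLE_HASH hints were only added in Spark 3.0, so on Spark 2.4 only the BROADCAST hint (aliases BROADCASTJOIN and MAPJOIN) should take effect.

```python
# Sketch only: table names, column names and helper names are hypothetical.

def join_with_hint(hint, hinted_alias):
    """Build a join query that applies `hint` to the relation `hinted_alias`."""
    return (
        f"SELECT /*+ {hint}({hinted_alias}) */ o.order_id, c.name "
        "FROM orders o JOIN customers c ON o.customer_id = c.customer_id"
    )

# BROADCAST: ship the small `customers` table to every executor so the
# large `orders` table does not have to be shuffled.
broadcast_query = join_with_hint("BROADCAST", "c")

# MERGE: ask for a sort-merge join (shuffle and sort both sides);
# usually the choice when both tables are large.
merge_query = join_with_hint("MERGE", "o")

# SHUFFLE_HASH: shuffle both sides, then build a hash table per partition;
# can help when one side is much smaller than the other but too large to broadcast.
shuffle_hash_query = join_with_hint("SHUFFLE_HASH", "c")


def explain_all(spark):
    """Print the physical plan for each hinted query.

    `spark` is assumed to be an existing pyspark.sql.SparkSession.
    """
    # Tiny in-memory tables so the queries can be planned.
    spark.createDataFrame(
        [(1, 10), (2, 20)], ["order_id", "customer_id"]
    ).createOrReplaceTempView("orders")
    spark.createDataFrame(
        [(10, "Ann"), (20, "Bob")], ["customer_id", "name"]
    ).createOrReplaceTempView("customers")
    for query in (broadcast_query, merge_query, shuffle_hash_query):
        spark.sql(query).explain()
```

If you call `explain_all(spark)` with your own session, the printed plans should show which join operator Spark actually picked, which is the easiest way to check whether a hint was honored on your version.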