Performance benefits of predict_batch_udf over a Pandas UDF?

119 Views Asked by David At 23 October 2023 at 17:20

Are there any performance benefits to using predict_batch_udf over creating a pandas_udf?

It does expose a batch_size parameter but is this any different from adjusting spark.sql.execution.arrow.maxRecordsPerBatch to control the Pandas UDF batch sizes?

Original Q&A

Performance benefits of predict_batch_udf over a Pandas UDF?

There are 0 best solutions below

Related Questions in PYSPARK

Related Questions in APACHE-SPARK-MLLIB

Trending Questions

Popular # Hahtags

Popular Questions