2gen Google Cloud Functions CPU allocation

Question

2gen Google Cloud Functions CPU allocation

28 Views Asked by Matteo Rossi At 23 March 2024 at 09:52

I've migrated an event triggered Google Cloud Function to gen 2 and I'm facing a lot of problems. The function takes care to print a PDF using Puppeteer. I've solved all issues related to Puppeteer install and managed to add await before all async functions in order to wait for termination of each secondary task before moving to the next one.

Even so I reach a point where the call to a function to render an Handlebars template takes more than 1 minute vs 0.5s in 1gen version. It's a sync operation. Subsequent call to Puppeteer for PDF printing miserably fails.

Now I've found a solution by turning on "CPU always allocated" flag in Cloud Run but I'm afraid that it might be too expensive and can't see why it can't work as 1gen functions. I've configured each instance to receive no more than 1 request. What should I expect when more requests are received? Would each one cold-start a new instance? How long would each instance last? Are instances automatically terminated if no traffic is received? If so what does "CPU always allocated" mean?

Original Q&A

There are 1 best solutions below

**Doug Stevenson** · Answer 1 · 2024-03-23T13:59:22.543000

What should I expect when more requests are received? Would each one cold-start a new instance?

Yes, that's how managed serverless backends work. See the documentation for details.

How long would each instance last?

As long as the cloud provider wants them to last. You don't get to configure that. You are allowing the cloud provider to make a good decision.

Are instances automatically terminated if no traffic is received?

Yes, that's how managed serverless backends work. The documentation says "An instance will never stay idle for more than 15 minutes after processing a request unless it is kept active using minimum instances."

If so what does "CPU always allocated" mean?

Start with the documentation:

By default, Cloud Run instances are only allocated CPU during request processing, container startup and shutdown. (Refer to instance lifecycle). You can change this behavior so CPU is always allocated and available even when there are no incoming requests. Setting the CPU to be always allocated can be useful for running short-lived background tasks and other asynchronous processing tasks.

The documentation is suggesting that the CPU for a server instance can be shut down when there are no requests currently being processed on that instance, which is intended to save time and resources.

2gen Google Cloud Functions CPU allocation

There are 1 best solutions below

Related Questions in GOOGLE-CLOUD-FUNCTIONS

Related Questions in GOOGLE-CLOUD-RUN

Trending Questions

Popular # Hahtags

Popular Questions