-
Notifications
You must be signed in to change notification settings - Fork 3
Description
This was very unexpected and I did not see this anywhere when implementing API Gateway #36...
It seems like there's a default timeout for 60 seconds and a maximum of 60 seconds for API Gateway. This is a HUGE problem because some services take longer like inference
because it runs GPU computation. while the Cloud Run can be run for longer up to 3600s timeout, the gateway does not wait and cuts the request.
THIS IS VERIFIED TO BE A BLOCKING ISSUE on various endpoints, mostly inference and preprocessing. Both services produced valid response (seen in logs), but it breaks on frontend / api testing because the gateway already cut the request...
I do not know how to fix this to be honest... 😢 😢 😢 might need to remove the entire gateway if this is the case, i don't have a choice here if this is the case, I will just build another gateway with cloud endpoints on our own managed Cloud Run router to do auth and user IAM to do trigger similarly