Deploy spacy serverless #11494

jbingel · 2022-09-13T13:31:08Z


Hi all,

What's my best bet for deploying spaCy in the cloud for real-time inference with a serverless solution?

Basically, I want an endpoint similar to what AWS offers with an EC2 or SageMaker instance, but at the lowest possible cost and with autoscaling (down to zero when there is no traffic).

I can tolerate a few seconds of delay for cold starts (while the model loads), but after that inference should have basically no latency (hence AWS Lambda is not an option). I have looked into SageMaker Serverless Inference, but I find it quite poorly documented and I'm not sure whether it applies to my use case.

Looking forward to your input!

Answered by polm


polm · 2022-09-15T07:04:55Z


Serverless use of spaCy is pretty common, but we don't actually have any particular recommendations for cloud providers. spaCy's needs shouldn't be too idiosyncratic, so it sounds like for your goals you might be better off asking at a Stack Exchange site (not sure exactly which one) or Reddit or something. Sorry we can't provide more help!

1 reply

jbingel Sep 15, 2022
Author

Gotcha, thanks!
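
The cold-start trade-off described in the question (pay the model load once, then serve later requests quickly) is commonly handled by loading the pipeline lazily and keeping it in memory for the lifetime of the worker. Below is a minimal sketch of that pattern, not a recommendation from the thread. The `LazyPipeline` class and `fake_loader` function are hypothetical names introduced here for illustration, and the loader is a stand-in so the example runs without spaCy installed.

```python
import time
from typing import Any, Callable, Optional


class LazyPipeline:
    """Load an expensive pipeline on first use, then reuse it (hypothetical helper)."""

    def __init__(self, loader: Callable[[], Callable[[str], Any]]) -> None:
        self._loader = loader
        self._nlp: Optional[Callable[[str], Any]] = None
        self.load_count = 0  # how many times the loader actually ran

    def __call__(self, text: str) -> Any:
        # Cold start: the first request pays the loading cost.
        if self._nlp is None:
            self._nlp = self._loader()
            self.load_count += 1
        # Warm path: later requests reuse the in-memory pipeline.
        return self._nlp(text)


def fake_loader() -> Callable[[str], list]:
    # Stand-in for a slow model load; an assumption made for this sketch.
    time.sleep(0.05)
    return lambda text: text.split()


pipeline = LazyPipeline(fake_loader)
first = pipeline("spaCy runs in the cloud")           # triggers the load
second = pipeline("second request reuses the model")  # no reload
```

With spaCy installed, the loader could instead be something like `lambda: spacy.load("en_core_web_sm")`, assuming that model package is present in the deployment image. Whether the loaded pipeline survives between requests depends on how long the chosen platform keeps a worker alive.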
