I’ve gotten this question a few times lately so let’s talk it out.
You’ve got your Glassware, and it’s listening for pings because you’ve got a subscription you want to work with. You get one of said pings and you need to do some work. But maybe that work takes a long time, because your Glassware is super popular or you have some very long-running calculation. That makes the Mirror API angry, and you start getting multiple requests for the same ping.
What’s going on?
Mirror has a specific requirement: you must respond to the ping with a 200 status as quickly as possible. If you wait too long (as in 10 seconds), it presumes the request failed and retries.
10 seconds sounds like a long time, but maybe not if your app is getting hammered.
What to do? You need to offload that task and respond with a 200 status.
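Here is a minimal sketch of that "acknowledge now, work later" shape, using only the Python standard library. The names `handle_ping` and `slow_work` are made up for illustration, and the in-process queue is just a stand-in for a real queue service.

```python
import queue
import threading

jobs = queue.Queue()   # in-process stand-in for a real task queue
results = []           # stand-in for wherever the finished work ends up


def slow_work(ping_body):
    # Placeholder for the long-running part (hypothetical).
    results.append(ping_body.upper())


def worker():
    # Pull pings off the queue and work them, one at a time.
    while True:
        body = jobs.get()
        try:
            slow_work(body)
        except Exception:
            pass  # a real worker would log this and decide whether to retry
        finally:
            jobs.task_done()


def handle_ping(ping_body):
    # Queue the work and answer right away; nothing slow happens here.
    jobs.put(ping_body)
    return 200


threading.Thread(target=worker, daemon=True).start()
```

An in-memory queue like this loses its contents if the process restarts, which is one reason to reach for a proper queue service instead.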
The App Engine way: Task Queue
Presuming you’re on Google App Engine, you have something readily at your fingertips to help you: the task queue. In this case we’re using a push queue.
A task on the queue can run for as long as 10 minutes. The handler that works the task still needs to send a 200-ish response (200-299 is considered “all good” by task queue standards).
You can make task queues as complicated as you like, but let’s keep it simple and use the default task queue and hand off a ping from Mirror to said task queue. The example Python below takes the whole ping request, sets it as the payload and sends it along to be worked.
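This is a minimal sketch of the hand-off, not a definitive implementation. It assumes the legacy App Engine Python runtime, where `taskqueue.add` from `google.appengine.api` puts a task on the default push queue. The worker path `/worker/ping` and the `fetch_attachment` and `store` callables are hypothetical placeholders for your own routing, Mirror client and Cloud Storage code. The enqueue function is injectable so the sketch can be exercised off-platform.

```python
import json

WORKER_URL = "/worker/ping"  # hypothetical path; route it to your worker handler


def enqueue_ping(raw_body, add_task=None):
    """Ping side: hand the raw Mirror ping to the default push queue."""
    if add_task is None:
        # Legacy App Engine Python runtime. Imported lazily so the sketch
        # can be tried elsewhere with a stand-in for add_task.
        from google.appengine.api import taskqueue
        add_task = taskqueue.add
    add_task(url=WORKER_URL, payload=raw_body)
    return 200  # tell Mirror the ping was received


def process_ping(raw_body, fetch_attachment, store):
    """Task side: do the slow work, then return a 2xx status."""
    ping = json.loads(raw_body)
    item_id = ping["itemId"]
    # Both callables are assumptions standing in for real Mirror and
    # Cloud Storage calls.
    data = fetch_attachment(ping["userToken"], item_id)
    store(item_id, data)
    return 200
```

In a request handler, you would call `enqueue_ping` with the request body and set the response status to whatever it returns. The handler mapped to `WORKER_URL` would then call `process_ping` with the task's body.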
This example isn’t terribly taxing; it just pulls an attachment and sends it to Google Cloud Storage. The point is that we won’t run into that 10-second time limit from Mirror, and we can do lots of cool scaling things.
Other ways for such things
App Engine isn’t the only game in town when it comes to task queues.
Want the same thing on Amazon? You’d use Amazon Simple Workflow (Amazon SWF). You could reasonably do the same thing with SQS, but SWF is a closer fit. I’ll have to write up an example for it, since it requires a little more than just defining a yaml file and a target path.
You could also use the trusted Beanstalkd. I’ve used Beanstalkd for a lot of projects and it works flippin' fantastic. Again, however, you have to roll your own implementation.
In either case, be wary of long-running processes and understand how to offload work as needed, no matter your choice of queue engine.