Advice for managing unresponsive RhinoCompute instances

We have RhinoCompute running under IIS using the recommended install for an Azure VM.

Sometimes (not that uncommonly) a compute.geometry process will hang when trying to calculate something we’ve sent to it. This results in a timeout being returned to our middleware “management” layer. Here’s a screenshot of such an occasion (on this occasion we think it might be a gh file that is hanging it):

What has happened above is that our “management” layer has daftly reattempted the timed-out call and so progressively locked out all the other instances. IIS has then spawned further ones, and they too get locked out by our poor “management” layer reattempting the failed calls! Oops!
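On our side, the obvious first fix is to stop blindly retrying timeouts. Here is a rough sketch of what that could look like, not a tested implementation. Every name in it (`ComputeCallGuard`, `SolveAsync`, `payloadKey`) is made up for illustration and none of it is part of RhinoCompute. It assumes the `send` delegate wraps an `HttpClient` call that honours the cancellation token, and that a timeout means the input itself is the problem, which may not always be true.

```csharp
using System;
using System.Collections.Concurrent;
using System.Net.Http;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical guard for a middleware layer. Nothing here is a RhinoCompute API.
public sealed class ComputeCallGuard
{
    private readonly Func<string, CancellationToken, Task<string>> _send;
    private readonly TimeSpan _timeout;
    private readonly int _maxTransientRetries;

    // Keys of payloads that have already timed out once (assumed to be "poison" inputs).
    private readonly ConcurrentDictionary<string, DateTimeOffset> _quarantined =
        new ConcurrentDictionary<string, DateTimeOffset>();

    public ComputeCallGuard(
        Func<string, CancellationToken, Task<string>> send,
        TimeSpan timeout,
        int maxTransientRetries = 2)
    {
        _send = send ?? throw new ArgumentNullException(nameof(send));
        _timeout = timeout;
        _maxTransientRetries = maxTransientRetries;
    }

    public bool IsQuarantined(string payloadKey) => _quarantined.ContainsKey(payloadKey);

    public async Task<string> SolveAsync(string payloadKey, string payload)
    {
        // Refuse to resend anything that has already hung an instance.
        if (IsQuarantined(payloadKey))
            throw new InvalidOperationException(
                $"Payload '{payloadKey}' timed out earlier and will not be resent.");

        for (int attempt = 0; ; attempt++)
        {
            using var cts = new CancellationTokenSource(_timeout);
            try
            {
                return await _send(payload, cts.Token).ConfigureAwait(false);
            }
            catch (OperationCanceledException)
            {
                // Timeout: quarantine the payload and give up. Never retry.
                _quarantined[payloadKey] = DateTimeOffset.UtcNow;
                throw new TimeoutException(
                    $"Payload '{payloadKey}' exceeded {_timeout.TotalSeconds:0.#}s.");
            }
            catch (HttpRequestException) when (attempt < _maxTransientRetries)
            {
                // Connection-level failure: retry a few times with a short, growing delay.
                await Task.Delay(TimeSpan.FromMilliseconds(200 * (attempt + 1)))
                    .ConfigureAwait(false);
            }
        }
    }
}
```

The idea is that only connection-level failures get retried, while a timeout marks the payload (keyed by something like a hash of the request body) so it is never sent again. This only stops our layer from spreading the damage. It does nothing to free the compute.geometry process that is already stuck.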

Can anyone point me to good practice on how to monitor for unresponsive instances and kill them (C# .NET)? Perhaps there is an endpoint dedicated to this?

I share the following DM, which explains the above behaviour, for the benefit of anyone this is relevant to:
