We eat our own dog food and use our own service to monitor itself. We have reactions setup to restart nginx, rethinkdb, redis, our app and even our servers if it comes down to it. All automagically. Oh, we don't have an on call setup. Runbook takes care of that for us. Runbook