introduce max_req_timeout_count #11

BinChang · 2016-03-01T01:24:32Z

Count # of consecutive req_timeout, and if it is too high, mark it as not healthy. If the endpoint is still legit, pinger will bring it back, otherwise if the endpoint is totally down, the endpoint will stay as unhealthy.

This helps us find dead endpoints whose server box is totally down.

… not healthy. If the endpoint is still ligit, pinger will bring it back, otherwise say the endpoint is totally down, the endpoint will stay as not healthy. This helps us find dead endpoints that the server box is totally down.

BinChang · 2016-03-01T01:26:38Z

@mranney @Raynos @weikai77, This is the draft diff, let me know if you guys have any concerns?

In parallel, I am working on some tests.

BinChang · 2016-03-01T01:29:11Z

@zhijinli this is the diff to handle the dead host issue.

Raynos · 2016-03-01T01:29:47Z

@BinChang please take a look at https://github.com/uber/tchannel-node/blob/469bd9a4e2e6db72864ad6db25c57ed040ae695f/operations.js#L203-L220 for inspiration :)

BinChang · 2016-03-01T01:36:58Z

@Raynos That's the same idea, tChannel manages tcp socket errors directly, but lb_pool runs request through http.

Raynos · 2016-03-01T01:39:03Z

pool_endpoint.js

 PoolEndpoint.prototype.request_succeeded = function (request, response, body) {
    this.successes++;
    this.complete(null, request, response, body);
+    this.reset_req_timeout_count();


Also reset() in the else case in request_failed ?

Raynos reviewed Mar 1, 2016
View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

introduce max_req_timeout_count #11

introduce max_req_timeout_count #11

Uh oh!

BinChang commented Mar 1, 2016

Uh oh!

BinChang commented Mar 1, 2016

Uh oh!

BinChang commented Mar 1, 2016

Uh oh!

Raynos commented Mar 1, 2016

Uh oh!

BinChang commented Mar 1, 2016

Uh oh!

Raynos Mar 1, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

introduce max_req_timeout_count #11

Are you sure you want to change the base?

introduce max_req_timeout_count #11

Uh oh!

Conversation

BinChang commented Mar 1, 2016

Uh oh!

BinChang commented Mar 1, 2016

Uh oh!

BinChang commented Mar 1, 2016

Uh oh!

Raynos commented Mar 1, 2016

Uh oh!

BinChang commented Mar 1, 2016

Uh oh!

Raynos Mar 1, 2016

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants