Enable container to terminate cleanly #125

vorburger · 2022-11-26T02:16:23Z

Could it make sense to improve on the current approach to gracefully stopping a node without receiving a penalty, by enabling the container to terminate cleanly?

As it stands, it keeps running after SIGTERM, so you signal it (e.g. with docker kill), wait a fixed amount of time, and then terminate it with docker stop, which sends SIGTERM again, waits another 10s, and then sends SIGKILL.

If there was any way for it to figure out by itself when it's "done" and then exit, I have a hunch that this, together with #124, could help make #32, #109 and #123 easier and upgrades faster. You could then simply run docker stop --time=900, and those 15 minutes would be an upper bound, not a hard-coded fixed duration anymore.
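To illustrate the difference between a fixed wait and an upper bound, here is a minimal Python sketch of the "signal, wait at most N seconds, then kill" pattern that docker stop follows. It is not code from this project: `stop_with_upper_bound` and the stand-in child process are hypothetical names made up for the example, and the child simply pretends to finish its work 0.2s after SIGTERM.

```python
import signal
import subprocess
import sys
import time


def stop_with_upper_bound(proc: subprocess.Popen, grace_seconds: float) -> float:
    """Ask proc to stop, wait at most grace_seconds, then force-kill it.

    Returns the number of seconds actually spent waiting.
    """
    started = time.monotonic()
    proc.send_signal(signal.SIGTERM)  # polite request first
    try:
        proc.wait(timeout=grace_seconds)  # returns as soon as the process exits
    except subprocess.TimeoutExpired:
        proc.kill()  # SIGKILL once the grace period is used up
        proc.wait()
    return time.monotonic() - started


if __name__ == "__main__":
    # Hypothetical stand-in for the node: finishes its work 0.2s after SIGTERM.
    child_code = (
        "import signal, sys, time\n"
        "def on_term(signum, frame):\n"
        "    time.sleep(0.2)\n"
        "    sys.exit(0)\n"
        "signal.signal(signal.SIGTERM, on_term)\n"
        "print('ready', flush=True)\n"
        "while True:\n"
        "    time.sleep(0.05)\n"
    )
    child = subprocess.Popen([sys.executable, "-c", child_code], stdout=subprocess.PIPE)
    child.stdout.readline()  # wait until the child has installed its handler
    waited = stop_with_upper_bound(child, grace_seconds=900)
    print(f"exited with code {child.returncode} after {waited:.1f}s")
```

With a process that exits on its own once it is done, the 900 seconds are only ever spent in the worst case; with a process that ignores the signal, the full grace period elapses before the kill.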

I don't know enough about the architecture yet, but I suspect that at least one part of solving this involves "declining to accept new HTTP requests, while waiting for Nginx to finish serving the ones it currently has in flight".
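The "decline new requests, finish the in-flight ones" idea can be sketched in a few lines of Python. This is only a conceptual illustration, not how Nginx or this project implements it; `DrainGate` and its methods are hypothetical names invented for the example.

```python
import threading


class DrainGate:
    """Tracks in-flight requests and refuses new ones once draining starts."""

    def __init__(self) -> None:
        self._cond = threading.Condition()
        self._in_flight = 0
        self._draining = False

    def try_enter(self) -> bool:
        """Call at the start of a request; False means 'reject, shutting down'."""
        with self._cond:
            if self._draining:
                return False
            self._in_flight += 1
            return True

    def leave(self) -> None:
        """Call when a request has been fully served."""
        with self._cond:
            self._in_flight -= 1
            if self._in_flight == 0:
                self._cond.notify_all()

    def drain(self, timeout: float) -> bool:
        """Stop admitting requests; True if all in-flight ones finished in time."""
        with self._cond:
            self._draining = True
            return self._cond.wait_for(lambda: self._in_flight == 0, timeout=timeout)


if __name__ == "__main__":
    gate = DrainGate()
    gate.try_enter()                           # one request is being served
    threading.Timer(0.1, gate.leave).start()   # it completes 0.1s later
    print(gate.drain(timeout=5))               # waits only as long as needed
    print(gate.try_enter())                    # new requests are now refused
```

The process can exit as soon as `drain` returns, so the timeout acts as an upper bound rather than a fixed delay.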


DiegoRBaquero · 2022-12-01T17:20:46Z

There's no way to do this with only inside-the-container knowledge. Even with a DNS TTL of 2 minutes, changes might not be fully propagated after 15 minutes, so new requests will still be coming in: fewer of them, but not zero.

vorburger · 2022-12-03T14:01:47Z

and then terminate it with docker stop - which will SIGTERM

This ^^^ isn't fully accurate: when we ask to stop this project's container, whether with docker stop or an equivalent on some container orchestration platform, it currently receives a SIGQUIT right away instead of the default SIGTERM, because the Nginx base image in use changes the STOPSIGNAL.

new requests will still be coming in, in less amount, but non-zero.

Maybe this is less of an issue than I originally thought. I think the primary (interesting) overall goal is probably "seamless (and fast) version upgrades" rather than "terminate cleanly for permanent shutdown", and that may be possible with (reliable, fully automated, TBD) "rolling" upgrades...

vorburger · 2022-12-04T18:35:15Z

After giving this much further thought, I now think that what is primarily missing to enable clean rolling updates is a way to signal the container to let Nginx finish serving ongoing requests (just to avoid clients experiencing "Connection reset by peer"), but WITHOUT "Draining server" by deregistering from the Orchestrator (because for a seamless, fully rolling update you would NOT want to do that).

Based on what I've learnt so far, I doubt that is possible as-is today, given that the Shim handles SIGQUIT by immediately calling exit(0) and probably doesn't "propagate" it to Nginx. I haven't fully tested this yet, but I suggest that testing it be the next step on this issue, followed by fixing it (if needed).
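For illustration, here is roughly what "propagating" the stop signal could look like, as a minimal Python sketch. It is not the project's Shim: `run_and_forward` is a hypothetical wrapper that starts a child process (Nginx would play that role in the container), relays SIGQUIT and SIGTERM to it instead of exiting right away, and returns only once the child has exited.

```python
import signal
import subprocess
import sys
from typing import Sequence


def run_and_forward(cmd: Sequence[str],
                    forwarded=(signal.SIGQUIT, signal.SIGTERM)) -> int:
    """Start cmd as a child, relay the given signals to it, return its exit code."""
    child = subprocess.Popen(list(cmd))

    def relay(signum, frame):
        # Pass the signal on instead of exiting immediately, so the child
        # gets the chance to finish what it is doing.
        child.send_signal(signum)

    for sig in forwarded:
        signal.signal(sig, relay)

    # Blocks until the child exits; a relayed signal does not end the wait.
    return child.wait()


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (hypothetical): python wrapper.py <command> [args...]
    sys.exit(run_and_forward(sys.argv[1:]))
```

Since the wrapper's own lifetime follows the child's, the container would stop as soon as the child has finished shutting down, and no sooner. Whether the real Shim needs a change like this is exactly what remains to be tested.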


This was referenced Nov 26, 2022

Replace update.sh with an external dependency #109

Closed

Container image version dictated by Orchestrator, to enable canarying with centrally managed roll forwards and rollbacks #126

Open

Revert nginx config if faulty #123

Closed

AnomalRoil added a commit to AnomalRoil/L1-node that referenced this issue Jan 11, 2023

Migrating away from update.sh and run.sh

b9d8c4c

Let us embrace docker compose. Fixes filecoin-saturn#109, filecoin-saturn#125

AnomalRoil added a commit to AnomalRoil/L1-node that referenced this issue Jan 11, 2023

Migrating away from update.sh and run.sh

3dfd577

Let us embrace docker compose. Fixes filecoin-saturn#109, filecoin-saturn#125

AnomalRoil mentioned this issue Jan 11, 2023

Migrating away from update.sh and run.sh #168

Merged

AnomalRoil added a commit to AnomalRoil/L1-node that referenced this issue Jan 11, 2023

Migrating away from update.sh and run.sh

df19399

Let us embrace docker compose. Fixes filecoin-saturn#109, filecoin-saturn#125

joaosa closed this as completed in #168 Jan 17, 2023
