-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Terminate Runs of Background Tasks on Cancel #1425
Comments
The run count is the number of runs completed, so it would make sense that a long-running Stopping runs underway is covered by #129. It will require some work; I'm not sure how much yet. I'll bring that into 5.2.0 and could be talked into making a 5.1.x bugfix. |
Hi @arlake228 I've also run into a similar issue, particularly with At least from my observations, the In my case, here's what it looks like over time on a single host within a mesh of 16 servers over 2 days: This is within the last 24 hour period: Noting that all 16 servers show similar issues, some are worse than others. On the specific server in question:
On the oldest running
I also see the same issues within our perfSONAR (v5.1.2) dev env (which is a smaller mesh of 3 servers within the same site). In addition, I was chatting with another perfSONAR server operator who mentioned to me that they don't use FWIW, I had initially thought the issue was being caused by |
I ran into a case in 5.1.0 where I built a mesh where some of the far-end hosts had not come up yet. Since latencybg tests are single-particpant, the tasks were still created and powstream is started. Since there is nothing on the far end, powstream would never yield any results.
pSConfig would run again and notice we had these background tasks that weren't producing any results (based on the run count in pscheduler api), send a cancel and then create a new task. Example of such a task below:
After canceling the task, pSConfig would create a new task. The problem is that the "cancel" operation doesn't kill the powstream. This led to a build up of powstreams (and quite a bit of memory usage), as new ones got created every hour for tasks in this category. I know we have discussed this before but was having trouble finding the issue for it. I think pSConfig is doing the right thing with the info it has, but it'd be nice to kill the underlying background process on cancel.
The text was updated successfully, but these errors were encountered: