Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

supervisor doesn't always respond to TERM #7416

Open
robnagler opened this issue Jan 6, 2025 · 0 comments
Open

supervisor doesn't always respond to TERM #7416

robnagler opened this issue Jan 6, 2025 · 0 comments

Comments

@robnagler
Copy link
Member

#7404 (review)

I can't create a reproducible example but this has happened to me 3 times: The supervisor can get in a state where it no longer responds to SIGTERM.

~$ ps uww | grep job_supervisor
vagrant   426434  6.5  2.0 596624 164596 pts/2   Sl+  16:41   0:32 /home/vagrant/.pyenv/versions/3.9.15/envs/py3/bin/python /home/vagrant/.pyenv/versions/py3/bin/sirepo job_supervisor
~$ kill -SIGTERM 426434
~$ ps uww | grep job_supervisor
vagrant   426434  6.5  2.0 596624 164596 pts/2   Sl+  16:41   0:33 /home/vagrant/.pyenv/versions/3.9.15/envs/py3/bin/python /home/vagrant/.pyenv/versions/py3/bin/sirepo job_supervisor

When I send the SIGTERM to the supervisor it logs

Jan 06 16:49:33 426434     0 sirepo/job_driver/local.py:87:kill LocalDriver(a=foCR k=sequential u=d2dF []) pid=427224

The closest I can get to a reproducible example is it seem to only happen when the supervisor is signaled while a job is running.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant