-
Notifications
You must be signed in to change notification settings - Fork 654
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
pipelines stuck on remote #1328
Comments
@lolpa1n As you'll see in this example, the queue specified in |
i am add maybe is this because I'm running on the same machine? |
Hello, |
@lolpa1n Sounds like the issue is more that you are using the same agent (you can deploy multiple agents on the same machine) - It can't take care of the steps since its busy handling the pipeline controller which in turn is waiting for the steps to complete. @suparshukov Not sure the same applies for your use case? I think you'll need to take a look at the logs inside the container that appears as if its not doing anything to better isolate the issue. |
Experimenting with steps from functions and Also, I'm using freshly built docker image that I didn't push into our artifactory. I set it on ClearML UI, and during the execution it said: |
Hi @kiranzo , can you include the complete log? |
Hello,
I deployed clearml server on my machine and wanted to make pipelines:
my code:
if I execute:
pipe.start_locally(run_pipeline_steps_locally=True)
then everything works,
but if I change to
pipe.start(queue='test_gpu')
, afterclearml-agent daemon --detached --queue test_gpu --gpus 0:
then nothing happens and the green status - QUEUED
logs:
Tell me pls, how to do this correctly, if I want, for example, to select a specific GPU, etc. for launch
The text was updated successfully, but these errors were encountered: