Fix CUDA plugin CI. #10848
build_and_test.yml
on: pull_request
get-torch-commit
1s
Build XLA CUDA plugin
/
build
5m 32s
TPU tests
/
tpu-test
50m 9s
Build docs
/
build-docs
1m 40s
Matrix: GPU tests / test
Matrix: CPU tests / test
Matrix: GPU tests requiring torch CUDA / test
Waiting for pending jobs
Annotations
6 errors and 2 warnings
GPU tests / test (python_tests, torch_mp_op)
Process completed with exit code 134.
|
GPU tests / test (python_tests, xla_op1)
Process completed with exit code 1.
|
GPU tests / test (python_tests, xla_op2)
Process completed with exit code 1.
|
GPU tests / test (python_tests, xla_op3)
Process completed with exit code 1.
|
CPU tests / test (python_tests, torch_mp_op)
Process completed with exit code 134.
|
Build PyTorch with CUDA / build
The self-hosted runner: i-03d77a36868c4b238 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
|
get-torch-commit
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
Build docs / build-docs
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|