[Codegen][ROCm] Drop assume alignments #14793
Triggered via pull request
November 14, 2024 16:24
Status: Failure
Total duration: 6h 13m 44s
Artifacts: 4
pkgci.yml
on: pull_request
setup / setup: 11s
Unit Test / Linux (x86_64): 2m 45s
Test AMD MI250 / test_mi250: 4m 57s
Test AMD MI300 / test_mi300: 5h 58m
Test AMD W7900 / test_w7900: 40s
Test Android / android_arm64: 5m 37s
Test RISC-V 64 / riscv64: 5m 1s
Test TensorFlow / Linux (x86_64): 1m 13s
Matrix: Regression Test / test_models
Matrix: Regression Test / test_regression_suite
Matrix: Test ONNX / test_onnx_models
Matrix: Test ONNX / test_onnx_ops
Annotations
21 errors
Test TensorFlow / Linux (x86_64)
Process completed with exit code 1.
|
Regression Test / test_models :: cpu_llvm_task
Process completed with exit code 1.
|
Test AMD W7900 / test_w7900
Process completed with exit code 1.
|
Unit Test / Linux (x86_64)
Process completed with exit code 1.
|
Test ONNX / test_onnx_models :: cpu_llvm_task
Process completed with exit code 1.
|
Regression Test / test_regression_suite :: cpu_llvm_task
Process completed with exit code 1.
|
Regression Test / test_regression_suite :: cpu_llvm_task
Process completed with exit code 1.
|
Test ONNX / test_onnx_ops :: cpu_llvm_sync
Process completed with exit code 1.
|
Test ONNX / test_onnx_ops :: amdgpu_vulkan
Process completed with exit code 1.
|
Test AMD MI250 / test_mi250
Process completed with exit code 8.
|
Test RISC-V 64 / riscv64
Process completed with exit code 1.
|
Test Android / android_arm64
Process completed with exit code 1.
|
Test ONNX / test_onnx_ops :: amdgpu_rocm_rdna3
Process completed with exit code 1.
|
Regression Test / test_models :: amdgpu_rocm_mi300_gfx942
System.TimeoutException: The HTTP request timed out after 00:01:40.
---> System.Threading.Tasks.TaskCanceledException: A task was canceled.
at System.Threading.Tasks.TaskCompletionSourceWithCancellation`1.WaitWithCancellationAsync(CancellationToken cancellationToken)
at System.Net.Http.HttpConnectionPool.GetHttp11ConnectionAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
at System.Net.Http.HttpConnectionPool.SendWithVersionDetectionAndRetryAsync(HttpRequestMessage request, Boolean async, Boolean doRequestAuth, CancellationToken cancellationToken)
at System.Net.Http.AuthenticationHelper.SendWithAuthAsync(HttpRequestMessage request, Uri authUri, Boolean async, ICredentials credentials, Boolean preAuthenticate, Boolean isProxyAuth, Boolean doRequestAuth, HttpConnectionPool pool, CancellationToken cancellationToken)
at System.Net.Http.DecompressionHandler.SendAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
at GitHub.Services.Common.VssHttpMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
--- End of inner exception stack trace ---
at GitHub.Services.Common.VssHttpMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
at GitHub.Services.Common.VssHttpRetryMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
at GitHub.Runner.Common.RedirectMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
at GitHub.Runner.Common.ThrottlingReportHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
at System.Net.Http.HttpClient.<SendAsync>g__Core|83_0(HttpRequestMessage request, HttpCompletionOption completionOption, CancellationTokenSource cts, Boolean disposeCts, CancellationTokenSource pendingRequestsCts, CancellationToken originalCancellationToken)
at GitHub.Services.WebApi.VssHttpClientBase.SendAsync(HttpRequestMessage message, HttpCompletionOption completionOption, Object userState, CancellationToken cancellationToken)
at GitHub.Services.WebApi.VssHttpClientBase.SendAsync[T](HttpRequestMessage message, Object userState, CancellationToken cancellationToken)
at GitHub.Services.Location.Client.LocationHttpClient.GetConnectionDataAsync(ConnectOptions connectOptions, Int64 lastChangeId, CancellationToken cancellationToken, Object userState)
at GitHub.Services.WebApi.Location.VssServerDataProvider.GetConnectionDataAsync(ConnectOptions connectOptions, Int32 lastChangeId, CancellationToken cancellationToken)
at GitHub.Services.WebApi.Location.VssServerDataProvider.ConnectAsync(ConnectOptions connectOptions, CancellationToken cancellationToken)
at GitHub.Runner.Common.JobServer.ConnectAsync(VssConnection jobConnection)
at GitHub.Runner.Worker.JobRunner.RunAsync(AgentJobRequestMessage message, CancellationToken jobRequestCancellationToken)
at GitHub.Runner.Worker.Worker.RunAsync(String pipeIn, String pipeOut)
at GitHub.Runner.Worker.Program.MainAsync(IHostContext context, String[] args)
|
Regression Test / test_models :: amdgpu_rocm_mi300_gfx942
The self-hosted runner: iree-mi300-4 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
|
Regression Test / test_regression_suite :: amdgpu_rocm_mi300_gfx942
System.AggregateException: One or more errors occurred. (The HTTP request timed out after 00:01:40.) (The HTTP request timed out after 00:01:40.) (The HTTP request timed out after 00:01:40.) (The HTTP request timed out after 00:01:40.) (The HTTP request timed out after 00:01:40.)
---> System.TimeoutException: The HTTP request timed out after 00:01:40.
---> System.Threading.Tasks.TaskCanceledException: A task was canceled.
at System.Threading.Tasks.TaskCompletionSourceWithCancellation`1.WaitWithCancellationAsync(CancellationToken cancellationToken)
at System.Net.Http.HttpConnectionPool.GetHttp11ConnectionAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
at System.Net.Http.HttpConnectionPool.SendWithVersionDetectionAndRetryAsync(HttpRequestMessage request, Boolean async, Boolean doRequestAuth, CancellationToken cancellationToken)
at System.Net.Http.AuthenticationHelper.SendWithAuthAsync(HttpRequestMessage request, Uri authUri, Boolean async, ICredentials credentials, Boolean preAuthenticate, Boolean isProxyAuth, Boolean doRequestAuth, HttpConnectionPool pool, CancellationToken cancellationToken)
at System.Net.Http.DecompressionHandler.SendAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
at GitHub.Services.Common.VssHttpMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
--- End of inner exception stack trace ---
at GitHub.Services.Common.VssHttpMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
at GitHub.Services.Common.VssHttpRetryMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
at GitHub.Runner.Common.RedirectMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
at GitHub.Runner.Common.ThrottlingReportHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
at System.Net.Http.HttpClient.<SendAsync>g__Core|83_0(HttpRequestMessage request, HttpCompletionOption completionOption, CancellationTokenSource cts, Boolean disposeCts, CancellationTokenSource pendingRequestsCts, CancellationToken originalCancellationToken)
at GitHub.Services.WebApi.VssHttpClientBase.SendAsync(HttpRequestMessage message, HttpCompletionOption completionOption, Object userState, CancellationToken cancellationToken)
at GitHub.Services.WebApi.VssHttpClientBase.SendAsync(HttpMethod method, Guid locationId, Object routeValues, ApiResourceVersion version, HttpContent content, IEnumerable`1 queryParameters, Object userState, CancellationToken cancellationToken)
at GitHub.DistributedTask.WebApi.TaskHttpClient.RaisePlanEventAsync[T](Guid scopeIdentifier, String planType, Guid planId, T eventData, CancellationToken cancellationToken, Object userState)
at GitHub.Runner.Worker.JobRunner.CompleteJobAsync(IJobServer jobServer, IExecutionContext jobContext, AgentJobRequestMessage message, Nullable`1 taskResult)
--- End of inner exception stack trace ---
at GitHub.Runner.Worker.JobRunner.CompleteJobAsync(IJobServer jobServer, IExecutionContext jobContext, AgentJobRequestMessage message, Nullable`1 taskResult)
at GitHub.Runner.Worker.JobRunner.CompleteJobAsync(IRunnerService server, IExecutionContext jobContext, AgentJobRequestMessage message, Nullable`1 taskResult)
at GitHub.Runner.Worker.JobRunner.RunAsync(AgentJobRequestMessage message, CancellationToken jobRequestCancellationToken)
at GitHub.Runner.Worker.JobRunner.RunAsync(AgentJobRequestMessage message, CancellationToken jobRequestCancellationToken)
at GitHub.Runner.Worker.Worker.RunAsync(String pipeIn, String pipeOut)
at GitHub.Runner.Worker.Program.MainAsync(IHostContext context, String[] args)
---> (Inner Exception #1) System.TimeoutException: The HTTP request timed out after 00:01:40.
---> System.Threading.Tasks.TaskCanceledException: A task was canceled.
at System.Threading.Tasks.TaskCompletionSourceWithCancellation`1.WaitWithCancellationAsync(CancellationToken cancellationToken)
a
|
Regression Test / test_regression_suite :: amdgpu_rocm_mi300_gfx942
The self-hosted runner: iree-mi300-7 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
|
Regression Test / test_regression_suite :: amdgpu_rocm_mi300_gfx942
Process completed with exit code 1.
|
Test AMD MI300 / test_mi300
The job running on runner iree-mi300-5 has exceeded the maximum execution time of 360 minutes.
|
Test AMD MI300 / test_mi300
Unable to download artifact(s): Unable to download and extract artifact: Artifact download failed after 5 retries.
|
Test AMD MI300 / test_mi300
The operation was canceled.
|
Artifacts
Produced during runtime
Name | Size
---|---
linux_x86_64_release_packages | 74.4 MB
onnx_ops_cpu_llvm_sync.json | 3.05 KB
onnx_ops_gpu_rocm_rdna3.json | 3.12 KB
onnx_ops_gpu_vulkan.json | 3.59 KB