[Codegen][ROCm] Drop assume alignments #14793

Triggered via pull request November 14, 2024 16:24
Status Failure
Total duration 6h 13m 44s
Artifacts 4

pkgci.yml

on: pull_request
Build Packages / Linux Release (x86_64) 13m 11s
Unit Test / Linux (x86_64) 2m 45s
Test AMD MI250 / test_mi250 4m 57s
Test AMD MI300 / test_mi300 5h 58m
Test AMD W7900 / test_w7900 40s
Test Android / android_arm64 5m 37s
Test RISC-V 64 / riscv64 5m 1s
Test TensorFlow / Linux (x86_64) 1m 13s
Matrix: Regression Test / test_models
Matrix: Regression Test / test_regression_suite
Matrix: Test ONNX / test_onnx_models
Matrix: Test ONNX / test_onnx_ops

Annotations

21 errors
Test TensorFlow / Linux (x86_64)
Process completed with exit code 1.
Regression Test / test_models :: cpu_llvm_task
Process completed with exit code 1.
Test AMD W7900 / test_w7900
Process completed with exit code 1.
Unit Test / Linux (x86_64)
Process completed with exit code 1.
Test ONNX / test_onnx_models :: cpu_llvm_task
Process completed with exit code 1.
Regression Test / test_regression_suite :: cpu_llvm_task
Process completed with exit code 1.
Regression Test / test_regression_suite :: cpu_llvm_task
Process completed with exit code 1.
Test ONNX / test_onnx_ops :: cpu_llvm_sync
Process completed with exit code 1.
Test ONNX / test_onnx_ops :: amdgpu_vulkan
Process completed with exit code 1.
Test AMD MI250 / test_mi250
Process completed with exit code 8.
Test RISC-V 64 / riscv64
Process completed with exit code 1.
Test Android / android_arm64
Process completed with exit code 1.
Test ONNX / test_onnx_ops :: amdgpu_rocm_rdna3
Process completed with exit code 1.
Regression Test / test_models :: amdgpu_rocm_mi300_gfx942
System.TimeoutException: The HTTP request timed out after 00:01:40.
 ---> System.Threading.Tasks.TaskCanceledException: A task was canceled.
   at System.Threading.Tasks.TaskCompletionSourceWithCancellation`1.WaitWithCancellationAsync(CancellationToken cancellationToken)
   at System.Net.Http.HttpConnectionPool.GetHttp11ConnectionAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
   at System.Net.Http.HttpConnectionPool.SendWithVersionDetectionAndRetryAsync(HttpRequestMessage request, Boolean async, Boolean doRequestAuth, CancellationToken cancellationToken)
   at System.Net.Http.AuthenticationHelper.SendWithAuthAsync(HttpRequestMessage request, Uri authUri, Boolean async, ICredentials credentials, Boolean preAuthenticate, Boolean isProxyAuth, Boolean doRequestAuth, HttpConnectionPool pool, CancellationToken cancellationToken)
   at System.Net.Http.DecompressionHandler.SendAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
   at GitHub.Services.Common.VssHttpMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   --- End of inner exception stack trace ---
   at GitHub.Services.Common.VssHttpMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   at GitHub.Services.Common.VssHttpRetryMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   at GitHub.Runner.Common.RedirectMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   at GitHub.Runner.Common.ThrottlingReportHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   at System.Net.Http.HttpClient.<SendAsync>g__Core|83_0(HttpRequestMessage request, HttpCompletionOption completionOption, CancellationTokenSource cts, Boolean disposeCts, CancellationTokenSource pendingRequestsCts, CancellationToken originalCancellationToken)
   at GitHub.Services.WebApi.VssHttpClientBase.SendAsync(HttpRequestMessage message, HttpCompletionOption completionOption, Object userState, CancellationToken cancellationToken)
   at GitHub.Services.WebApi.VssHttpClientBase.SendAsync[T](HttpRequestMessage message, Object userState, CancellationToken cancellationToken)
   at GitHub.Services.Location.Client.LocationHttpClient.GetConnectionDataAsync(ConnectOptions connectOptions, Int64 lastChangeId, CancellationToken cancellationToken, Object userState)
   at GitHub.Services.WebApi.Location.VssServerDataProvider.GetConnectionDataAsync(ConnectOptions connectOptions, Int32 lastChangeId, CancellationToken cancellationToken)
   at GitHub.Services.WebApi.Location.VssServerDataProvider.ConnectAsync(ConnectOptions connectOptions, CancellationToken cancellationToken)
   at GitHub.Runner.Common.JobServer.ConnectAsync(VssConnection jobConnection)
   at GitHub.Runner.Worker.JobRunner.RunAsync(AgentJobRequestMessage message, CancellationToken jobRequestCancellationToken)
   at GitHub.Runner.Worker.Worker.RunAsync(String pipeIn, String pipeOut)
   at GitHub.Runner.Worker.Program.MainAsync(IHostContext context, String[] args)
Regression Test / test_models :: amdgpu_rocm_mi300_gfx942
The self-hosted runner: iree-mi300-4 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
Regression Test / test_regression_suite :: amdgpu_rocm_mi300_gfx942
System.AggregateException: One or more errors occurred. (The HTTP request timed out after 00:01:40.) (The HTTP request timed out after 00:01:40.) (The HTTP request timed out after 00:01:40.) (The HTTP request timed out after 00:01:40.) (The HTTP request timed out after 00:01:40.)
 ---> System.TimeoutException: The HTTP request timed out after 00:01:40.
 ---> System.Threading.Tasks.TaskCanceledException: A task was canceled.
   at System.Threading.Tasks.TaskCompletionSourceWithCancellation`1.WaitWithCancellationAsync(CancellationToken cancellationToken)
   at System.Net.Http.HttpConnectionPool.GetHttp11ConnectionAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
   at System.Net.Http.HttpConnectionPool.SendWithVersionDetectionAndRetryAsync(HttpRequestMessage request, Boolean async, Boolean doRequestAuth, CancellationToken cancellationToken)
   at System.Net.Http.AuthenticationHelper.SendWithAuthAsync(HttpRequestMessage request, Uri authUri, Boolean async, ICredentials credentials, Boolean preAuthenticate, Boolean isProxyAuth, Boolean doRequestAuth, HttpConnectionPool pool, CancellationToken cancellationToken)
   at System.Net.Http.DecompressionHandler.SendAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
   at GitHub.Services.Common.VssHttpMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   --- End of inner exception stack trace ---
   at GitHub.Services.Common.VssHttpMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   at GitHub.Services.Common.VssHttpRetryMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   at GitHub.Runner.Common.RedirectMessageHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   at GitHub.Runner.Common.ThrottlingReportHandler.SendAsync(HttpRequestMessage request, CancellationToken cancellationToken)
   at System.Net.Http.HttpClient.<SendAsync>g__Core|83_0(HttpRequestMessage request, HttpCompletionOption completionOption, CancellationTokenSource cts, Boolean disposeCts, CancellationTokenSource pendingRequestsCts, CancellationToken originalCancellationToken)
   at GitHub.Services.WebApi.VssHttpClientBase.SendAsync(HttpRequestMessage message, HttpCompletionOption completionOption, Object userState, CancellationToken cancellationToken)
   at GitHub.Services.WebApi.VssHttpClientBase.SendAsync(HttpMethod method, Guid locationId, Object routeValues, ApiResourceVersion version, HttpContent content, IEnumerable`1 queryParameters, Object userState, CancellationToken cancellationToken)
   at GitHub.DistributedTask.WebApi.TaskHttpClient.RaisePlanEventAsync[T](Guid scopeIdentifier, String planType, Guid planId, T eventData, CancellationToken cancellationToken, Object userState)
   at GitHub.Runner.Worker.JobRunner.CompleteJobAsync(IJobServer jobServer, IExecutionContext jobContext, AgentJobRequestMessage message, Nullable`1 taskResult)
   --- End of inner exception stack trace ---
   at GitHub.Runner.Worker.JobRunner.CompleteJobAsync(IJobServer jobServer, IExecutionContext jobContext, AgentJobRequestMessage message, Nullable`1 taskResult)
   at GitHub.Runner.Worker.JobRunner.CompleteJobAsync(IRunnerService server, IExecutionContext jobContext, AgentJobRequestMessage message, Nullable`1 taskResult)
   at GitHub.Runner.Worker.JobRunner.RunAsync(AgentJobRequestMessage message, CancellationToken jobRequestCancellationToken)
   at GitHub.Runner.Worker.JobRunner.RunAsync(AgentJobRequestMessage message, CancellationToken jobRequestCancellationToken)
   at GitHub.Runner.Worker.Worker.RunAsync(String pipeIn, String pipeOut)
   at GitHub.Runner.Worker.Program.MainAsync(IHostContext context, String[] args)
 ---> (Inner Exception #1) System.TimeoutException: The HTTP request timed out after 00:01:40.
 ---> System.Threading.Tasks.TaskCanceledException: A task was canceled.
   at System.Threading.Tasks.TaskCompletionSourceWithCancellation`1.WaitWithCancellationAsync(CancellationToken cancellationToken) a
Regression Test / test_regression_suite :: amdgpu_rocm_mi300_gfx942
The self-hosted runner: iree-mi300-7 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
Regression Test / test_regression_suite :: amdgpu_rocm_mi300_gfx942
Process completed with exit code 1.
Test AMD MI300 / test_mi300
The job running on runner iree-mi300-5 has exceeded the maximum execution time of 360 minutes.
Test AMD MI300 / test_mi300
Unable to download artifact(s): Unable to download and extract artifact: Artifact download failed after 5 retries.
Test AMD MI300 / test_mi300
The operation was canceled.

Artifacts

Produced during runtime
Name                           Size
linux_x86_64_release_packages  74.4 MB
onnx_ops_cpu_llvm_sync.json    3.05 KB
onnx_ops_gpu_rocm_rdna3.json   3.12 KB
onnx_ops_gpu_vulkan.json       3.59 KB