Skip to content

openBLAS warning and segmentation fault issue #32

@rvitancol

Description

@rvitancol

I am encountering warnings on openBLAS and eventually segmentation fault that leads to stopping of execution for Novae: Happens during fine-tuning step

OpenBLAS warning: precompiled NUM_THREADS exceeded, adding auxiliary array for thread metadata.
To avoid this warning, please rebuild your copy of OpenBLAS with a larger NUM_THREADS setting
or set the environment variable OPENBLAS_NUM_THREADS to 64 or lower
Segmentation fault (core dumped)

I have tried debugging and different settings for the environment variables that controls openblas number of threads but still getting issues. Here is the thread pool information:

=== Thread Pool Information ===
Active Threading Libraries:
  openblas        | Threads:  57 | Prefix: libopenblas | Version: 0.3.23.dev
  openblas        | Threads:  57 | Prefix: libscipy_openblas | Version: 0.3.28
  openmp          | Threads:  48 | Prefix: libgomp | Version: None
  openmp          | Threads:  57 | Prefix: libgomp | Version: None

PyTorch Threading:
  PyTorch version: 2.8.0+cu128
  Number of threads: 48
  Inter-op threads: 57

  Parallel info:
ATen/Parallel:
	at::get_num_threads() : 48
	at::get_num_interop_threads() : 57
OpenMP 201511 (a.k.a. OpenMP 4.5)
	omp_get_max_threads() : 48
Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications
	mkl_get_max_threads() : 48
Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d)
std::thread::hardware_concurrency() : 96
Environment variables:
	OMP_NUM_THREADS : 57
	MKL_NUM_THREADS : 57
ATen parallel backend: OpenMP

=== CPU Information (lscpu) ===
Architecture:                            x86_64
CPU op-mode(s):                          32-bit, 64-bit
Byte Order:                              Little Endian
Address sizes:                           46 bits physical, 48 bits virtual
CPU(s):                                  96
On-line CPU(s) list:                     0-95
Thread(s) per core:                      2
Core(s) per socket:                      48
Socket(s):                               1
NUMA node(s):                            1
Vendor ID:                               GenuineIntel
CPU family:                              6

Do you have suggestions on other things to try to make this work.

Thank you.
Rona

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions