Skip to content

Commit 7149558

Browse files
ogriselthomasjpfan
andauthored
CI Enable parallel build on CI via an env variable (scikit-learn#18826)
Co-authored-by: Thomas J. Fan <[email protected]>
1 parent e31b21d commit 7149558

File tree

6 files changed

+45
-12
lines changed

6 files changed

+45
-12
lines changed

.github/workflows/wheels.yml

+1
Original file line numberDiff line numberDiff line change
@@ -83,6 +83,7 @@ jobs:
8383
OMP_NUM_THREADS=2
8484
OPENBLAS_NUM_THREADS=2
8585
SKLEARN_SKIP_NETWORK_TESTS=1
86+
SKLEARN_BUILD_PARALLEL=3
8687
CIBW_BUILD: cp${{ matrix.python }}-${{ matrix.platform_id }}
8788
CIBW_TEST_REQUIRES: pytest pandas threadpoolctl
8889
# Test that there are no links to system libraries

build_tools/azure/install.sh

+7-7
Original file line numberDiff line numberDiff line change
@@ -133,19 +133,19 @@ try:
133133
except ImportError:
134134
print('pandas not installed')
135135
"
136-
python -m pip list
136+
# Set parallelism to 3 to overlap IO bound tasks with CPU bound tasks on CI
137+
# workers with 2 cores when building the compiled extensions of scikit-learn.
138+
export SKLEARN_BUILD_PARALLEL=3
137139

140+
python -m pip list
138141
if [[ "$DISTRIB" == "conda-pip-latest" ]]; then
139-
# Check that pip can automatically install the build dependencies from
140-
# pyproject.toml using an isolated build environment:
142+
# Check that pip can automatically build scikit-learn with the build
143+
# dependencies specified in pyproject.toml using an isolated build
144+
# environment:
141145
pip install --verbose --editable .
142146
else
143147
# Use the pre-installed build dependencies and build directly in the
144148
# current environment.
145-
# Use setup.py instead of `pip install -e .` to be able to pass the -j flag
146-
# to speed-up the building multicore CI machines.
147-
python setup.py build_ext --inplace -j 3
148149
python setup.py develop
149150
fi
150-
151151
ccache -s

build_tools/circle/build_doc.sh

+3-2
Original file line numberDiff line numberDiff line change
@@ -183,8 +183,9 @@ source activate testenv
183183
pip install sphinx-gallery
184184
pip install numpydoc
185185

186-
# Build and install scikit-learn in dev mode
187-
python setup.py build_ext --inplace -j 3
186+
# Set parallelism to 3 to overlap IO bound tasks with CPU bound tasks on CI
187+
# workers with 2 cores when building the compiled extensions of scikit-learn.
188+
export SKLEARN_BUILD_PARALLEL=3
188189
python setup.py develop
189190

190191
export OMP_NUM_THREADS=1

build_tools/circle/build_test_pypy.sh

+4-1
Original file line numberDiff line numberDiff line change
@@ -32,8 +32,11 @@ export CCACHE_COMPRESS=1
3232
export PATH=/usr/lib/ccache:$PATH
3333
export LOKY_MAX_CPU_COUNT="2"
3434
export OMP_NUM_THREADS="1"
35+
# Set parallelism to 3 to overlap IO bound tasks with CPU bound tasks on CI
36+
# workers with 2 cores when building the compiled extensions of scikit-learn.
37+
export SKLEARN_BUILD_PARALLEL=3
3538

36-
python setup.py build_ext --inplace -j 3
39+
# Build and install scikit-learn in dev mode
3740
pip install --no-build-isolation -e .
3841

3942
# Check that Python implementation is PyPy

doc/developers/advanced_installation.rst

+14
Original file line numberDiff line numberDiff line change
@@ -439,3 +439,17 @@ Before using ICC, you need to set up environment variables::
439439
Finally, you can build scikit-learn. For example on Linux x86_64::
440440

441441
python setup.py build_ext --compiler=intelem -i build_clib --compiler=intelem
442+
443+
Parallel builds
444+
===============
445+
446+
It is possible to build scikit-learn compiled extensions in parallel by setting
447+
and environment variable as follows before calling the ``pip install`` or
448+
``python setup.py build_ext`` commands::
449+
450+
export SKLEARN_BUILD_PARALLEL=3
451+
pip install --verbose --no-build-isolation --editable .
452+
453+
On a machine with 2 CPU cores, it can be beneficial to use a parallelism level
454+
of 3 to overlap IO bound tasks (reading and writing files on disk) with CPU
455+
bound tasks (actually compiling).

setup.py

+16-2
Original file line numberDiff line numberDiff line change
@@ -108,13 +108,27 @@ def run(self):
108108

109109
cmdclass = {'clean': CleanCommand, 'sdist': sdist}
110110

111-
# custom build_ext command to set OpenMP compile flags depending on os and
112-
# compiler
111+
# Custom build_ext command to set OpenMP compile flags depending on os and
112+
# compiler. Also makes it possible to set the parallelism level via
113+
# and environment variable (useful for the wheel building CI).
113114
# build_ext has to be imported after setuptools
114115
try:
115116
from numpy.distutils.command.build_ext import build_ext # noqa
116117

117118
class build_ext_subclass(build_ext):
119+
120+
def finalize_options(self):
121+
super().finalize_options()
122+
if self.parallel is None:
123+
# Do not override self.parallel if already defined by
124+
# command-line flag (--parallel or -j)
125+
126+
parallel = os.environ.get("SKLEARN_BUILD_PARALLEL")
127+
if parallel:
128+
self.parallel = int(parallel)
129+
if self.parallel:
130+
print("setting parallel=%d " % self.parallel)
131+
118132
def build_extensions(self):
119133
from sklearn._build_utils.openmp_helpers import get_openmp_flag
120134

0 commit comments

Comments
 (0)