Overview

We use asv to benchmark some representative BrainGlobe workflows. The asv workflow is roughly as follows:

  1. asv creates a virtual environment in which to run the benchmarks, as defined in the asv configuration file (see below).
  2. It installs the version of the brainglobe-workflows package corresponding to the tip of the locally checked-out branch.
  3. It runs the benchmarks as defined (locally) under benchmarks/benchmarks and saves the results to benchmarks/results as json files.
  4. With asv publish, the output json files are 'published' into an html directory (benchmarks/html).
  5. With asv preview, the html directory can be served with a local web server and viewed in a browser.
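
In practice this corresponds to a short sequence of asv commands, run from the benchmarks directory (a sketch; the configuration files are described below and passed with --config):

asv run --config <path-to-asv-config>      # steps 1-3: create the environment, install the package, run the benchmarks
asv publish --config <path-to-asv-config>  # step 4: generate the html report under benchmarks/html
asv preview --config <path-to-asv-config>  # step 5: serve the html report with a local web server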

We include code to benchmark the workflows defined under brainglobe_workflows. There are two main ways in which these benchmarks can be useful to developers:

  1. Developers can run the available benchmarks on their machine, either on a small default dataset or on custom data of their choice (see the sections below).
  2. We also run the benchmarks internally on a large dataset, and make the results publicly available.

Additionally, we ship two asv configuration files, which define two different environments for asv to create and run the benchmarks in. brainglobe-workflows depends on a number of BrainGlobe packages, and the only difference between the two asv-defined environments is where those BrainGlobe packages are installed from: in asv.pip.conf.json they are installed from PyPI, while in asv.latest-github.conf.json they are installed from their main branch on GitHub. Note that, because we provide these two files rather than a single default asv.conf.json, all asv commands need to specify the configuration file to use with the --config flag.
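
For example, to run the benchmarks against each set of dependencies:

asv run --config asv.pip.conf.json            # BrainGlobe packages installed from PyPI
asv run --config asv.latest-github.conf.json  # BrainGlobe packages installed from their main branch on GitHub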

See the asv reference docs for further details on the tool and on how to run benchmarks. The first time you run benchmarks on a new machine, you will need to run asv machine --yes to set up the machine for benchmarking.
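
For example, on a new machine:

asv machine --yes  # record machine information, accepting the suggested defaults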

Installation

To run the benchmarks, install asv in your current environment:

pip install asv

Note that to run the benchmarks, you do not need to install a development version of brainglobe-workflows in your current environment (asv takes care of this).

Running benchmarks on a small default dataset

To run the benchmarks on the default dataset:

  1. Git clone the brainglobe-workflows repository:
    git clone https://github.com/brainglobe/brainglobe-workflows.git
    
  2. Run asv from the benchmarks directory:
    cd brainglobe-workflows/benchmarks
    asv run --config <path-to-asv-config>  # dependencies from PyPI or GitHub, depending on the asv config file used
    
    This will benchmark the workflows defined in brainglobe_workflows/ using a default set of parameters and a default small dataset. The default parameters are defined as config files under brainglobe_workflows/configs. The default dataset is downloaded from GIN.

Running benchmarks on custom data

To run the benchmarks on a custom local dataset:

  1. Git clone the brainglobe-workflows repository:

    git clone https://github.com/brainglobe/brainglobe-workflows.git
    
  2. Define a config file for the workflow to benchmark.

    • You can use the default config files at brainglobe_workflows/configs/ as reference.
    • You will need to edit/add the fields pointing to the input data.
      • For example, for the cellfinder workflow, the config file will need to include an input_data_dir field pointing to the data of interest. The signal and background data are assumed to be in signal and background directories under the input_data_dir directory; if they are in directories with different names, you can specify these with the signal_subdir and background_subdir fields (see the sketch after this list).
  3. Benchmark the workflow, passing the path to your custom config file as an environment variable.

    • For example, to benchmark the cellfinder workflow, you will need to prepend the environment variable definition to the asv run command (valid for Unix systems):
    CELLFINDER_CONFIG_PATH=/path/to/your/config/file asv run --config <path-to-asv-config>
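
As a sketch of step 2 above (the filenames here are assumptions; the full set of required fields is defined by the default config files), one way to prepare a custom cellfinder config is to copy a default one and edit the data-location fields:

cp brainglobe_workflows/configs/cellfinder.json my_cellfinder_config.json  # assumed filenames
# edit my_cellfinder_config.json so that it contains, for example:
#   "input_data_dir": "/path/to/my/data",
#   "signal_subdir": "my_signal_dir",          # only needed if the signal directory is not called "signal"
#   "background_subdir": "my_background_dir"   # only needed if the background directory is not called "background"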
    

Running benchmarks in development

The following flags to asv run are often useful in development:

  • --quick: runs each benchmark only once, and does not save results to disk.
  • --verbose: provides further information on intermediate steps.
  • --show-stderr: prints the standard error output from the benchmarks.
  • --dry-run: does not write results to disk.
  • --bench: specifies a subset of benchmarks to run (e.g., TimeFullWorkflow); regular expressions can be used.
  • --python=same: runs the benchmarks in the same environment that asv was launched from.

Example:

asv run --config <path-to-asv-config> --bench TimeFullWorkflow --dry-run --show-stderr --quick
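
For even faster iteration, these flags can be combined with --python=same to reuse the current environment (a sketch; brainglobe-workflows may need to be installed in that environment for this to work):

asv run --config <path-to-asv-config> --python=same --bench TimeFullWorkflow --quick --show-stderr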