|
| 1 | +--- |
| 2 | +title: 'WSI Superpixel Guided Labeling' |
| 3 | +tags: |
| 4 | + - Python |
| 5 | + - histology |
| 6 | + - bioimage informatics |
| 7 | + - whole slide annotation |
| 8 | + - whole slide images |
| 9 | + - guided labeling |
| 10 | +# (add orcid for anyone who has one) |
| 11 | +authors: |
| 12 | + - name: Brianna Major |
| 13 | + affiliation: 1 |
| 14 | + - name: Jeffery A. Goldstein |
| 15 | + affiliation: 2 |
| 16 | + - name: Lee A. Newberg |
| 17 | + affiliation: 1 |
| 18 | + - name: Anders Sildnes |
| 19 | + affiliation: 2 |
| 20 | + - name: Abhishek Sharma |
| 21 | + affiliation: 2 |
| 22 | + - name: Mike Nagler |
| 23 | + affiliation: 1 |
| 24 | + - name: Lee A. D. Cooper |
| 25 | + affiliation: 2 |
| 26 | + - name: Jeff Baumes |
| 27 | + affiliation: 1 |
| 28 | + - name: David Manthey |
| 29 | + affiliation: 1 |
| 30 | + orcid: 0000-0002-4580-8770 |
| 31 | +affiliations: |
| 32 | + - index: 1 |
| 33 | + name: Kitware, Inc., New York, United States |
| 34 | + - index: 2 |
| 35 | + name: Northwestern University, Illinois, United States |
| 36 | +date: 30 April 2025 |
| 37 | +# (if we have a bib file, add bibliography: paper.bib) |
| 38 | +--- |
| 39 | + |
| 40 | +# Summary |
| 41 | + |
| 42 | +`WSI Superpixel Guided Labeling` facilitates active learning on whole slide images. It has a user interface built on top of the HistomicsUI base and deployed as part of the Digital Slide Archive, and uses the HistomicsTK tool kit as part of the process. |
| 43 | + |
| 44 | +Users label superpixel regions or other segmented areas of whole slide images to be used as classification input for machine learning algorithms. An example algorithm is included which generates superpixels, features, and machine learning models for active learning on a directory of images. The interface allows bulk labeling, labeling the most impactful superpixels to improve the model, and reviewing labeled and predicted categories. |
| 45 | + |
| 46 | +# Statement of need |
| 47 | + |
| 48 | +One of the limitations in generating accurate models is the need for labeled data. Given a model and a few labeled samples, there are a variety of algorithms that can be used to determine what samples should be additionally labeled to most efficiently improve the model. To actually get labeled data, this prediction of which samples to label needs to be combined with an efficient workflow so that the domain expect can use their labeling time in the most effective manner possible. |
| 49 | + |
| 50 | +`WSI Superpixel Guided Labeling` provides a user interface and workflow for this guided labeling process. Given a set of whole slide images, the images are segmented based on a some user choices. This segmentation is the basis for labeling. The user can specify any number of label categories, including labels that will be excluded from training (for instance, for segmented regions whose categories cannot be accurately determined). After labeling a few initial segments, a model is generated and used to both predict the category of all segments and the segments that would result in the best improvement in the model if they were also labeled. The user can retrain the model at any time and review the results of both the predictions and other users. |
| 51 | + |
| 52 | +For development, the initial segmentation uses superpixels generated with the SLIC algorithm. These are computed on whole slide images in a tiled manner so that they can work on arbitrarily large images, and the tile boundaries are properly handled to avoid visible artifacts. Either of two basic models can be trained and used for predictions: small-scale CNN using image features implemented in keras and torch, or a huggingface foundational model that generates a one-dimensional feature vector. The certainty criteria for which segments should be labeled next can also be selected, and includes confidence, margin, negative entropy, and the batchbald algorithm. |
| 53 | + |
| 54 | +We had a placental pathologist provide feedback to validate the efficiency of the user interface and utility of the process. |
| 55 | + |
| 56 | + |
| 57 | + |
| 58 | +# Acknowledgements |
| 59 | + |
| 60 | +[//]: # (Add financial support statement) |
| 61 | + |
| 62 | +We acknowledge contributions from Faiza Ahmed for some of the UI design. |
| 63 | + |
| 64 | +[//]: # (if we have a bib file, add # References) |
| 65 | +[//]: # (references should include DSA, superpixel github repo, and...) |
0 commit comments