Add tutorial: Where to start with bioimage analysis in Galaxy #6603

dianichj · 2026-01-15T12:26:50Z

Summary

Hi everyone! 👋 This PR adds a tutorial for the GTN Imaging section: “Where to start with bioimage analysis in Galaxy.” It’s intended to be the foundational “compass” tutorial: a bridge that helps newcomers move from viewing images as "pictures" to treating them as quantitative data. It complements the existing intro and emphasizes FAIR-by-design principles within Galaxy.

Linked to issue: #36

What’s included

Conceptual Foundations: Pixels/voxels, 5D hyperstacks (XYZCT), bit-depth, and spatial calibration.
Galaxy Onboarding: Handling proprietary formats via Bio-Formats, OME-NGFF standards, and the OMERO route.
Hands-on "First Steps": Practical modules for metadata inspection, filtering, thresholding, and validation.
Logical Roadmaps: A decision tree to help users choose between classical computer vision (CellProfiler) and AI (Cellpose/StarDist).
The Pitfall Guardrail: Warnings on JPEG compression, RGB merging, and photobleaching.

Supporting Files

tutorial.bib: Might need a few more citations or formatting tweaks.
data-library.yaml: Still needs the example datasets configuration.
faqs/index.md & workflows/index.md: Basic structures are in place.

Status: Work in Progress 🧪

This is a functional draft, but still needs some love:

Visuals: Adding missing images and screenshots.
Hands-on: Final polishing of the step-by-step instructions.
Editorial: Cleaning up placeholders like "(Add ...)" and smoothing the text.

Review Focus 🔍

I’d love to get your thoughts on:

Structure: Does the flow make sense for a total beginner?
Next Steps: Is the roadmap clear enough to know where to go after this?
Tools: Are we highlighting the best current practices in Galaxy (e.g., Cellpose-SAM)?
Clarity: Is any part too "jargon-heavy"?

Checklist

Tutorial front matter added.
Finalize data-library.yaml.
Replace image placeholders with final figures.
Verify tutorial tags and metadata.

Thanks a lot for your time and feedback! I'm really looking forward to your ideas and collaboration to make this better. 🚀

@beatrizserrano
@kostrykin
@rmassei

github-actions

Remaining comments which cannot be posted as a review comment to avoid GitHub Rate Limit

GTN Lint

🚫 [GTN Lint] <GTN:009> _{reported by reviewdog 🐶}
This tool identifier looks incorrect, it doesn't have the right number of segments.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 348 in f050a38

    
| **{% tool [CellProfiler](https://usegalaxy.eu/root?tool_id=interactive_tool_cellprofiler) %}** | High-content screening & automation | Allows you to build a complex multi-step "pipeline" and run it on thousands of images consistently. |

🚫 [GTN Lint] <GTN:009> _{reported by reviewdog 🐶}
This tool identifier looks incorrect, it doesn't have the right number of segments.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 381 in f050a38

    
* **{% tool [QuPath IT](https://usegalaxy.eu/root?tool_id=interactive_tool_qupath) %}:** The gold standard for digital pathology. Use this for large tissue sections and to access **StarDist** segmentation.

🚫 [GTN Lint] <GTN:009> _{reported by reviewdog 🐶}
This tool identifier looks incorrect, it doesn't have the right number of segments.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 382 in f050a38

    
           * **{% tool [Ilastik IT](https://usegalaxy.eu/root?tool_id=interactive_tool_ilastik) %}:** Best for "training by example"—manually paint a few cells to teach the computer how to segment the rest based on texture.

🚫 [GTN Lint] <GTN:009> _{reported by reviewdog 🐶}
This tool identifier looks incorrect, it doesn't have the right number of segments.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 383 in f050a38

    
* **{% tool [Cellpose IT](https://usegalaxy.eu/root?tool_id=interactive_tool_cellpose) %}:** & **{% tool [Cellprofiler IT](https://usegalaxy.eu/root?tool_id=interactive_tool_cellprofiler) %}:** Useful for building and fine-tuning your parameters visually before running a massive batch job.
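
One plausible reading of the GTN:009 messages above is that the linter expects a tool identifier inside the `{% tool %}` tag rather than a full server URL. The sketch below illustrates that reading in Python. It is not the linter's actual rule: it assumes a ToolShed-style identifier has six slash-separated segments (`host/repos/owner/repo/tool/version`), and `looks_like_toolshed_id` and the `owner`, `repo`, `tool_name`, and version values are hypothetical placeholders.

```python
def looks_like_toolshed_id(identifier: str) -> bool:
    """Assumed check: six non-empty slash-separated segments, second one 'repos'."""
    parts = identifier.split("/")
    return len(parts) == 6 and parts[1] == "repos" and all(parts)


# Link as written in the flagged tutorial lines (a server URL, not an identifier).
flagged_link = "https://usegalaxy.eu/root?tool_id=interactive_tool_cellprofiler"

# Placeholder identifier in the assumed ToolShed layout; not a real tool.
placeholder_id = "toolshed.g2.bx.psu.edu/repos/owner/repo/tool_name/1.0.0"

url_passes = looks_like_toolshed_id(flagged_link)
placeholder_passes = looks_like_toolshed_id(placeholder_id)
```

Under this assumption the URL form fails because splitting on `/` yields four parts, one of them empty, while the placeholder identifier yields six.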

🚫 [GTN Lint] <GTN:033> _{reported by reviewdog 🐶}
The icon (param-conditional) could not be found, please add it to _config.yml.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 223 in f050a38

    
>    - {% icon param-conditional %} *"Type of image data to process"*: `2-D image data (or series thereof)`

🚫 [GTN Lint] <GTN:033> _{reported by reviewdog 🐶}
The icon (param-conditional) could not be found, please add it to _config.yml.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 225 in f050a38

> - {% icon param-conditional %} *"Filter type"*: `Median`

🚫 [GTN Lint] <GTN:033> _{reported by reviewdog 🐶}
The icon (param-conditional) could not be found, please add it to _config.yml.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 246 in f050a38

    
>    - {% icon param-conditional %} *"Thresholding method"*: `Globally adaptive / Otsu`

⚠️ [GTN Lint] <GTN:020> _{reported by reviewdog 🐶}
This looks like a heading, but isn't. Please use proper semantic headings where possible. You should check the heading level of this suggestion, rather than accepting the change as-is.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 49 in f050a38

    
**(To be add: image showing simple workflow 1. Raw image-Pixels, 2. Numerical Grid for intensities, 3. Data extraction, 4. Final Data)**

⚠️ [GTN Lint] <GTN:020> _{reported by reviewdog 🐶}
This looks like a heading, but isn't. Please use proper semantic headings where possible. You should check the heading level of this suggestion, rather than accepting the change as-is.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 64 in f050a38

**(Add Pete B's Image from his book)**

⚠️ [GTN Lint] <GTN:020> _{reported by reviewdog 🐶}
This looks like a heading, but isn't. Please use proper semantic headings where possible. You should check the heading level of this suggestion, rather than accepting the change as-is.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 126 in f050a38

    
**(Add example images of how two images that look the same to the human eye have different data)**

🚫 [GTN Lint] <GTN:028> _{reported by reviewdog 🐶}
You have skipped a heading level, please correct this.

Listing of Heading Levels

# Introduction
# 1. Know your data (the “digital anatomy” of an image)
## Pixels and voxels
### Bit depth (the range)
### Spatial calibration (the size)
## The 5 dimensions (5D)
## Bit depth: why it matters for science
# 2. How to get your images into Galaxy
### Why use the Bio-Formats tool suite?
# 3. Before you begin: diagnose your data
# 4. The lifecycle of an analysis pipeline
### Stage A: Pre-processing (cleaning)
### Stage B: Segmentation (Defining objects)
### Stage B.2: The Region of Interest (ROI)
### Stage C: Post-processing (Refining)
### Stage D: Quantification (Extracting numbers)
### Stage E: Validation (The sanity check)
# 4. Finding your workflow: modality and tools
## The decision tree: your logical roadmap
## The Galaxy imaging toolbox
#### A. Standard tools (single images, high-performance & batch)
#### B. Interactive tools (Visual exploration)
### Identifying your modality
### Practice: applying the roadmap
# 6. Common pitfalls to avoid
### 1. The “JPG” trap
### 2. The “merged image” mistake
### 3. Ignoring Saturation
# Conclusion
# 7. Glossary of Bioimage Terms
# 8. Next Steps: Choose your Tutorial

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 152 in f050a38

### Why use the Bio-Formats tool suite?
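
The skipped levels reported here can be reproduced from the heading listing. The following is a minimal Python sketch, not the GTN linter's implementation: `find_skipped_levels` is a hypothetical helper that flags any heading more than one level deeper than the heading before it, run on an excerpt of the listing above.

```python
import re


def find_skipped_levels(headings):
    """Return headings that sit more than one level below the previous heading."""
    skipped = []
    previous_level = 0  # before the first heading, only level 1 is allowed
    for heading in headings:
        match = re.match(r"^(#+)\s+(.*)$", heading)
        if not match:
            continue  # ignore lines that are not ATX headings
        level = len(match.group(1))
        if level > previous_level + 1:
            skipped.append(heading)
        previous_level = level
    return skipped


# Excerpt of the heading listing from the lint report.
headings = [
    "# 2. How to get your images into Galaxy",
    "### Why use the Bio-Formats tool suite?",
    "# 4. The lifecycle of an analysis pipeline",
    "### Stage A: Pre-processing (cleaning)",
    "### Stage B: Segmentation (Defining objects)",
    "## The Galaxy imaging toolbox",
    "#### A. Standard tools (single images, high-performance & batch)",
    "#### B. Interactive tools (Visual exploration)",
    "# 6. Common pitfalls to avoid",
    "### 1. The “JPG” trap",
]

skipped = find_skipped_levels(headings)
for heading in skipped:
    print(heading)
```

On this excerpt the sketch flags the same four headings the report points at (tutorial lines 152, 208, 339, and 427). Promoting each flagged heading by one level, for example `### Why use the Bio-Formats tool suite?` to `##`, would satisfy this check.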

🚫 [GTN Lint] <GTN:028> _{reported by reviewdog 🐶}
You have skipped a heading level, please correct this.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 208 in f050a38

### Stage A: Pre-processing (cleaning)

🚫 [GTN Lint] <GTN:028> _{reported by reviewdog 🐶}
You have skipped a heading level, please correct this.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 339 in f050a38

#### A. Standard tools (single images, high-performance & batch)

🚫 [GTN Lint] <GTN:028> _{reported by reviewdog 🐶}
You have skipped a heading level, please correct this.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 427 in f050a38

### 1. The "JPG" trap

🚫 [GTN Lint] <GTN:046> _{reported by reviewdog 🐶}
Please do not include an # Introduction section, it is unnecessary here, just start directly into your text. The first paragraph that is seen by our infrastructure will automatically be shown in a few places as an abstract.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Lines 1 to 2 in f050a38

    
---
layout: tutorial_hands_on

…rial.md Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

github-actions

Remaining comments which cannot be posted as a review comment to avoid GitHub Rate Limit

GTN Lint

⚠️ [GTN Lint] <GTN:020> _{reported by reviewdog 🐶}
This looks like a heading, but isn't. Please use proper semantic headings where possible. You should check the heading level of this suggestion, rather than accepting the change as-is.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 49 in 6128c35

    
**(To be add: image showing simple workflow 1. Raw image-Pixels, 2. Numerical Grid for intensities, 3. Data extraction, 4. Final Data)**

⚠️ [GTN Lint] <GTN:020> _{reported by reviewdog 🐶}
This looks like a heading, but isn't. Please use proper semantic headings where possible. You should check the heading level of this suggestion, rather than accepting the change as-is.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 64 in 6128c35

**(Add Pete B's Image from his book)**

⚠️ [GTN Lint] <GTN:020> _{reported by reviewdog 🐶}
This looks like a heading, but isn't. Please use proper semantic headings where possible. You should check the heading level of this suggestion, rather than accepting the change as-is.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 126 in 6128c35

    
**(Add example images of how two images that look the same to the human eye have different data)**

🚫 [GTN Lint] <GTN:028> _{reported by reviewdog 🐶}
You have skipped a heading level, please correct this.

Listing of Heading Levels

# Introduction
# 1. Know your data (the “digital anatomy” of an image)
## Pixels and voxels
### Bit depth (the range)
### Spatial calibration (the size)
## The 5 dimensions (5D)
## Bit depth: why it matters for science
# 2. How to get your images into Galaxy
### Why use the Bio-Formats tool suite?
# 3. Before you begin: diagnose your data
# 4. The lifecycle of an analysis pipeline
### Stage A: Pre-processing (cleaning)
### Stage B: Segmentation (Defining objects)
### Stage B.2: The Region of Interest (ROI)
### Stage C: Post-processing (Refining)
### Stage D: Quantification (Extracting numbers)
### Stage E: Validation (The sanity check)
# 4. Finding your workflow: modality and tools
## The decision tree: your logical roadmap
## The Galaxy imaging toolbox
#### A. Standard tools (single images, high-performance & batch)
#### B. Interactive tools (Visual exploration)
### Identifying your modality
### Practice: applying the roadmap
# 6. Common pitfalls to avoid
### 1. The “JPG” trap
### 2. The “merged image” mistake
### 3. Ignoring Saturation
# Conclusion
# 7. Glossary of Bioimage Terms
# 8. Next Steps: Choose your Tutorial

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 152 in 6128c35

### Why use the Bio-Formats tool suite?

🚫 [GTN Lint] <GTN:028> _{reported by reviewdog 🐶}
You have skipped a heading level, please correct this.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 208 in 6128c35

### Stage A: Pre-processing (cleaning)

🚫 [GTN Lint] <GTN:028> _{reported by reviewdog 🐶}
You have skipped a heading level, please correct this.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 339 in 6128c35

#### A. Standard tools (single images, high-performance & batch)

🚫 [GTN Lint] <GTN:028> _{reported by reviewdog 🐶}
You have skipped a heading level, please correct this.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Line 427 in 6128c35

### 1. The "JPG" trap

🚫 [GTN Lint] <GTN:046> _{reported by reviewdog 🐶}
Please do not include an # Introduction section, it is unnecessary here, just start directly into your text. The first paragraph that is seen by our infrastructure will automatically be shown in a few places as an abstract.

training-material/topics/imaging/tutorials/where-to-start-bioimaging-galaxy/tutorial.md

Lines 1 to 2 in 6128c35

    
---
layout: tutorial_hands_on

…rial.md Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Updated tutorial content for clarity and structure.

kostrykin

Very cool, @dianichj! 🚀🪐

A few comments for Section 1 inside.