Clinical Content Management Tools

This repository aims to provide a set of scripts and utilities to (hopefully) facilitate the management of clinical content using OpenConceptLab (OCL) and OpenMRS 3 Forms. The tools are designed to automate repetitive tasks across various implementers, facilities, and forms.

The vision behind this set of tools is to evolve into a user-friendly and flexible toolkit covering critical and often cumbersome stages of Health Metadata Management. Here is an overview and progress made on each stage:

%%{
  init: {
    'theme': 'base',
    'themeVariables': {
      'primaryColor': '#f0f3f7',
      'primaryTextColor': '#000',
      'primaryBorderColor': '#f0f3f7',
      'lineColor': '#000',
      'secondaryColor': '#000',
      'tertiaryColor': '#000'
    }
  }
}%%
flowchart LR
    A["<div style='width:250px; height:250px; display:flex; flex-direction:column; justify-content:center; text-align:center;'>
         <div style='font-size:22px;'><b>🏗️<br><br>Paper-To-Form Converter</b></div>
         <div><br>1st tests made, prompting refined</div>
         <div>Target release: June 2025</div>
         </div>"] -->
    B["<div style='width:250px; height:250px; display:flex; flex-direction:column; justify-content:center; text-align:center;'>
         <div style='font-size:22px;'><b>🏗️<br><br>Concept standardization and mapping</b></div>
         <div><br>OCL Mapper v2 in progress</div>
         <div>Target release: June 2025</div>
         </div>"]
    B -->
    C["<div style='width:250px; height:250px; display:flex; flex-direction:column; justify-content:center; text-align:center;'>
         <div style='font-size:22px;'><b>🗓️<br><br>Content creation assistant for OCL</b></div>
         <div><br>Planned</div>
         </div>"]
    C -->
    D["<div style='width:250px; height:250px; display:flex; flex-direction:column; justify-content:center; text-align:center;'>
         <div style='font-size:22px;'><b>🏗️<br><br>Metadata validation assistant</b></div>
         <div><br>Initial tool being prepared</div>
         </div>"]
    D -->
    E["<div style='width:250px; height:250px; display:flex; flex-direction:column; justify-content:center; text-align:center;'>
         <div style='font-size:22px;'><b>🚀<br><br>Metadata to OpenMRS 3 form generation</b></div>
         <div><br>Used with MSF forms</div>
         <div>Automation coverage: 80%</div>
         </div>"]
    E -->
    F["<div style='width:250px; height:250px; display:flex; flex-direction:column; justify-content:center; text-align:center;'>
         <div style='font-size:22px;'><b>🏗️<br><br>Metadata to e2e test cases automation</b></div>
         <div><br>1st tests made, prompting refined</div>
         <div>Target release: June 2025</div>
         </div>"]

Below if an introduction video about the standardization and mapping tooling - Though deprecated and now replaced by OCL Mapper.

Here is an explanation/demo video

Python scripts

OCL concept automatching: matcher.py automates the process of matching OCL concepts.
XLSX to O3 form schema conversion: converter.py converts XLSX files to O3 (OpenMRS 3) form JSON schemas.

Tooling scripts

OCL Source fetcher: fetcher.py download a local snapshot of an OCL source for the automatch.
Source Filter: filter.py creates a filtered version of the source snapshot to improve performance.
Updating the form and translations in your EMR repo: update_form_and_translations.py takes the newly generated form and translation files and updates them in your repo almost instantly.

Requirements

To run these scripts, you will need the following:

Python 3.x
Pandas library
Openpyxl library

You can also run install the required dependencies using: pip install -r requirements.txt

Installation

To get started with the Clinical Content Management Tools, follow these steps:

Clone the repository: git clone https://github.com/michaelbontyes/clinical-content-tools.git
Navigate to the project directory: cd clinical-content-tools
Install Python 3.x (if not already installed):
- For Windows: Download and install Python from the official website: https://www.python.org/downloads/
- For macOS: Install Python using Homebrew: brew install python3
- For Linux: Use the package manager of your distribution (e.g., apt-get install python3 for Ubuntu)
Install pip (if not already installed):
- For Windows: pip is usually included with Python installation. If not, download get-pip.py from https://bootstrap.pypa.io/get-pip.py and run python get-pip.py
- For macOS and Linux: Use the package manager of your distribution (e.g., apt-get install python3-pip for Ubuntu)
Install the required dependencies: pip install -r requirements.txt

Getting started

Create a .env file from the provided .env.example file and update in the required environment variables. You could also use the default values provided in the .env.example file for testing purposes.

cp .env.example .env

This file contains the common configuration variables. You can modify the values as needed.

Common configuration file

sheets: A list of sheet names in the metadata Excel file that contain the concepts to be matched.
OCL_URL: The base URL of the OCL server where the concepts will be matched.
FUZZY_THRESHOLD: The fuzzy string matching threshold (default is 95). This value determines the minimum similarity score required for a match.
METADATA_FILEPATH: The file path of the metadata Excel file containing the concepts to be matched.
OUTPUT_DIR: The directory where the generated form schemas will be saved.
automatch_references: A dictionary containing the details of the OCL sources to be used for matching. Each key in the dictionary represents a source name, and the corresponding value is another dictionary containing the source details.

Usage and configuration for `matcher.py`

The matcher.py script is designed to automate the process of matching OCL concepts based on the provided configuration settings. Below is a detailed explanation of the configuration parameters and their usage.

To use the matcher.py script, you need to provide two input files:

An Excel file containing the data to be matched. Example provided: metadata_example.xlsx
JSON files containing the reference data for matching. Examples provided in ocl_source_snapshots: MSF_Source_Filtered_20240712_163433.json for MSF Source and CIEL_Source_Filtered_20240708_153712.json for CIEL Source

You can configure the destination columns where to write the suggested matches, for each OCL source provided:

source_filepath: The file path of the JSON file containing the concepts from the OCL source.
suggestion_column: The name of the column in the metadata Excel file that contains the suggestions for matching concepts.
external_id_column: The name of the column in the metadata Excel file that contains the external IDs of the concepts.
description_column: The name of the column in the metadata Excel file that contains the descriptions of the concepts.
datatype_column: The name of the column in the metadata Excel file that contains the datatypes of the concepts.
dataclass_column: The name of the column in the metadata Excel file that contains the classes of the concepts.
score_column: The name of the column in the metadata Excel file that contains the scores of the matching concepts.

To use the matcher.py script with the provided configuration and metadata Xlsx file, simply run the script from the command line:

python matcher.py

The script will read the configuration from the config.json file, process the concepts, and generate the form schemas based on the matching results.

Usage and configuration for `converter.py`

Similarly to matcher.py, use the converter.py script with the provided in the Excel file containing the form configuration metadata.

To run the script, use the following command:

python converter.py

The script will then generate OpenMRS 3 form configurations and translation files from the data in the Excel file, and store them in the folder generated_form_schemas. Then you can copy-paste them directly into OpenMRS Initializer folder or Form Builder UI.

Usage and configuration for `update_form_and_translations.py`

This script is designed to be executed after converter.py. It updates the form and translation files in the distro repo using the newly generated files from the generated_form_schemas/ folder. The script relies on properties defined in the .env file to locate the distro repository and its relevant directories.

You can configure the following properties in the .env file:

PATH_TO_FORM_FILES: The absolute path to the ampathforms/ folder where the form JSON files are stored. Use the pwd command in your ampathforms/ directory to get the exact path.
PATH_TO_TRANSLATION_FILES: The absolute path to the ampathformtranslation/ folder where the translation files are stored. Use the pwd command in your ampathformtranslations/ directory to get the exact path.

Note: The form and translation files must already exist in the distro repository. If they do not, you can manually copy the generated files into the appropriate directories. This script is intended to update existing files, minimizing manual copy-paste operations for developers or users.

To execute the script, use the following command:

python update_form_and_translations.py

You can chain the execution of converter.py and update_form_and_translations.py as follows:

python converter.py && python update_form_and_translations.py

This will:

Generate the form and translation files using converter.py.
Update the pages property in the form JSON files and the translations in the respective translation files within your distro repository.

Calculation Features

The form generator supports two main types of calculations that can be configured in the Excel metadata file:

Previous Observation Values: Fetch the most recent observation value for a concept from previous encounters
Cross-References: Reference values from other questions within the same form

Fetching Previous Values

To fetch a value from previous encounters, add previous or latest in the Calculation column:

Question         | External ID            | Datatype | Calculation
-----------------|------------------------|----------|------------
Last PHQ-9 score | depressionSeverityScale| coded    | previous

This will generate a calculation that fetches the most recent value:

{
  "calculateExpression": "api.getLatestObs(patient.id, 'depressionSeverityScale').then(obs => obs?.valueCodableConcept?.code)",
  "readonly": true
}

Cross-Referencing Questions

To reference another question's value within the same form, use ref: prefix followed by the question ID:

Question        | External ID    | Datatype  | Calculation
----------------|----------------|-----------|-------------
MHOS score      | mhos_score     | numeric   |
Last MHOS score | last_mhos_score| numeric   | ref:mhosScore

This will generate:

{
  "calculateExpression": "api.getLatestObs(patient.id, 'mhosScore').then(obs => obs?.valueQuantity?.value)",
  "readonly": true
}

Important Notes:

Value Accessors: The script automatically selects the correct value accessor based on datatype:
- numeric: valueQuantity?.value
- coded: valueCodableConcept?.code
- text: valueString
Required Fields:
- For previous values: External ID must be filled
- For cross-references: Referenced question ID must exist in the form
- Datatype must be correctly specified
Readonly Behavior: All calculated fields are set as readonly by default

Contributing

Contributions to the Clinical Content Management Tools project are welcome! If you have any suggestions, improvements, or bug fixes, please feel free to open an issue or submit a pull request.

Acknowledgments

The Clinical Content Management Tools project is made possible thanks to OpenConceptLab and OpenMRS communities. Special thanks to the contributors who have contributed to the development of these tools.

Contact

For any questions, please contact Michael Bontyes or reach out to the OpenConceptLab Squad and OpenMRS community.

License

This project is licensed under the MIT License. See the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
.github/workflows		.github/workflows
generated_form_schemas		generated_form_schemas
metadata		metadata
ocl_source_snapshots		ocl_source_snapshots
openfn/jobs		openfn/jobs
.env.example		.env.example
.gitignore		.gitignore
2025 MHPSS - Counseling Base Line Consultation Form_EN.pdf		2025 MHPSS - Counseling Base Line Consultation Form_EN.pdf
LICENSE		LICENSE
README.md		README.md
checker.py		checker.py
cleaner.py		cleaner.py
concepts.csv		concepts.csv
config.json		config.json
converter.py		converter.py
creator.py		creator.py
fetcher.py		fetcher.py
filter.py		filter.py
matcher-andy.py		matcher-andy.py
matcher.py		matcher.py
metadata_example.xlsx		metadata_example.xlsx
ocl_openmrs_checker.py		ocl_openmrs_checker.py
requirements.txt		requirements.txt
translate.py		translate.py
translation_dict.csv		translation_dict.csv
translator.py		translator.py
update_form_and_translations.py		update_form_and_translations.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Clinical Content Management Tools

Below if an introduction video about the standardization and mapping tooling - Though deprecated and now replaced by OCL Mapper.

Python scripts

Tooling scripts

Requirements

Installation

Getting started

Common configuration file

Usage and configuration for `matcher.py`

Usage and configuration for `converter.py`

Usage and configuration for `update_form_and_translations.py`

Calculation Features

Fetching Previous Values

Cross-Referencing Questions

Important Notes:

Contributing

Acknowledgments

Contact

License

About

Uh oh!

Uh oh!

Contributors 5

Uh oh!

Languages

License

MadiroGlobalHealth/clinical-content-tools

Folders and files

Latest commit

History

Repository files navigation

Clinical Content Management Tools

Below if an introduction video about the standardization and mapping tooling - Though deprecated and now replaced by OCL Mapper.

Python scripts

Tooling scripts

Requirements

Installation

Getting started

Common configuration file

Usage and configuration for matcher.py

Usage and configuration for converter.py

Usage and configuration for update_form_and_translations.py

Calculation Features

Fetching Previous Values

Cross-Referencing Questions

Important Notes:

Contributing

Acknowledgments

Contact

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors 5

Uh oh!

Languages

Usage and configuration for `matcher.py`

Usage and configuration for `converter.py`

Usage and configuration for `update_form_and_translations.py`