This repository contains useful notebook examples on how to extract and visualize breast cancer data from the World Health Organization (WHO) database.
Example notebooks are available in the apps/ directory.
They can also be accessed by clicking their respective CoLab badge below:
The Notebooks were tested against Python 3.8 on macOS operating system. Continuous integration tests are also performed for all notebooks on Ubuntu 20.04 operating system (see here). They are also compatible with Google CoLab and should be quite robust against other setups.
The following dependencies were used and are installed as part of the notebooks:
Note that all rendered results could be interactive, but we have set Plotly's show(renderer="png")
,
to enable the result to be shown in the output on CoLab, on GitHub, and locally. To get interactive
figures on CoLab set show(renderer="colab")
, or locally use the default show()
. GitHub only support static figures.
The code in this repository is released under MIT license.
Note that the CSV files originate from WHO-related websites. Global data from Cancer Today (powered by GLOBOCAN 2020) and Nordic data from NORDCAN. If you use these figures or datasets, please, cite the appropriate references. Information on how to cite these can be found here and here, respectively.
If you found this project useful, please, consider citing it in your research article:
@software{andre_pedersen_2023_8224208,
author = {André Pedersen},
title = {andreped/breast-cancer-stats: v0.1.0},
month = aug,
year = 2023,
publisher = {Zenodo},
version = {v0.1.0},
doi = {10.5281/zenodo.8224207},
url = {https://doi.org/10.5281/zenodo.8224207}
}