pangaeapy - a Python module to access and analyse PANGAEA data

Background

PANGAEA (https://www.pangaea.de) is one of the world's largest archives of this kind offering essential data services such as data curation, long-term data archiving and data publication. PANGAEA hosts about 400,000 datasets comprising around 17.5 billion individual measurements (Aug. 2020) and observations which have been collected during more than 240 international research projects. The system is open to any project, institution or individual scientist using, archiving or publishing research data.

Since the programming languages Python and R have become increasingly important for scientific data analysis in recent years, we have developed 'pangaeapy' a new, custom Python module that considerably simplifies typical data science tasks.

Given a DOI, pangaeapy uses PANGAEA’s web services to automatically load PANGAEA metadata into a dedicated python object and tabular data into a Python Data Analysis Library (pandas) DataFrame with a mere call of a specialized function. This makes it possible to integrate PANGAEA data with data from a large number of sources and formats (Excel, NetCDF, etc.) and to carry out data analyses within a suitable computational environment such as Jupyter notebooks in a uniform manner.

Installation

Source code from github
- pip install git+https://github.com/pangaea-data-publisher/pangaeapy.git
Wheels for Python from PyPI
- pip install pangaeapy

Usage

import pangaeapy.pandataset as pd

ds = pd.PanDataSet(787140)
print(ds.title)
print(ds.data.head())

Examples

Please take a look at the example Jupyter Notebooks which you can find in the 'examples' folder

Documentation

https://github.com/pangaea-data-publisher/pangaeapy/blob/master/docs/pandataset.md

Running the tests

The tests ar located in the test directory. You can run them by executing pytest or tox in the root directory.

Cite as

Robert Huber, Egor Gordeev, Markus Stocker, Aarthi Balamurugan, & Uwe Schindler (2020). pangaeapy - a Python module to access and analyse PANGAEA data. Zenodo. http://doi.org/10.5281/zenodo.4013940.

Name		Name	Last commit message	Last commit date
Latest commit History 233 Commits
docs		docs
examples		examples
src/pangaeapy		src/pangaeapy
test		test
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE.md		LICENSE.md
README.md		README.md
RELEASE_GUIDELINES.md		RELEASE_GUIDELINES.md
conf.py		conf.py
release.sh		release.sh
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pangaeapy - a Python module to access and analyse PANGAEA data

Background

Installation

Usage

Examples

Documentation

Running the tests

Cite as

About

Releases 4

Packages

Contributors 12

Languages

License

pangaea-data-publisher/pangaeapy

Folders and files

Latest commit

History

Repository files navigation

pangaeapy - a Python module to access and analyse PANGAEA data

Background

Installation

Usage

Examples

Documentation

Running the tests

Cite as

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 4

Packages 0

Contributors 12

Languages

Packages