IDTrack

Cross-Temporal and Cross-Database Biological Identifier Mapping

Modern biology constantly mixes identifiers from different years, databases, and genome builds. The result is a familiar set of problems: IDs disappear, symbols change, references disagree, and “the same gene” isn’t always represented the same way across datasets.

IDTrack is built for that reality. It provides a time-aware, audit-friendly way to translate and harmonize biological identifiers across Ensembl releases and across external namespaces (HGNC, UniProt, RefSeq, Entrez, …), while keeping ambiguity explicit instead of silently forcing a single answer.

What makes IDTrack different

Time-aware mapping: treat Ensembl releases as a “time axis” and travel forward/backward through identifier history.
Assembly-aware mapping: harmonize identifiers across genome builds (e.g. GRCh37 ↔ GRCh38) and respect external databases that are assembly-scoped.
Snapshot boundary for reproducibility: build a release-bounded graph snapshot so results are stable and repeatable.
Explicit external database opt-in: choose which external namespaces participate via a small, editable YAML contract.
Transparency over coercion: conversions are naturally classified as 1→0 (no match), 1→1 (clean), or 1→n (ambiguous).
Scale-ready workflows: caching and snapshot reuse make repeated conversions and multi-dataset harmonization practical.

Who is it for?

Wet-lab researchers who need a reliable, step-by-step path from “my gene list is old” to “my analysis is reproducible”.
Bioinformaticians who want release-pinned, auditable conversions in notebooks, pipelines, and integration workflows.
Atlas builders / integrators who need to harmonize gene identifiers across many cohorts (different Ensembl releases, symbols, and external IDs), keep an explicit audit trail of what mapped/failed/was ambiguous, and ship a release-pinned, reproducible feature space for downstream integration and publication.

Common use cases

Dataset harmonization before integration (single-cell, bulk, atlas-scale collections).
Legacy data rescue (old Ensembl releases, mixed symbols/IDs, retired identifiers).
Publication-grade reproducibility (pin a snapshot boundary + share the exact external configuration).
Cross-database interoperability when collaborators use different identifier conventions.

Documentation and tutorials

The documentation includes a full tutorial suite designed to be the primary learning resource:

Documentation: Documentation
Tutorials: start from the “Tutorials” section in the docs (Part 0 → Part 7).

Name		Name	Last commit message	Last commit date
Latest commit History 141 Commits
.github/workflows		.github/workflows
docs		docs
idtrack		idtrack
makefiles		makefiles
reproducibility		reproducibility
tests		tests
.bandit.yml		.bandit.yml
.darglint		.darglint
.editorconfig		.editorconfig
.flake8		.flake8
.gitattributes		.gitattributes
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
.readthedocs.yml		.readthedocs.yml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CODE_OF_CONDUCT.rst		CODE_OF_CONDUCT.rst
LICENSE		LICENSE
Makefile		Makefile
README.rst		README.rst
codecov.yml		codecov.yml
example_manual_running copy.ipynb		example_manual_running copy.ipynb
noxfile.py		noxfile.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

IDTrack

Cross-Temporal and Cross-Database Biological Identifier Mapping

What makes IDTrack different

Who is it for?

Common use cases

Documentation and tutorials

About

Uh oh!

Releases 5

Packages

Uh oh!

Languages

License

theislab/idtrack

Folders and files

Latest commit

History

Repository files navigation

IDTrack

Cross-Temporal and Cross-Database Biological Identifier Mapping

What makes IDTrack different

Who is it for?

Common use cases

Documentation and tutorials

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Languages

Packages