Skip to content

datajoint/datajoint-python

Repository files navigation

DataJoint for Python

DataJoint is a framework for scientific data pipelines based on the Relational Workflow Model — a paradigm where your database schema is an executable specification of your workflow.

  • Tables represent workflow steps — Each table is a step in your pipeline
  • Foreign keys encode dependencies — Parent tables must be populated before child tables
  • Computations are declarative — Define what to compute; DataJoint handles when
  • Results are immutable — Full provenance and reproducibility

Documentation: https://docs.datajoint.com

📘 Upgrading from legacy DataJoint (pre-2.0)? See the Migration Guide for a step-by-step upgrade path.

PyPI pypi Conda conda Tests tests
License Apache-2.0 Citation DOI Coverage coverage

Installation

pip install datajoint

or with Conda:

conda install -c conda-forge datajoint

Example Pipeline

pipeline

Yatsenko et al., bioRxiv 2021

Resources

Contributing

See CONTRIBUTING.md for development setup and guidelines.