Overview
databooks
is a package to ease the collaboration between data scientists using
Jupyter notebooks, by reducing the number of git conflicts between
different notebooks and resolution of git conflicts when encountered.
The key features include:
- CLI tool
- Clear notebook metadata
- Resolve git conflicts
- Simple to use
- Simple API for using modelling and comparing notebooks using Pydantic
Requirements
databooks
is built on top of:
Installation
pip install databooks
Usage
Clear metadata
Simply specify the paths for notebook files to remove metadata. By doing so, we can already avoid many of the conflicts.
$ databooks meta [OPTIONS] PATHS...
Fix git conflicts for notebooks
Specify the paths for notebook files with conflicts to be fixed. Then, databooks
finds
the source notebooks that caused the conflicts and compares them (so no JSON manipulation!)
$ databooks fix [OPTIONS] PATHS...
License
This project is licensed under the terms of the MIT license.