[N] Using Prodmodel to speed up data science development and productionization
I built a tool which keeps track of all code and data deps of your data science project. It caches partial results and can figure out if a particular output (model, transformed data, code library) has to be recomputed before the fact. This can save huge amounts of time during an iterative development process.
Setting up usage is similar to build systems like Bazel or Make (https://github.com/prodmodel/prodmodel/blob/master/example/build.py).
Feedback, users, contributors and constructive criticism is welcome!