[R] Using DVC to create an efficient version control system for data projects
The following article shows how the DVC (Data Version Control) tool helped a fintech data team deal with production data files, such as trained machine learning models, and gave them a reliable way to version those files throughout project development: Using DVC to create an efficient version control system for data projects
The tool brought versioning for inputs, intermediate files and models. According to the article, this drastically increased productivity by providing a clean framework for managing data and an easy way to split a project into atomic steps.
To make it more concrete, the article illustrates this with a real project on VAT auto-detection from receipts: automatically retrieving the value-added tax amount from a receipt document in order to simplify accounting work.
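For anyone unfamiliar with the "atomic steps" idea, here is a minimal Python sketch of the general principle: each step declares its input and output files, and is rerun only when the content hash of an input changes. This is not DVC's code or API, just an illustration under my own assumptions; `run_stage`, `file_hash`, `demo`, the file names and the JSON lock file are all hypothetical.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def file_hash(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's contents."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def run_stage(name, deps, outs, action, lock_path):
    """Run `action` only if a dependency changed or an output is missing.

    Returns True if the stage ran, False if it was skipped.
    (Hypothetical helper, not part of DVC.)
    """
    lock = json.loads(lock_path.read_text()) if lock_path.exists() else {}
    current = {str(d): file_hash(d) for d in deps}
    up_to_date = lock.get(name) == current and all(o.exists() for o in outs)
    if up_to_date:
        return False
    action()
    # Record the input hashes that produced the current outputs.
    lock[name] = current
    lock_path.write_text(json.dumps(lock, indent=2))
    return True


def demo():
    """Run one made-up 'clean' step three times and report which runs executed."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        raw = root / "receipts.txt"          # hypothetical input file
        clean = root / "clean.txt"           # hypothetical intermediate file
        lock = root / "pipeline.lock.json"   # hypothetical lock file
        raw.write_text("Total 12.00 VAT 2.00\n")

        def clean_step():
            clean.write_text(raw.read_text().lower())

        results = [run_stage("clean", [raw], [clean], clean_step, lock)]
        # Nothing changed, so the second call is skipped.
        results.append(run_stage("clean", [raw], [clean], clean_step, lock))
        # Changing the input invalidates the recorded hash and forces a rerun.
        raw.write_text("Total 24.00 VAT 4.00\n")
        results.append(run_stage("clean", [raw], [clean], clean_step, lock))
        return results
```

Calling `demo()` runs the step, skips it while the input is unchanged, then reruns it after the input is edited. As I understand it, a real tool like DVC adds a lot on top of this (remote storage, Git integration, multi-step dependency graphs), so treat the sketch as a mental model only.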
submitted by /u/thumbsdrivesmecrazy