Loading...
Discovering amazing AI tools


Open-source data version control system that brings Git-like workflows to datasets, models, and ML experiments.

Open-source data version control system that brings Git-like workflows to datasets, models, and ML experiments.
DVC (Data Version Control) is an open-source tool that provides Git-like versioning and workflow primitives for data, models, and ML experiments. It stores lightweight metafiles in Git while tracking large files, datasets, and model artifacts in external storage (S3, GCS, Azure, SSH, etc.), enabling reproducible pipelines and efficient cache-based transfers. DVC also provides pipeline orchestration (DVC pipelines), experiment tracking and comparison, metrics and plots, and integrations with CI/CD and Git hosting to support collaboration across data science and engineering teams. The tool is designed to make ML projects reproducible, shareable, and easier to manage without forcing teams to leave familiar Git workflows.






