Skip to content

Cleaning the DVC

Hello,

At the moment the dvc-cache-de340 / ClimateDT-phase2 branch without the dvc checkout weights 1.1G which is obviously not good.

The summary:

16K	README.md
512	configuration.yaml
41M	ifsdata
109M	nemo
7.3M	restarts
8.7M	tco1279l137
49M	tco2559l137
1.5M	tco399l137
152M	tco79l137
712M	.git/
21K	.dvc/
512	.dvcignore

The .git is taking most of the space (though I can do some cleaning in the tco79 folder and nemo sending more files to the dvc)

The heavy part of the .git are the packs: /gpfs/scratch/ehpc01/bsc998159/repositories/dvc-cache-de340/.git/objects/pack

I'm looking into what can we safely do with this

cc @kkeller @nrocha