I restored data (107.4 GB) on hostB from a repository that was backed up from hostA.
I then ran a backup of the same 107.4 GB on hostB to create a baseline snapshot.
Kopia took about as long as usual but uploaded 572.4 MB…
The output below is from another snapshot that shows the same kind of partial re-upload…
I am on a Windows platform.
The one above was restored using the show lzzzzzz trick, which worked…
saviodsouza@savio:D:\Personal\vmware\GNS3 VM
2020-09-21 19:00:43 IST k617b8c069d13de55dfe9fa15d4b12b53 205.2 GB drwxrwxrwx files:10 dirs:1 (latest-2)
2020-09-22 08:52:32 IST k617b8c069d13de55dfe9fa15d4b12b53 205.2 GB drwxrwxrwx files:10 dirs:1 (latest-1,annual-1,monthly-1,weekly-1)
saviodsouza@savio-pc:C:\userdata\vmware
2020-09-24 17:09:37 IST k177c8e93d6cad794da25be808aa2f513 205.2 GB dr-xr-xr-x files:10 dirs:1 (latest-3,weekly-2,daily-2)
2020-09-28 12:23:48 IST kfcd48f1b17febcbe577e29daacc1beec 205.2 GB dr-xr-xr-x files:10 dirs:2 (latest-2,hourly-2)
2020-09-28 14:50:45 IST kdcfc3b3e40b8e580afde0ecb2f5aec49 312.7 GB dr-xr-xr-x files:11 dirs:2 (latest-1,annual-1,monthly-1,weekly-1,daily-1,hourly-1)
kopia.exe diff k617b8c069d13de55dfe9fa15d4b12b53 k177c8e93d6cad794da25be808aa2f513
changed ./GNS3 VM-disk1.vmdk at 2020-09-21 14:29:51.0350172 +0530 IST (size 4224385024 -> 4224385024)
changed ./GNS3 VM-disk2.vmdk at 2020-09-21 14:29:50.3283905 +0530 IST (size 200976891904 -> 200976891904)
We analyzed the data on Slack. The majority of the differences can be attributed to the fact that zstd compression is in use, and it does not always guarantee byte-for-byte identical output for the same input. Assuming hostA and hostB had different CPU characteristics (number of cores, etc.), that explains most of the difference.
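The general point, that two compressed streams can differ byte for byte and still decode to the same data, can be illustrated with a small sketch. It uses Python's standard-library zlib as a stand-in for zstd, and two different compression levels stand in for whatever differs between the hosts; the sample data is made up, and none of this is Kopia's code.

```python
import zlib

# Made-up sample data; any input works for this illustration.
data = b"GNS3 VM-disk1.vmdk sample block " * 4096

# Two valid compressed encodings of the same input.
fast = zlib.compress(data, 1)
best = zlib.compress(data, 9)

# The compressed bytes differ...
print("compressed streams identical:", fast == best)
# ...but both decode back to exactly the same input.
print("decoded data identical:", zlib.decompress(fast) == zlib.decompress(best) == data)
```

Anything that compares or stores the compressed bytes would treat these as different, even though the underlying data is unchanged.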
There is, however, a difference in splitter output too, which we’re still trying to get to the bottom of: all split points except the first 3 are the same, but it appears that the first 6291640 bytes got split differently, as (1685450+2132853+2473337) vs (2097152+2097152+2097336).
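As a quick sanity check on those numbers, the sketch below adds up both sets of chunk lengths and expresses the second set in MiB. The variable names are only for illustration.

```python
# Chunk lengths quoted in the post above for the start of the file.
split_a = [1685450, 2132853, 2473337]
split_b = [2097152, 2097152, 2097336]

MIB = 1024 * 1024

total_a = sum(split_a)
total_b = sum(split_b)

# Both splits should cover the same leading byte range.
print("totals:", total_a, total_b)

# Express each chunk of the second split as (whole MiB, leftover bytes).
print("second split in MiB:", [divmod(n, MIB) for n in split_b])
```

Both sets sum to 6291640 bytes, so the two runs agree on where the third chunk ends. In the second set the first two chunks are exactly 2 MiB each; the thread does not say why.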
hostA and hostB are actually the same PC in this case…
I had just reformatted my PC to fix some common issues in Windows.
If a file in Windows is fragmented, could that be a reason for the splitting issue…
For example, before I formatted my PC, the Centos8.vdi file was not fragmented when I first ran the backup.
After I formatted and did a kopia restore, Centos8.vdi got fragmented.
I then ran a backup again, and because the file was now fragmented on disk, the splitting worked differently…
What do you mean by “fragmented”? I am pretty sure that kopia doesn’t “see” file fragmentation. It simply opens a file and ingests all (logical) blocks, so even if a file was fragmented on disk, this would not be visible to kopia.
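To illustrate why on-disk layout should not matter, here is a toy content-defined splitter. It is not Kopia's splitter: the hash, mask, and size limits are all invented for this sketch. It reads the same logical bytes in two different read sizes (a rough stand-in for different physical layouts) and compares the resulting chunk lengths.

```python
import io
import random


def toy_split(data, mask=0x3FF, min_len=256, max_len=4096):
    """Return chunk lengths; cut when the toy hash matches the mask or max_len is hit."""
    lengths = []
    start = 0
    h = 0
    for i, b in enumerate(data):
        # Toy hash: older bytes are gradually shifted out of the 32-bit value.
        h = ((h << 1) + b) & 0xFFFFFFFF
        length = i - start + 1
        if (length >= min_len and (h & mask) == 0) or length >= max_len:
            lengths.append(length)
            start = i + 1
            h = 0
    if start < len(data):
        lengths.append(len(data) - start)  # trailing partial chunk
    return lengths


def read_all(stream, block_size):
    """Read a stream to the end using reads of block_size bytes."""
    parts = []
    while True:
        block = stream.read(block_size)
        if not block:
            break
        parts.append(block)
    return b"".join(parts)


random.seed(0)
data = bytes(random.getrandbits(8) for _ in range(100_000))

# Same logical content, read with two different block sizes.
chunks_a = toy_split(read_all(io.BytesIO(data), 4096))
chunks_b = toy_split(read_all(io.BytesIO(data), 777))

print("same chunk lengths:", chunks_a == chunks_b)
print("all bytes covered:", sum(chunks_a) == len(data))
```

The splitter only ever sees the byte sequence, so as long as the file contents are identical, a splitter of this kind produces the same cut points no matter how the bytes were laid out or read.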