Data Deduplication

Saying what @happybeing said differently

You store a file

  • it is split and self encrypted in chunks
  • each chunk is sent to the network to be stored
  • for each chunk you pay the network to do the store
  • you have the datamap (list of chunk addresses obtained from the self encryption process IIRC)
  • the network does the storing
  • if the chunk already exists (duplicate chunk) then no store happens
  • you are returned stored_OK type of response. There is no indication that it already existed and no refunds.

In other words there is no response back to you that the chunk existed or not and your chunk store appears exactly the same whether the chunk previously existed or not.

EDIT: for the no refunds being confirmed see Data Deduplication - #15 by frabrunelle

14 Likes