I am trying to get a good understanding of datachains, and even though I have been reading a lot, I still have some noob questions. I would really appreciate it if someone could help me understand.
-
Where is the datachain hosted? Is it public like the blockchain? Is there just one datachain hosted on the SAFE Network, or is it duplicated so that each close group maintains its own copy? My understanding is that the datachain is scattered among groups within the network: nodes know part of the chain and validate it, but nobody except the network itself has a clear picture of the datachain as a whole. Am I correct?
-
The datachain is about data identifiers, not the data itself. I think I understand this. Still, if the network grows a lot, how do we make sure the datachain does not grow so big that it becomes difficult to handle, takes a lot of resources, or weighs on performance? How does the datachain scale compared to a blockchain? I read David’s answer that some old blocks of the datachain can be deleted, but what if some data identifiers within these blocks need to be kept alive because they are still relevant? Do we then need to keep the full block? The datachain could become a very complex thing to maintain.
-
The concept of nodes of groups lost me. A group can be made of up to 32 computers. How many groups does it take to make a node? Will some nodes only have the task of maintaining the identifiers and not the data? In that case, these nodes would be crucial to maintaining the network's consistency, and attacking them could have a more severe impact than just data loss, wouldn't it?
-
The attack we had with alpha1 is easy to prevent in beta, and spamming the network won’t be possible once we add friction through the safecoin implementation. Attacking the network by providing a massive number of nodes and then killing them all once they reach a good rank seems much more dangerous to me. Is the node rank public?
-
On a more general note, I have only been around for 2-3 months, so there are things I am missing. My question is: how come something that seems so crucial to the SAFE Network as the datachain, something that sets SAFE apart so distinctively and is at the heart of the network, is only being developed and tested now? Couldn’t the datachain algorithms have been built before?
Please understand that my tone is not aggressive at all. I am just trying to understand and catch up with all of you.