RFC 57: Safecoin Revised

This will yield a reward which is 2 × the payment amount.

If StoreCost is 0.01 (about 1% full vaults in a 10,000-vault network)

The total farmable Safecoin (4,525 million coins) would, with the current settings, be farmed after about 350,000 TB has been stored.

What was the point of changing the PUT price and GET rewards? Is the plan to slightly alter the original vision, or to move to something closer to a new algorithm?

2 Likes

It is just simpler for the version 1 farming rate calculations. We do not expect to launch with that, but we do expect the MVP to use it.

6 Likes

That's what I've seen with ecommerce. A data structure having an integer and a currency code (for me, USD, where the integer represents base units, aka pennies). When you want it shown on the front-end, a helper function is available that formats it with a dollar sign, decimal point, thousands separator (if the amount is large enough), and two decimal places. And not just ecommerce. For example, Ethereum has a base unit ("wei") with 1 ether = 10^18 wei.
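To make that concrete, here is a minimal sketch of the idea in Rust (the names are hypothetical, not any particular library's API): the amount lives as an integer of base units, and a display helper adds the symbol, separators, and decimal places only at the edge.

// Hypothetical sketch: money stored as integer base units plus a
// currency code; formatting happens only at the display layer.
struct Money {
    amount: i64,            // base units, e.g. cents for USD
    currency: &'static str, // ISO-style code, e.g. "USD"
}

fn format_money(m: &Money) -> String {
    let symbol = match m.currency {
        "USD" => "$",
        _ => "",
    };
    let units = m.amount / 100;
    let cents = (m.amount % 100).abs();
    // Insert thousands separators into the whole-unit part.
    let digits: Vec<char> = units.abs().to_string().chars().collect();
    let mut whole = String::new();
    for (i, c) in digits.iter().enumerate() {
        if i > 0 && (digits.len() - i) % 3 == 0 {
            whole.push(',');
        }
        whole.push(*c);
    }
    let sign = if m.amount < 0 { "-" } else { "" };
    format!("{sign}{symbol}{whole}.{cents:02}")
}

fn main() {
    let price = Money { amount: 123_456_789, currency: "USD" };
    assert_eq!(format_money(&price), "$1,234,567.89");
}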

Rounding can come into play when multiplication/division is involved, e.g. percentages (tax, coupons, sending a percentage to the content owner, the MaidSafe devs, etc.). To me it seems difficult to do with a data type that splits units and parts into two variables, and prone to confusion and buggy code. Arithmetic like "add $1.65 to $0.75 and divide by 3" is easy when working with a datatype that represents currency amounts as base units: 165 + 75 = 240, and 240 / 3 = 80. You're not having to carry values back and forth between the units variables and the parts variables. It also makes multiplication and division much simpler.

I think it's more work than 2 separate operations. I'm trying to think how to do the same basic arithmetic I mentioned above ("add $1.65 to $0.75 and divide by 3") if represented with units and parts variables…

When you add, you get 1 unit and 140 parts. You do integer division by 100 on the parts to extract the hundreds place into a temp variable, and add that temp variable's value to the units variable to get a 2 in the units place. Then you do modulo 100 on the 140 parts and overwrite, so that the parts variable has 40. That's kind of easy compared to what comes next: we haven't performed the division yet. Let's just say it involves dividing both parts independently, multiplying the units remainder by 100, and adding the result to the parts. And losing a little bit along the way.
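As an illustration of how much bookkeeping that is, here is a sketch of the units/parts arithmetic described above (the representation is hypothetical; 100 parts per unit is used just for readability):

// Illustrative sketch of units/parts arithmetic with explicit carrying.
const PARTS_PER_UNIT: u64 = 100;

// Add two (units, parts) amounts, carrying overflowed parts into units.
fn add(a: (u64, u64), b: (u64, u64)) -> (u64, u64) {
    let parts = a.1 + b.1;
    (a.0 + b.0 + parts / PARTS_PER_UNIT, parts % PARTS_PER_UNIT)
}

// Divide a (units, parts) amount by n, pushing the unit remainder down
// into parts; the sub-part remainder is simply lost.
fn div(x: (u64, u64), n: u64) -> (u64, u64) {
    let units = x.0 / n;
    let parts = (x.0 % n * PARTS_PER_UNIT + x.1) / n;
    (units + parts / PARTS_PER_UNIT, parts % PARTS_PER_UNIT)
}

fn main() {
    // "add $1.65 to $0.75 and divide by 3"
    let sum = add((1, 65), (0, 75));
    assert_eq!(sum, (2, 40));
    assert_eq!(div(sum, 3), (0, 80)); // $0.80
}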

Alternatively, I suppose you might be able to use the units and parts variables to set a float (double), perform your math with the float, convert values from the float back to the integers for storage, and hope that rounding quirks won't cause a problem this time. I oversimplified this example by treating the units variable like pennies, not 250 pico-coins.

Maybe there's an easier way to do arithmetic with this data type, but I haven't thought of it. Maybe if operator overloading is used (if it is available / supported by the language). Or maybe using helper functions like Math.round(Math.divide(Math.add(x, y), 3)). It seems much easier/cleaner to go to a single 64-bit integer, or a float (double).
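For comparison, a sketch of the single-integer version with operator overloading (Rust's std::ops::Add; the Amount type is hypothetical):

use std::ops::Add;

// A single integer of base units, e.g. cents or nano-coins.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Amount(u64);

impl Add for Amount {
    type Output = Amount;
    fn add(self, rhs: Amount) -> Amount {
        // checked_add makes overflow a visible failure, not a silent wrap.
        Amount(self.0.checked_add(rhs.0).expect("amount overflow"))
    }
}

fn main() {
    let total = Amount(165) + Amount(75); // $1.65 + $0.75
    assert_eq!(total, Amount(240));
    assert_eq!(total.0 / 3, 80); // divide by 3 => $0.80
}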

1 Like

So then I guess an MVP is not the launch version? Are you referring to a testnet as the MVP?


When a node cannot store a chunk due to being at capacity, the chunk is stored on the next closest node. This is a relocated chunk, and Elders should keep a note of relocated chunks. As this is handled via PARSEC, it SHOULD NOT present any issue. Should a node pretend to store chunks, it puts itself in danger of losing all Age. For this reason, when a node is asked to send a chunk to a requester and does not, the Elders should vote to kill the node.

This could be a problem if coupled with the change in farming rewards. A node could still pretend to store the chunk, then when asked to send that chunk, it would make a proxy fetch from another node.

Outsourcing Attack: Malicious storage providers could commit to store more data than the amount they can physically store, relying on quickly fetching data from other storage providers.

– Novel constructions for Proof-of-Replication · Issue #4 · protocol/research · GitHub

That's not a problem if the old rewarding algo is applied, as only the fastest node is rewarded, thus counteracting proxying.

3 Likes

It is not a problem anyway IMO; we pay nodes to do stuff, we don't really care how they do it, if that makes sense. So reward all those who can do the service we want.

Yes that is correct. Start with all features, then refine when everyone has the big picture.

3 Likes

I looked into some coin denomination ideas from before. I think they could work.

struct Coin {
    address: u128, // maybe encoded in some packed format like UTF-8
}

If the biggest denominations are at the beginning with a few coins, we could add a new set of smaller coins at the end whenever the largest Safecoin denomination becomes too big to be practical. We would never run into a problem of divisibility. If the number of the coins in each denomination is predetermined, we could have a practically infinite supply of coins with a finite total Safecoin value.

Rewards would be sampled from a small subset of the available coins, maybe from a sliding window placed over all the coins, selecting just 1-2 denominations. I think it should be possible to set the window so that the average reward is whatever the network decides and, if vault rewards came in many but smaller coins, people with just one or two vaults would still get paid frequently enough.
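To make the sliding-window idea concrete, here is a rough sketch (every name and number in it is hypothetical; it only shows the selection mechanics, not how the network would tune the window):

// Denomination values in base units, largest first; the window picks a
// small run of adjacent denominations that rewards are drawn from.
fn window_denominations(denoms: &[u64], start: usize, width: usize) -> &[u64] {
    let end = (start + width).min(denoms.len());
    &denoms[start..end]
}

fn main() {
    let denoms = [1_000_000_000, 100_000_000, 10_000_000, 1_000_000, 100_000];
    // Select 2 adjacent denominations; sliding `start` toward the small
    // end lowers the average value of the coins paid out.
    let active = window_denominations(&denoms, 2, 2);
    assert_eq!(active, &[10_000_000, 1_000_000]);
}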

3 Likes

Not if it's another node/vault in the same section storing the actual data; it would decrease the replication ratio. If my vault is able to fetch that data (maybe even masking itself as a client) and another vault (another replica) in the same section would reply to the fetch request in a timely manner…

4 Likes

Good point; however, I think that is OK as long as both do what they should. Node age, relocation etc. will separate them into different sections, Elders in their own section will separate them as well, and so on. It is related to Sybil prevention and works in this case as well. If nodes do that I think we are fine, though they may be taking risks.

I think it could be seen from different angles. I am replying to a huge number of posts right now (internally) so wanted to give some context here. Hope it helps.

3 Likes

More than 10% of them, at least as of a few years back, IIRC.

1 Like

But in SN we have a more stable reward; there's no halving.

4 Likes

Good point, and it should be considered more. Perhaps dev rewards should be related to the available coins left to be farmed, or similar? Worth digging into that.

10 Likes

This post was a lot longer, but I found a lot of stuff answered itself as I wrote it. At the risk of stating the obvious, this is a very detailed and nuanced RFC!


From Unresolved questions:

"we currently require payment for the creation of a new CoinBalance instance, whereas here the vault would need to create one without being able to pay for its creation."

Some options

  • Elders pay for it. Elders already have both higher income and higher costs. It's a tiny amount of extra work/cost to pay for a new wallet for the new vault as part of the overall burden of a new vault joining. It could be baked in that, if the new vault's CoinBalance already exists, the Elders credit it for being a well-prepared citizen.

  • Have the 'from' field be the aggregate section BLS key, which is accepted even though it has no balance. The check for valid creation becomes a two-stage test: a) 'from' is able to pay, or b) 'from' is a section.

  • Have Elders act as an escrow, and when the new vault has earned enough to pay for CoinBalance creation, the Elders create it. This may involve having an actual CoinBalance for the aggregate section BLS key. In this way Elders act as a 'farming pool' for very young vaults, aggregating funds like a Bitcoin mining pool. But the purpose of the pool is not to reduce variation in rewards; it is more to ensure a correct sequence of operations in the vault setup phase.

I think the second option is simplest; a rough sketch of it follows below. Would be interested to hear more ideas.
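Here is that two-stage test as a self-contained sketch; every type and name in it is a hypothetical stand-in, not the RFC's actual API:

struct SectionInfo {
    aggregate_key: [u8; 32], // stand-in for the section's BLS public key
}

fn creation_is_valid(
    from: &[u8; 32],
    from_balance: Option<u64>, // None if 'from' has no CoinBalance
    cost: u64,
    section: &SectionInfo,
) -> bool {
    // a) 'from' is able to pay, or b) 'from' is the section itself.
    from_balance.map_or(false, |b| b >= cost) || *from == section.aggregate_key
}

fn main() {
    let section = SectionInfo { aggregate_key: [7; 32] };
    // A section-keyed creation is valid even with no balance at all.
    assert!(creation_is_valid(&[7; 32], None, 1, &section));
    assert!(!creation_is_valid(&[1; 32], Some(0), 1, &section));
}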


In the Safecoin Transfer section, how is the uuid for a CoinTransfer.Credit.transaction_id determined?


In the Account creation section, it seems most of the text belongs in an 'Account Structure' RFC. These details seem to have no impact on safecoin or farming. I think some mention of this data structure is useful to add context to CoinBalance, but the details don't seem to belong here. Probably the first two sentences of this section are all that's needed, plus "any updates are free of charge."

A minor point for sure, but the document is pretty large and any way to reduce cognitive burden is a win.


A question to ponder about the StoreCost algorithm…

Should the network measure "full vaults" or "spare space", and what's the difference?

On one hand, full vaults are really what needs to be avoided, but on the other hand, spare space is the metric with the best resolution for allowing them to be avoided. Full Vault is post-problem; Spare Space is pre-problem.

Another question…

Should StoreCost for a network of 10 sections with 1 full vault each be the same or different to a network of 10,000 sections with 1 full vault each?

On one hand the portion of full vaults is the same for both networks, but on the other hand the larger network has a thousand times more spare space than the smaller one.


With the overall 2 × UploadCost reward, it could be that vaults would want to upload data to their own section to get more rewards. If every vault did this it might pay for itself, but it becomes a prisoner's dilemma problem. Just touching on this to see if it tickles anyone.


There could be a maximum-reward scenario when one vault is not full and all others are (maybe only briefly? maybe all only 1 chunk over capacity?). This would mean nearly 2 safecoin per PUT, with the 2× reward totalling nearly 4 safecoin per PUT to be divided among the section. Not sure how achievable or dangerous this scenario is, but again I wanted to touch on it to see if it prods anyone.


"reward will be divided as follows"

single_node_age = if no associated CoinBalance::owner { 0 }
                  else if flagged as full { node's age/2 }
                  else { node's age }

I think an extra line might be good here to account for the additional workload of Elders that Adults don't have (especially important for tie-breakers among adults for eldership):

single_node_age = if no associated CoinBalance::owner { 0 }
                  else if flagged as full { node's age/2 }
                  else if elder { node's age + 1 }
                  else { node's age }
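For clarity, here is the proposed variant as a Rust-flavoured sketch (field and function names are hypothetical stand-ins for the RFC's terms):

struct Node {
    age: u32,
    has_coin_balance_owner: bool,
    flagged_full: bool,
    is_elder: bool,
}

fn single_node_age(n: &Node) -> u32 {
    if !n.has_coin_balance_owner {
        0
    } else if n.flagged_full {
        n.age / 2
    } else if n.is_elder {
        n.age + 1 // small bonus for the extra elder workload
    } else {
        n.age
    }
}

fn main() {
    let elder = Node { age: 10, has_coin_balance_owner: true, flagged_full: false, is_elder: true };
    assert_eq!(single_node_age(&elder), 11);
}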

I'm interested in the Full Vault details…

Are chunks always attempted to be stored in full vaults, to give them a chance to become recognised as no longer full? Or does a full vault get automatically bypassed and the chunk redirected to the next nearest non-full vault?

Will there be an option for a full vault to ask for relocated chunks to be returned to it, e.g. an Unrelocate request?

Is there some time after which the full vault is killed? Or can full vaults live on the network forever? Or until their next relocation?

Can the list of relocated chunks keep growing forever or is there some limit to how many chunks can be relocated?


Has the disallow rule been replaced by the number of full vaults?

Seems to be covered in Future enhancements section:

"accepting a node to a section will revolve around 100 nodes capable of storing data, full nodes will not be counted as part of the section recommended size of 100"

and

"Each section will aim to maintain a minimum ratio of 50% good nodes"

So… full nodes don't count toward the 100 target, but they do count toward the 50% good.

Does this pseudocode seem right?

allow new vault?
    less than 100 good -> yes
    more than 50% full -> yes
    else -> no
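Or, the same rule as a runnable sketch (assuming my reading of the quotes above is right):

// Admit a new vault if the section is short of good nodes, or if full
// nodes have become the majority.
fn allow_new_vault(good: usize, full: usize) -> bool {
    good < 100 || full * 2 > good + full
}

fn main() {
    assert!(allow_new_vault(99, 0));    // fewer than 100 good
    assert!(allow_new_vault(100, 150)); // more than 50% full
    assert!(!allow_new_vault(100, 50));
}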

"accepting a node to a section will revolve around 100 nodes capable of storing data, full nodes will not be counted as part of the section recommended size of 100"

"Each section will aim to maintain a minimum ratio of 50% good nodes"

This gives some expected ranges of section size.

100 good nodes is the target. At least 50% good nodes means between 0 and 100 full nodes.

200 good nodes is roughly when a split will happen, which means between 0 and 200 full nodes in a big section.

So the likely biggest section would be 400 nodes (200 good, 200 full).

Some StoreCost calculations (G = good nodes, F = full nodes, N = total nodes, SC = StoreCost in safecoin per PUT):

  G    F    N      SC
100    0  100  0.0100
100  100  200  0.5100
200    0  200  0.0050
200  200  400  0.5050

StoreCost will almost always be roughly between 0.005 and 0.5 safecoin per PUT.
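For what it's worth, the four rows above are consistent with StoreCost = F/N + 1/G. That formula is inferred from the numbers, not quoted from the RFC, so treat this sketch as a reading aid only:

// Reproduces the table above, assuming StoreCost = full/total + 1/good.
fn store_cost(good: f64, full: f64) -> f64 {
    full / (good + full) + 1.0 / good
}

fn main() {
    for (g, f) in [(100.0, 0.0), (100.0, 100.0), (200.0, 0.0), (200.0, 200.0)] {
        println!("G={g:>3} F={f:>3} SC={:.4}", store_cost(g, f));
    }
}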

Does this summary sound right? Am I missing some details?

13 Likes

As I said, that comes in later, when underflows/overflows occur from/to the parts.

And yes, I was using "operation" to mean a set of instructions that is performed on each variable, where the set of instructions can include the overflow/underflow handling.

Whereas the single fixed-point integer takes basically one instruction, not just one operation.

No, as this is an avenue of attack to drain the Elders by spamming.

The solution is to have zero-cost creations (PUTs) for certain privileged data, like the creation of a CoinBalance.

A CoinBalance is created once the vault has earned enough to put into it. This prevents spamming of CoinBalance creation.

"Full Vaults" seems odd to me too, since we never want full vaults. And you seem to agree.

My thinking is that one full vault indicates that other vaults may also be near full, with the potential for a cascading effect: a small number of uploads may cause many times more vaults to become full, and the consensus chain (is it still called the datachain?) grows rapidly, since going full causes more decisions to be made and more consensus events, which in turn push almost-full vaults over into being full. The worst case is that 5% of vaults are full, 50% are one block/chunk away from being full, and say another 40% are very close. The random distribution has worked very well to cause an even distribution (rare, I know), and thus vault after vault goes full with each chunk uploaded.

Sacrificial chunks allowed for a much better indicator, since they are harder to successfully fool about how much free space is (almost) guaranteed to be there. There may be a lot more, but the network only needs to know that there is enough spare space.

6 Likes

I've been asked to limit my time in this thread, so I'm afraid I won't be able to answer individual points that I've seen in here. But I'll try and get across a couple of points while I can.

Firstly, I already answered @happybeing on the dev forum so that might give some pointers :slight_smile:

@riddim On the issue of the type of encoding, I would probably favour plain base-32 myself over z-base-32, since it's a better-known standard and less likely to have buggy implementations, but that's just a personal opinion.

On the issue of the Coin structure, I've created an example to try and get my point across more clearly, since I think a few people were misunderstanding what would be exposed to app devs or users. Please have a peek at the generated docs and let me know if you still see problems there. As of now I haven't had time to include functionality to allow for simple arithmetic operations on the various types, and those will likely be worth discussing too (e.g. should we have a couple of ways to add two Coins, one which panics if the result is too large and another which returns a Result?)

Interesting ideas, all three! I suggested another alternative in my reply on the dev forum, but the gist was: I imagine we could have a different RPC to create a CoinBalance which would be used for the sole purpose of receiving farming rewards and which would be free to create. While it has earned less than the amount it normally costs to create a new CoinBalance, it's flagged as receive-only or something like that.

This is probably more like your third option, which appeals to me too. The main downside I can see of using an aggregate section key at the start is that once the balance is high enough, the section would have to transfer ownership of the CoinBalance to a singly-owned key which the new vault has created. The section would also need to keep transferring ownership to the new aggregate section key as the section changes.

The idea is that it's up to the clients to decide. If e.g. a vendor wants to just derive it from a sequential invoice or order number, it can. The vendor provides the required UUID to the customer along with his CoinBalance address and invoice amount, and the customer does the transfer using those values. Using that known UUID, both the vendor and the customer can get a "receipt" from the network by calling get_transaction(vendor's address, uuid).

If you're just transferring an amount of safecoin between two of your own CoinBalances and don't care about a "receipt", the UUID can be completely random.
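A hedged, self-contained sketch of that flow (every type and function here is a made-up stand-in; the real client API may look quite different):

use std::collections::HashMap;

#[derive(Clone, Copy, PartialEq, Debug)]
struct Transaction {
    amount: u64,
}

#[derive(Default)]
struct Network {
    // (recipient address, uuid) -> transaction
    ledger: HashMap<(u64, u64), Transaction>,
}

impl Network {
    fn transfer(&mut self, to: u64, amount: u64, uuid: u64) {
        self.ledger.insert((to, uuid), Transaction { amount });
    }
    fn get_transaction(&self, to: u64, uuid: u64) -> Option<Transaction> {
        self.ledger.get(&(to, uuid)).copied()
    }
}

fn main() {
    let mut net = Network::default();
    let (vendor, uuid, amount) = (42, 1001, 5_000);
    // The customer pays using the vendor-supplied UUID...
    net.transfer(vendor, amount, uuid);
    // ...then either party can fetch the "receipt" with the same UUID.
    assert_eq!(net.get_transaction(vendor, uuid), Some(Transaction { amount }));
}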

I agree about the appeal of using Spare Space, but it has its own problems. It's easy for a malicious vault to lie about that, and even for good vaults, they'd have to control the amount of space they're advertising, or else the amount of disk space they can actually use could drop to zero more or less instantly (e.g. the user fills his disk by copying over a bunch of videos).

7 Likes

All good :slightly_smiling_face: glad to hear this. (… Because the only upside I am seeing is (maybe) better readability of the last character… depending on the byte length of the encoded string (I won't count, because I refuse to invest more thinking in this…)… While the first characters are fixed because of the CID, and depending on the 'readability score' for them, that might, even by base32z's own logic, make base32-encoded XOR names more readable than base32z-encoded ones… So, at the latest after talking a bit about what it is and where it applies, it should be clear now that it's a somewhat questionable choice… [and the fact that there are even encoding errors in the example of the paper doesn't make it look better…])

… In the end people will just use it; only the ones implementing language-specific bindings will be asking stupid questions, and all others will just use it and not even notice… So I will try to really say no more about this topic at all now…
… So this is a topic that (by far) doesn't deserve all this thought, I guess :slightly_smiling_face: there are way more important choices that need to be made…

1 Like

@anon86652309, just in case you did not realise, fixed-point integer maths is just integer maths. The fixed point only applies when the value is to be displayed. Fixed-point operations are just integer operations; no difference. The only difference is when the value is to be displayed or converted to/from strings.

I still do not understand why you think it's better to hold the coin balance as 2 distinct variables. Adding with fixed point becomes as easy as c = a + b plus an overflow check; no need to check for milli, micro, or nano.

For a string being supplied: fixed point looks at the string, adds zeros to fill 9 decimal places, and removes the decimal point; that gives you the string version of the fixed-point number, which you then convert to an integer value to store.
E.g. supply 1.2345, add zeros to fill 9 places => 1.234500000, then remove the decimal point => 1234500000, and convert that to an integer and store it as the coin value in a u64.

For a string to be returned, convert the integer to a string and add the decimal point at the 9-digit point.
E.g. the coin value integer is 1234567890; convert to string => 1234567890, add the decimal point at 9 digits => 1.234567890.
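Those two conversions, as a small Rust sketch (assuming 9 decimal places, i.e. 1 coin = 10^9 nanos, and valid input; real code would return errors instead of panicking):

const NANOS_PER_COIN: u64 = 1_000_000_000; // 10^9

fn string_to_nanos(s: &str) -> u64 {
    let (whole, frac) = s.split_once('.').unwrap_or((s, ""));
    assert!(frac.len() <= 9, "too many decimal places");
    // Pad the fractional part to 9 digits: "2345" -> "234500000".
    let frac_padded = format!("{:0<9}", frac);
    whole.parse::<u64>().unwrap() * NANOS_PER_COIN + frac_padded.parse::<u64>().unwrap()
}

fn nanos_to_string(nanos: u64) -> String {
    // Re-insert the decimal point 9 digits from the right.
    format!("{}.{:09}", nanos / NANOS_PER_COIN, nanos % NANOS_PER_COIN)
}

fn main() {
    assert_eq!(string_to_nanos("1.2345"), 1_234_500_000);
    assert_eq!(nanos_to_string(1_234_567_890), "1.234567890");
}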

Let the front end accept or supply an integer value, so the back end has no need to know about strings. Have a string interface if you feel it's necessary, along the lines of the above; and if you want milli, micro, or nano, simply truncate the string and check whether the cut-off digits are zero (if not, it's an error).

Then the back end is doing integer maths on the fixed-point integer, which is exactly the same as just doing integer maths. So please explain why keeping the coin balance as 2 separate values is easier, quicker, or conceptually better than just storing the coin value as one integer that represents the value in nanos? (For both back and front ends.)

2 separate values means multiple operations just to add two coin values together. An integer value in nanos is one ALU add operation, a single machine-code addition. 2 separate values will be conceptually strange and take a large number of machine operations.

13 Likes

Great job getting this out! It's taken a while for me to digest this, but a few comments/questions:

  1. Mav touched on this, but the store costs are too high if we are still talking about 1 MB PUTs. At 0.005 (the cheapest they can be) that is 0.5c/MB at $1 per SAFE, i.e. $5/GB, which will only allow very limited use cases; worse, it is linearly dependent on price, which we all want to be higher and not artificially capped by killing use cases. Note this is the best possible case, and it can only be cheapened by a lower market price for SAFE, not by adding capacity, which seems wrong at a fundamental level.

  2. I think the formula fails because of point 1, and because the minimum is set entirely by the exogenous choice of the number of nodes per section and not by any performance considerations. As Mav noted, the Full/not-Full flag doesn't give enough fine-grained information about available space (and could only be used to infer actual available space if the size distribution were known and the section were almost full already).

  3. The price may jump quickly (doubling) during section splits…

  4. Having a section-specific price seems to add issues with 'shopping' while the network is young (it should even out in a big network)… why not simply keep re-encoding data with some kind of new nonce (getting a new section) until some section reports a low price? I could be off here…

It seems a design principle for the formula should be to make it as independent as possible from the choice of the number of nodes per section, and able to go arbitrarily low (and possibly high). To achieve this feels like it requires greater insight into the amount of data storage available on the nodes… possibly a success/fail message could also include the used/remaining space. There may also have to be some thought about whether and how vaults will be able to dynamically add more space. One possible solution from control theory is to target the rate rather than the value… if you had an estimated available space, you could calculate the time to fill it at the storage rates over, say, the last week (a reasonable time frame in which to add capacity) and base the price off that… some thinking needs to be done here in any case. We probably need a simulator to calculate pricing for a number of formulas at different network sizes/conditions.
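As a strawman for such a simulator, here is one (entirely made-up) shape that a time-to-fill price rule could take; every constant in it is arbitrary:

// Price storage off the projected time to fill the remaining space at
// the recent intake rate; all constants are illustrative only.
fn store_cost(spare_bytes: f64, bytes_per_day: f64) -> f64 {
    let days_to_full = spare_bytes / bytes_per_day.max(1.0);
    let target_days = 7.0; // time considered "enough" to add capacity
    let base_cost = 0.0001; // arbitrary baseline price per PUT
    // The price rises as the projected fill time drops below the target.
    base_cost * (target_days / days_to_full).max(1.0)
}

fn main() {
    // Plenty of headroom: baseline price.
    assert_eq!(store_cost(1e15, 1e12), 0.0001);
    // One day from full at the current intake: 7x the baseline.
    assert!((store_cost(1e12, 1e12) - 0.0007).abs() < 1e-12);
}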

The design for safecoin is in any case almost simple compared to getting the network incentives right. The switch to 'section' farming rather than individual farming has pros and cons… Pro: less reward variability and simplicity of implementation. Con: it seems to eliminate some of the incentive to provide the fastest nodes possible, and rather incentivizes uptime exclusively. Maybe this is good, maybe bad, but there is in any case one incentive lost.

Great work. Can't wait to follow the development here.

11 Likes

The target should be to set the algorithm so that in the early stages it yields about 1 TB per 1 Safecoin. As that looks like a very tiny reward for farmers, it should lead them to reinvest all earnings back into the network, while the $ price rises and the 1 TB / 1 Safecoin ratio gets lower.
If that were set before the beta release, we should also see an adequate market price rise to reflect the real price of 1 TB of forever storage with unlimited bandwidth.

Edit: And as a side effect, there should be fewer vaults from big datacentres, since the profit would be very speculative.

1 Like

I'm struggling to understand this too.

One u64 integer is just super simple and industry standard. Easy to store, easy to represent, easy to sum. Why complicate things?

8 Likes

If Safecoin ever got to $10,000 per coin, then a nano-safecoin would be 1 milli-cent.

Does anyone have any objections to this smallest value of Safecoin, if Safecoin ever got to $10,000?

I am thinking of those who want micro-tipping or micro-transactions.

It's doubtful it will get to this value in the expected lifetime of this evolution of the SAFE Network (say 20-50 years).

Personally this is good enough for me.

9 Likes