Suggestion to help reduce "accidental" overprovisioning by node runners

This “32GB” node size test network has highlighted an issue: people can run 100 or more times as many nodes as their system can actually handle. This can cause problems if too many of these large operations die from running out of disk.

Another issue is that no one can credibly announce that the network has a certain amount of storage available. Deriving raw capacity from node count is worse than random guessing; it produces a monopoly-“money” figure.

My suggestion is simple and covers the more “accidental” over-provisioning done by eager beavers. And yes, it won’t stop people with the ability to modify the code, but seriously, that is a small number willing to do that.

It is simply this: create dummy chunks of max (or expected-average) chunk size and store them in the record store, filling its directory to the expected node size. We currently have a node size of 32GB, which is merely the expected average size that a node with the 128K record maximum will get to. So on startup the node would fill the record store to 32GB with dummy records, either max-chunk-sized or expected-average-sized.

The code already removes chunks that the node is not responsible for, and the node would remove dummies first when space is needed.
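
In shell terms the mechanism is roughly this (a minimal sketch only; the store path, the 32GiB target, the 512KiB record size and the dummy naming are illustrative assumptions, not the real safenode layout):

```bash
# Sketch: pre-fill a record store with dummy records up to the expected node size.
STORE="$HOME/.local/share/safe/node/record_store"   # hypothetical path
TARGET_BYTES=$((32 * 1024**3))   # assumed expected node size: 32 GiB
RECORD_BYTES=$((512 * 1024))     # assumed max record size: 512 KiB

mkdir -p "$STORE"
used=$(du -sb "$STORE" | cut -f1)
i=0
while (( used < TARGET_BYTES )); do
    # dummy records would be the first deleted whenever real chunks need the space
    head -c "$RECORD_BYTES" /dev/urandom > "$STORE/dummy_$i"
    used=$((used + RECORD_BYTES))
    i=$((i + 1))
done
```

The node itself would of course do this in its own code on startup; the point is that the fill step is trivial, and eviction can piggyback on the cleanup that already removes chunks a node isn’t responsible for.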

This solves the over-provisioning issue well enough that people can confidently state the raw capacity of the network within a typical error margin. That is an important figure, alongside the amount of data stored on the network. It also gives real, actual knowledge of how full the network really is.

12 Likes

It’s an anti-feature that needs maintaining: the dummy chunks need to be prevented from entering the network and removed with priority…

What problem is this supposed to solve? Once the network fills and the price rises, storage will be there - the expensive resource is bandwidth, not storage space.

(actually I thought about this as a possible solution for a short moment too… Because without filling the space there is no way to ensure it’s there… But I think it doesn’t solve a real-world issue and therefore should be avoided…)

This is an attempt to force others to ‘play by the rules’, nothing more… I guess it boils down to deciding whether that’s worth the hassle, given the assessed probability of success.

3 Likes

Being able to actually know what the “raw” capacity of the network is.

One of the big metrics for promoting the network is being able to state the amount of storage available across it.

We were estimating “raw” capacity as the average node size (32GB for this beta net) times the number of nodes, divided by 5 for available storage. And then how full the network is, based on the average fullness of the nodes you run.
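
As a worked example, using the ~140k node count mentioned later in this thread:

```bash
# "raw" capacity = node count x average node size; available = raw / 5
awk 'BEGIN { raw = 140000 * 32 / 1e6               # GB -> PB
             printf "raw %.2f PB, available %.2f PB\n", raw, raw / 5 }'
# prints: raw 4.48 PB, available 0.90 PB
```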

But with over-provisioning, these figures are now like monopoly money: worthless.

It also assists in preventing a cascading collapse. Someone with 100,000 nodes backed by only a few TB could have their nodes fall over just from the nodes getting a bit fuller, e.g. going from an average of 195 records to 200 records per node (9.75TB to 10.0TB total on only a 10TB disk that also holds the OS etc). That then has a flow-on effect, causing other heavily over-provisioned setups to collapse. My 60 nodes are using around 1 to 1.5Mbit/s upload and <5% CPU, so 100,000 nodes is about 2.5Gbit/s, which is more than possible for those with 12 or more computers in their setups.
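
Spelling out that arithmetic (the 0.5MB average record size is inferred from the totals quoted):

```bash
awk 'BEGIN {
  nodes = 100000; rec_mb = 0.5                      # assumed average record size
  printf "195 records/node: %.2f TB\n", nodes * 195 * rec_mb / 1e6   # 9.75 TB
  printf "200 records/node: %.2f TB\n", nodes * 200 * rec_mb / 1e6   # 10.00 TB
  # 1.5 Mbit/s upload across 60 nodes, scaled to 100k nodes:
  printf "100k nodes: %.1f Gbit/s upload\n", 1.5 / 60 * nodes / 1000 # 2.5 Gbit/s
}'
```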

8 Likes

The best way to enforce the rules is to get more uploaders online, fill the network up and see what happens :slight_smile:

4 Likes

We won’t get to see what happens with this network though, because with 140k nodes that’s 4.4PB. Even half filling it to 2.2PB will be a tall order; we’ve only ever seen a few tens of TB uploaded. If a few dozen of us all had tokens from a faucet and were uploading like crazy, we’d have a chance of getting it filled to a decent level.

One for a Commnet maybe? Everyone over-provisions their nodes, lots of people upload like there’s no tomorrow, and we see what happens? Then add more non-overcommitted nodes and see if it can be salvaged.

Edit
Some people overcommit their nodes, but not everyone!

5 Likes

And after 2 days I am only seeing my nodes being responsible for storing <30 records each

There is no way we’ll see the nodes getting 10% full

3 Likes

my current estimate is that at the current upload rate we’ll end at a ~0.5% fill level next Tuesday :slight_smile:

that’s ofc only true if nothing changes - if MaidSafe x20 their uploads we might see the 10% area :slight_smile:

1 Like

How does the saying go? “where have I heard that before”

I think their definition of increasing uploads is different to my definition/understanding

2 Likes


well - we’ve seen a significant increase not long ago - might happen again :smiley:

3 Likes

And later in the network’s life we won’t have those empty levels anyway… It won’t be worth running nodes below 0.5 tokens per month per node for me at the current value, and I guess it’s pretty similar for others…

So are we afraid of over-provisioning in the live network or in the current test networks?

1 Like

Well, both. Those over-provisioning are gaming the system by getting paid for a lot more stores than they are providing hardware for.

If I had a 5Gbit/sec link I could run over 1/4 million nodes with only 20TB backing them (I’ve got 100TB), be a huge portion of a young network gaining a ton of payments, then end up collapsing, then run them up again. (At roughly 20kbit/s upload per node, from the figures above, 5Gbit/s covers about 250,000 nodes.)

For one, it’s not fair to those running honest nodes; secondly, it drains rewards that should be spread around more widely; and thirdly, it’s a danger to the stability of the data and the network.

If the beta can have 150K nodes after many months, with over 60K in the hands of 3 or 4 people (one with over 28K), then a live network without those incentives might start off with 100K and grow to 300K in 6 months. But say I come in with 250K nodes, holding 45% of the chunks (250K of 550K total) and getting 45% of the rewards, only to collapse, spin back up and repeat until I collapse again. How many chunks would be permanently lost, even with the expanded Sybil protection of maybe 20-25 nodes holding each chunk?

It’s a problem in a couple of dimensions.

Now if I could do that with one 5Gb/s link and my personal hardware, then someone like Swiss, with his hardware across the world and large drives, could run 100K to 250K nodes in each location doing this. Potentially the network could collapse completely when one location falls over due to lack of space and triggers the cascade across all his heavily over-provisioned machines.

It’s unlikely he would do that, but this is not just a theory that it could be an issue; it is easily done and the consequences are real.

4 Likes

That’s what happens when your systems get to 100% CPU load too - right? :wink:

And the someone able to provide 250k nodes won’t be the one capable of doing the mentioned modification of the code, I guess :face_with_monocle:

2 Likes

I kind of agree, but anyone large enough will have systems in place to manage their nodes, so no collapse.

Just a reduced node count as space is required. And shouldn’t the price then be going up, which will attract more node runners to get more nodes in the game?

In the long run I don’t expect it to be profitable to run on Hetzner with month-to-month running costs, so that will push us to home hardware, where in most cases the router will be the bottleneck, not the HD space.

2 Likes

It’s okay - just go for it - it’s not me who will be affected I guess - I’ll just lose 2 days adapting my management pipelines

1 Like

Nope, it’s when there is no disk space left and the nodes end up completely shunned. It’s not an instantaneous collapse but one where the nodes are shunned and other nodes churn the data to maintain at least 5 nodes responsible for each chunk of the shunned node. This may happen over the period of an hour. Remember that the remaining nodes on that machine are also getting churned chunks added, speeding up their demise and basically accelerating the collapse on that machine. And if many people are doing this, then their nodes all start this process of dying off.

3 Likes

IIRC you are stopping nodes when CPU load gets too high

1 Like

To do this only takes knowing how to run safenode-manager, or following people who show something like:

```bash
# start a node every 10 seconds, 100,000 times (… stands for the usual flags);
# backgrounded with & so the loop doesn't block on the first node
for ((i=0;i<100000;i++)); do safenode … & sleep 10; done
```

Editing the code is another, higher level of understanding, and perhaps only 1 in 1,000 or 1 in 100,000 of the script kiddies doing this sort of gaming would manage that.

1 Like

Please go for it - some people here will increase their network share as a consequence, at the same cost :slight_smile:

1 Like

This has nothing to do with CPU load; that is a different situation from this one.

2 Likes

And someone running out of disk space will just run into the 100% without seeing it coming, crushing the whole network with it :wink:

1 Like