While there definitely seems to be something odd happening here I’m not sure it’s quite that simple.
Before you reset your vault I was unable to connect getting a “failed to bootstrap” error. After the reset I have managed to connect 6 vaults to your test net. 5/6 of these vaults have a routing table size of 7 while one has a routing table size of 6. It seems my nodes have almost all found each other.
Are you able to see my nodes connected to yours?
It could also be worth making an issue on the github issues page too.
Edit: Another node just joined. Now have 5/6 with a RT of 8 and 1/6 with a RT of 7.
My “seed vault” is unique in that it starts the network with the “first” flag. It could be that a “first” vault is only allowed to accept a couple of client vaults. So it is preferable if those client vaults it accepts are other hard-coded contacts.* By luck, with the starting and stopping, the otehr hard-coded contact has managed to bootstrap from mine.
* until caching is turned back on. By the way, I found the boolean constant for that (set to false) but I’ll leave it for now.
I restarted it just now. My local vault connected straight away. I then disconnected the local vault to leave room for anyone else to connect.
One more thing to note: My local router (not the seed vault, which is in the cloud) has port forwarding of 5483 to the host of the local vault. Since service discovery is crippled for now then port forwarding, just like on the first testnet, if you’re on NAT, might help.
udp likely doesn’t do anything and as a protocol is apparently less reliable than tcp. Given the success with tcp, devs I think stopped needing to use it, so it’s there perhaps only because the port is suggested for it. In the future perhaps other protocols than tcp might be useful.
so… i suspect on your restarting the vault has not simply crapped out straight away…
but has gone back to the cycle:
INFO 17:46:28.991947115 [routing::core core.rs:1807] Client(953be0..) Failed to get GetNodeName response.
WARN 17:46:28.992135295 [safe_vault::vault vault.rs:171] Restarting Vault
INFO 17:46:31.330784856 [routing::core core.rs:1032] Client(7ce39e..) Running listener.
INFO 17:46:31.392824366 [routing::core core.rs:1555] Client(7ce39e..) Sending GetNodeName request with: PublicId(name: 7ce39e..). This can take a while.
INFO 17:47:31.392808845 [routing::core core.rs:1807] Client(7ce39e..) Failed to get GetNodeName response.
WARN 17:47:31.393059294 [safe_vault::vault vault.rs:171] Restarting Vault
INFO 17:47:33.737790659 [routing::core core.rs:1032] Client(83de3e..) Running listener.
INFO 17:47:33.798988385 [routing::core core.rs:1555] Client(83de3e..) Sending GetNodeName request with: PublicId(name: 83de3e..). This can take a while.
will leave running and see if anything you can do at your end bb can get it back up…?
It’s working now, so try restarting any vault that isn’t obviously connected.
Incidentally, though ill advised for a busy network, I’m running three vaults atm… so that is possible and can only help very fragile network get up and running.