Troubleshoot validator problems
Use this guide to solve the most common validator problems.
Validator has 0 voting power
If your validator has 0 voting power, your validator has become auto-unbonded. On the mainnet, validators unbond when they do not vote on 9500
of the last 10000
blocks (50
of the last 100
blocks on the testnet). Because blocks are proposed every ~5 seconds, a validator that is unresponsive for ~13 hours (~4 minutes on testnet) become unbonded. This problem usually happens when your JMESd process crashes.
To return the voting power back to your validator:
-
If JMESd is not running, restart it:
_1jmesd start -
Wait for your full node to reach the latest block, and run:
_1jmesd tx slashing unjail <jmes> --chain-id=<chain_id> --from=<from><jmes>
is the address of your validator account.<name>
is the name of the validator account. To find this information, runjmesd keys list
.
If you don't wait for JMESd to sync before running unjail
, an error message will inform you that your validator is still jailed.
- Check your validator again to see if your voting power is back:
_1jmesd status
If your voting power is less than it was previously, you may have been slashed for downtime.
JMESd crashes because of too many open files
The default number of files Linux can open per process is 1024
. JMESd is known to open more than this amount, causing the process to crash.
-
Increase the number of open files allowed by running
ulimit -n 4096
. -
Restart the process with
jmesd start
.If you are using
systemd
or another process manager to launch JMESd, you might need to configure them. The following samplesystemd
file fixes the problem:_16# /etc/systemd/system/jmesd.service_16[Unit]_16Description=JMES Udo Node_16After=network.target_16_16[Service]_16Type=simple_16User=ubuntu_16WorkingDirectory=/home/ubuntu_16ExecStart=/home/ubuntu/go/bin/jmesd start_16Restart=on-failure_16RestartSec=3_16LimitNOFILE=4096_16_16[Install]_16WantedBy=multi-user.target
JMESd crashes because of memory fragmentation
As described in this issue, huge memory allocation can cause memory fragmentation issue. Temporal solution is just using small wasm cache size like 50~100MB.
v0.5.10+
:
_1contract-memory-cache-size = 100
v0.5.7~v0.5.9
:
_1write-vm-memory-cache-size = 100
The validator is not active
-
The validator is jailed. To solve this problem,
unjail
the validator by running:jmesd tx slashing unjail <jmes> --chain-id=<chain_id> --from=<from>
-
The validator is not in the active validator set. Only the top 100 validators are in this set. To fix this problem, increase your total stake to be larger than the 100 validator.