While trying to start one of Vertica nodes you may face a data consistency problem.
from vertica.log
<PANIC> @v_vdb_node0005: VX001/2973: Data consistency problems found; startup aborted
HINT: Check that all file systems are properly mounted. Also, the --force option can be used to delete corrupted data and recover from the cluster
LOCATION: mainEntryPoint, /scratch_a/release/svrtar5575/vbuild/vertica/Basics/vertica.cpp:1613
from vertica.log
<PANIC> @v_vdb_node0005: VX001/2973: Data consistency problems found; startup aborted
HINT: Check that all file systems are properly mounted. Also, the --force option can be used to delete corrupted data and recover from the cluster
LOCATION: mainEntryPoint, /scratch_a/release/svrtar5575/vbuild/vertica/Basics/vertica.cpp:1613
so.. don't PANIC :-)
solution:
restart the problematic node with force flag which will repair the corruptions from buddy nodes.
[dbadmin ]$ /opt/vertica/bin/admintools -t restart_node -d $db_name -s $host --force
and the result:
*** Restarting nodes for database vdb ***
restart host node0005 with catalog v_vdb_node0005_catalog
issuing multi-node restart
Starting nodes:
v_vdb_node0005 (node0005)
Starting Vertica on all nodes. Please wait, databases with large catalog may take a while to initialize.
Node Status: v_vdb_node0001: (UP) v_vdb_node0005: (DOWN)
Node Status: v_vdb_node0001: (UP) v_vdb_node0005: (DOWN)
Node Status: v_vdb_node0001: (UP) v_vdb_node0005: (DOWN)
Node Status: v_vdb_node0001: (UP) v_vdb_node0005: (DOWN)
Node Status: v_vdb_node0001: (UP) v_vdb_node0005: (RECOVERING)
Node Status: v_vdb_node0001: (UP) v_vdb_node0005: (UP)
ReplyDeleteHello,
What if I encounter this situation on a single node?
I tried to use this command :
/opt/vertica/bin/admintools -t restart_node -d $db_name -s $host --force
but since there is only a single node, restart doesn't work.
Can you help me with that?
Thanks a lot!
try to restore using "rollback database to last good Epoch" - in the advanced menu
ReplyDelete