Class SystemStoreRepairTask
java.lang.Object
com.linkedin.venice.controller.systemstore.SystemStoreRepairTask
- All Implemented Interfaces:
Runnable
This class tries to scan all cluster which current parent controller is the leader controller.
It will perform the following action for each system store of each cluster:
1. Check system store is created / has current version.
2. Send heartbeat to system store and check if heartbeat is received.
3. If system store failed any of the check in (1) / (2), it will try to run empty push to repair the system store,
until maximum retry of repair is reached.
It will emit metrics to indicate bad system store counts per cluster and how many stores are not fixable by the task.
-
Field Summary
Modifier and TypeFieldDescriptionstatic final org.apache.logging.log4j.Logger
static final String
-
Constructor Summary
ConstructorDescriptionSystemStoreRepairTask
(VeniceParentHelixAdmin parentAdmin, Map<String, SystemStoreHealthCheckStats> clusterToSystemStoreHealthCheckStatsMap, int maxRepairRetry, int heartbeatWaitTimeSeconds, AtomicBoolean isRunning) -
Method Summary
Modifier and TypeMethodDescriptiongetClusterSystemStoreHealthCheckStats
(String clusterName) getControllerClientMap
(String clusterName) void
run()
-
Field Details
-
LOGGER
public static final org.apache.logging.log4j.Logger LOGGER -
SYSTEM_STORE_REPAIR_JOB_PREFIX
- See Also:
-
-
Constructor Details
-
SystemStoreRepairTask
public SystemStoreRepairTask(VeniceParentHelixAdmin parentAdmin, Map<String, SystemStoreHealthCheckStats> clusterToSystemStoreHealthCheckStatsMap, int maxRepairRetry, int heartbeatWaitTimeSeconds, AtomicBoolean isRunning)
-
-
Method Details