Package com.linkedin.venice.utils
Class HelixUtils
java.lang.Object
com.linkedin.venice.utils.HelixUtils
Helper functions for Helix.
-
Field Summary
Modifier and TypeFieldDescriptionstatic final int
Retry 3 times for each helix operation in case of getting the error by default.static final String
The constraint that helix would apply on CRUSH alg. -
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionstatic void
checkClusterSetup
(org.apache.helix.HelixAdmin admin, String cluster, int maxRetry, int retryIntervalSec) Before this participant/spectator joining the Helix cluster, we need to make sure the cluster has been setup by controller.static <T> void
compareAndUpdate
(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, int retryCount, org.apache.helix.zookeeper.zkclient.DataUpdater<T> dataUpdater) static <T> void
compareAndUpdate
(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, org.apache.helix.zookeeper.zkclient.DataUpdater<T> dataUpdater) static void
connectHelixManager
(SafeHelixManager manager, int maxRetries, int sleepTimeSeconds) Try to connect Helix Manger.static <T> void
static <T> void
create
(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, T data, int retryCount) static <T> boolean
static <T> List<T>
getChildren
(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, int retryCount, long retryInterval) static org.apache.helix.model.CloudConfig
getCloudConfig
(org.apache.helix.cloud.constants.CloudProvider cloudProvider, String cloudId, List<String> cloudInfoSources, String cloudInfoProcessorPackage, String cloudInfoProcessorName) static org.apache.helix.model.MaintenanceSignal
getClusterMaintenanceSignal
(String clusterName, SafeHelixManager manager) If a cluster is in maintenance mode it returns all the signals associated with it regarding the reason and other details for creating the maintenance mode.static String
getHelixClusterZkPath
(String clusterName) static Instance
getInstanceFromHelixInstanceName
(String helixInstanceName) static int
getPartitionId
(String helixPartitionName) static String
getPartitionName
(String resourceName, int partitionId) static String
getResourceName
(String helixPartitionName) static String
instanceIdToUrl
(String instanceId) Convert a replica's instance ID to its URL.static boolean
isLiveInstance
(String clusterName, String instanceId, SafeHelixManager manager) listPathContents
(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path) static <T> void
static <T> void
static void
setupCustomizedStateConfig
(org.apache.helix.HelixAdmin admin, String clusterName) This method tells helix what properties to aggregate for customized view.static void
setupInstanceConfig
(String clusterName, String instanceId, String zkAddress) static <T> void
static <T> void
update
(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, T data, int retryCount) static <T> void
updateChildren
(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, List<String> paths, List<T> data) static <T> void
updateChildren
(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, List<String> paths, List<T> data, int retryCount)
-
Field Details
-
DEFAULT_HELIX_OP_RETRY_COUNT
public static final int DEFAULT_HELIX_OP_RETRY_COUNTRetry 3 times for each helix operation in case of getting the error by default.- See Also:
-
TOPOLOGY_CONSTRAINT
The constraint that helix would apply on CRUSH alg. Based on this constraint, Helix would NOT allocate replicas in same partition to same instance. If we need rack aware ability in the future, we could add rack constraint as well.- See Also:
-
-
Constructor Details
-
HelixUtils
public HelixUtils()
-
-
Method Details
-
getHelixClusterZkPath
-
getPartitionId
-
getPartitionName
-
getResourceName
-
getInstanceFromHelixInstanceName
- Parameters:
helixInstanceName
- of form host_port.- Returns:
- Instance object with correct host and port.
-
getChildren
public static <T> List<T> getChildren(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, int retryCount, long retryInterval) - Parameters:
path
- parent path- Returns:
- a list of objects that are under parent path. It will return null if parent path is not existing
-
listPathContents
-
create
public static <T> void create(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, T data) -
create
public static <T> void create(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, T data, int retryCount) -
update
public static <T> void update(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, T data) -
update
public static <T> void update(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, T data, int retryCount) -
updateChildren
-
updateChildren
-
exists
public static <T> boolean exists(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path) -
remove
public static <T> void remove(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path) -
remove
public static <T> void remove(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, int retryCount) -
compareAndUpdate
public static <T> void compareAndUpdate(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, org.apache.helix.zookeeper.zkclient.DataUpdater<T> dataUpdater) -
compareAndUpdate
public static <T> void compareAndUpdate(org.apache.helix.manager.zk.ZkBaseDataAccessor<T> dataAccessor, String path, int retryCount, org.apache.helix.zookeeper.zkclient.DataUpdater<T> dataUpdater) -
connectHelixManager
public static void connectHelixManager(SafeHelixManager manager, int maxRetries, int sleepTimeSeconds) Try to connect Helix Manger. If failed, waits for certain time and retry. If Helix Manager can not be connected after certain number of retry, throws exception. This method is most likely being used asynchronously since it is going to block and wait if connection fails.- Parameters:
manager
- HelixManager instancemaxRetries
- retry timesleepTimeSeconds
- time in second that it blocks until next retry.- Throws:
VeniceException
- if connection keeps failing after certain number of retry
-
checkClusterSetup
public static void checkClusterSetup(org.apache.helix.HelixAdmin admin, String cluster, int maxRetry, int retryIntervalSec) Before this participant/spectator joining the Helix cluster, we need to make sure the cluster has been setup by controller. Otherwise, the following operations issued by participant/spectator would fail. -
setupInstanceConfig
-
isLiveInstance
public static boolean isLiveInstance(String clusterName, String instanceId, SafeHelixManager manager) -
getClusterMaintenanceSignal
public static org.apache.helix.model.MaintenanceSignal getClusterMaintenanceSignal(String clusterName, SafeHelixManager manager) If a cluster is in maintenance mode it returns all the signals associated with it regarding the reason and other details for creating the maintenance mode.- Parameters:
clusterName
-manager
-- Returns:
- the maintenance mode signal if the cluster is in maintenance mode, otherwise returns null.
-
setupCustomizedStateConfig
public static void setupCustomizedStateConfig(org.apache.helix.HelixAdmin admin, String clusterName) This method tells helix what properties to aggregate for customized view. How customized view works in steps: 1. Venice SN reports its instance level states e.g. ingestion progress / hybrid store quota for each partition. 2. Helix aggregates the per instance level states to store version level. This method tells Helix what states to aggregate. So far we need to aggregate two propertiesHelixPartitionState.OFFLINE_PUSH
andHelixPartitionState.HYBRID_STORE_QUOTA
. 3. Other components e.g. Venice routers listen to the store version level states to serve traffic.- Parameters:
admin
-clusterName
-
-
instanceIdToUrl
Convert a replica's instance ID to its URL. An example is "localhost_port" to "https://localhost:port".- Parameters:
instanceId
-- Returns:
- replica URL
-
getCloudConfig
-