All Classes and Interfaces

Class
Description
 
Created by acurtis on 5/1/18.
Read compute and write compute chunking adapter
This abstract class contains all the common field and APIs for compute request builder; for each new compute request version, there will be a new builder class that extends this class and the new builder will include customized fields and APIs.
An abstraction for a record reader that reads records from input into Avro-serialized keys and values.
 
 
Forked from Netty's AbstractCoalescingBufferQueue (4.1.42) Includes fix to guard readableBytes from re-entry.
The implementation of DataWriterComputeJob for Spark engine.
A copy of the netty Http2StreamFrameToHttpObjectCodec except with the methods which actually return the converted objects have been made abstract.
An abstraction of the task that processes each record from the input, and returns serialized, and potentially compressed, Avro key/value pairs.
 
An abstraction of Helix State model behavior registry that defines how participants (Storage node) fetch data from Kafka.
An abstraction of the task that processes all key/value pairs, checks for duplicates and emits the final key/value pairs to Venice's PubSub.
Using Avro Json encoder to print duplicate keys in case there are tons of duplicate keys, only print first AbstractPartitionWriter.DuplicateKeyPrinter.MAX_NUM_OF_LOG of them so that it won't pollute Reducer's log.
 
A base class which encapsulates the common plumbing needed by all PollStrategy implementations.
AbstractPushMonitor is a high level abstraction that manages OfflinePushStatus.
This class is used to store common fields shared by various read responses.
This abstract class is the container for response stats.
Streaming callback for batch-get/compute.
 
 
A simplified implementation of a ShutdownableResource.
ModelFactory manages Helix state model creation.
A base storage abstract class which is actually responsible for data persistence.
 
 
An abstract implementation of a storage partition.
This is an abstraction of metadata maintained per Store.
 
Store-level access control handler, which is being used by both Router and Server.
Abstract class capturing the responsibilities of drainers threads doing store ingestion.
 
 
This class is an aggregate place that keeps stats objects for multiple stores.
 
An abstraction to filter given data type.
 
An abstraction of the mapper that would return serialized, and potentially compressed, Avro key/value pairs.
A generic implementation of the VeniceProducer interface
An abstraction for a record reader that reads records from the configured input into Avro-serialized keys and values.
Blueprint for all Services initiated from Venice Server
 
 
 
A base class which users of VeniceWriter can leverage in order to make unit tests easier.
Deprecated.
An AccessController allows a request to be checked against an Access Control List (ACL).
 
Represents an Access Control Entry.
Represents an Access Control Binding.
Update access controller resource list when a store is created/deleted
 
 
 
This class contains logic that SNs must perform if a store-version is running in Active/Active mode.
Counts the number of the active HTTP/2 Streams
 
This class AdHash is used to keep track of the checksum of the data incrementally.
Running checksum implementations based on Adler32.
 
 
The execution object is a kind of context for a admin command including essential information used to track the status of this command.
 
 
 
This class is used to track the execution of the async admin command.
One consumer service for one cluster.
 
This class is used to create a task, which will consume the admin messages from the special admin topics.
This class is used to create Callable that execute AdminOperations for a given store.
 
 
 
 
 
 
AdminRequest encapsulates an admin request from server admin tools.
This class stores all the information required for answering a server admin request.
Response record for admin request, version 1
Controller admin API leveraging sparkjava: http://sparkjava.com/documentation.html
 
 
 
 
 
 
 
This class provides aggregate initialization support for host level ingestion stats class HostLevelIngestionStats
AggKafkaConsumerService supports Kafka consumer pool for multiple Kafka clusters from different data centers; for each Kafka bootstrap server url, AggKafkaConsumerService will create one KafkaConsumerService.
This class is an aggregate place that keeps stats objects for multiple stores and total stats for each region for AggKafkaConsumerService.
Monitor the change of Helix's external view and warn in case that any partition is unhealthy.
 
 
 
Right now, Venice SN only reports aggregated metrics for RocksDB.
 
AggServerHttpRequestStats is the aggregate statistics for ServerHttpRequestStats corresponding to the type of requests defined in RequestType.
AggServerQuotaUsageStats is the aggregate statistics for ServerQuotaUsageStats which measures requests and quota rejections of each store.
 
The store level stats for blob transfer
 
The store level stats or the total stats will be unpopulated because there is no easy and reliable way to aggregate gauge stats such as rt topic offset lag.
 
Interface defines the ways to access to the allowlist of Helix NodeId.
 
 
An implementation of PubSubAdminAdapter for Apache Kafka.
Implementation of PubSubProducerAdapterFactory used to create Apache Kafka admin clients.
 
This class is not thread safe because of the internal KafkaConsumer is not thread safe.
 
Configuration for Apache Kafka consumer.
Offset position for Apache Kafka topics
A wrapper over Apache Kafka producer which implements PubSubProducerAdapter
Implementation of PubSubProducerAdapterFactory used to create Apache Kafka producers.
A Kafka specific callback which wraps generic PubSubProducerCallback
This class holds all properties used to construct ApacheKafkaProducerAdapter (This class could be refactored to hold consumer properties as well).
Converts RecordMetadata to PubSubProduceResult
 
CompletableFuture implementation, which is able to track the timeout behavior when happening.
A PollStrategy implementation which delivers messages in the order specified at construction time.
TODO: Merge this with ControllerApiConstants
This iterator traverses the entire backing array, while skipping over null entries.
Expose an array of ArrayCollection as an immutable Collection<E> Changes to the array are reflected in the collection.
Simple ArrayList backed Map implementation which uses simple linear scans of the list for searching for matching keys.
A thread-safe SortedSet implementation which uses an array as a backing store.
 
 
 
A Future which has very netty-style semantics where AsyncFutureListener can be added to the future to be executed upon condition of Success or Failure has been satisfied.
 
An interface (lambda) which is fired upon completion of an AsyncFuture.
Created by acurtis on 1/26/18.
A completable AsyncFuture interface.
This class is responsible for announcing and de-announcing a ServiceDiscoveryAnnouncer to a Service Discovery system and taking care of failed registrations by retrying them asynchronously.
 
An interface to provide functionality to manage ACL's for a Resource and provide access permission Permission to principals Principal when performing Method.
Concrete implementations are expected to lock their inner lock(s) in their own constructor.
 
 
This class should be used for experiments only.
This comparator is used to compare GenericRecord collection field elements which must have schemas.
 
This class is used to build a ComputeRequestWrapper object according to the specification, and this class will invoke AbstractAvroStoreClient to send the 'compute' request to backend.
 
 
 
Deprecated.
Venice avro generic client to provide read compute operations.
Venice avro generic client to communicate with Venice backend for key-value lookup.
AvroGenericStoreClient implementation for Avro generic type.
This enum lays out the basic specs of the various stateful protocols used in Venice.
 
 
 
AvroSerializer provides the functionality to serialize objects by using Avro.
AvroSpecificDaVinciClient<K,V extends org.apache.avro.specific.SpecificRecord>
 
AvroSpecificDeserializer<T extends org.apache.avro.specific.SpecificRecord>
 
AvroSpecificStoreClient<K,V extends org.apache.avro.specific.SpecificRecord>
AvroSpecificStoreClient for type safety purpose.
AvroSpecificStoreClientImpl<K,V extends org.apache.avro.specific.SpecificRecord>
AvroSpecificStoreClient implementation for Avro SpecificRecord.
AvroSpecificStoreDeserializerCache<V extends org.apache.avro.specific.SpecificRecord>
 
Container for the deserializers of a single store.
 
Utility intended to convert Avro -> SQL, including DDL and DML statements.
 
Created by acurtis on 5/4/17.
Enums of the strategies used to backup older store versions in Venice.
This class is used to periodically scan inactive store versions and perform optimization if the inactive period of some store version meets the pre-configured threshold, this class will trigger the database reopen action to unload unnecessary RAM usage, which was built up when there were active reads coming.
BackupVersionOptimizationServiceStats record the statistics for the database optimization done by the BackupVersionOptimizationService including both successes and failures.
Created by acurtis on 9/28/17.
This class offers very basic metrics for client, and right now, it is directly used by DaVinci.
 
Created by acurtis on 3/22/17.
Created by acurtis on 3/27/17.
 
Created by acurtis on 4/19/17.
 
A combination of HttpRequestEncoder and BasicHttpResponseDecoder which enables easier client side HTTP implementation.
A replacement for the Netty HttpContentEncoder which can handle use in client and server pipelines.
Created by acurtis on 3/22/17.
Created by acurtis on 3/24/17.
 
 
 
Created by acurtis on 3/23/17.
Decodes ByteBufs into HttpResponses and HttpContents.
 
 
 
 
 
This API controls the behavior of the user payload deserialization phase of the batch get response handling.
This enum controls the behavior of the user payload deserialization phase of the batch get response handling.
Batch get chunking adapter
Keep track of the progress of a batch get request .
 
Key part of the heartbeat sent by a push job.
Value part of the heartbeat sent by a push job
 
Read-through cache for items retrievable by two positive integer IDs.
 
This BatchDeserializer does nothing, and always returns an empty list.
 
 
 
 
 
This class will manage the snapshot creation, for batch store and hybrid store.
A BlobTransferManager is responsible for transferring blobs between two entities, either that Peer to Peer or node to a blob store and vice versa.
 
This class is the metadata of a partition in the blob transfer client
POJO to hold the payload for blob transfer related informartion
Class that exposes stats related to blob transfers
This class is the reporting class for stats class BlobTransferStats Metrics reporting logics are registered into MetricsRepository here and send out to external metrics collection/visualization system.
 
 
 
 
This is an implementation of DaVinciRecordTransformer that implements blocking.
This BatchDeserializer does not do anything asynchronously.
Created by acurtis on 10/16/17.
This PollStrategy delegates polling to another implementation, and also executes an arbitrary function during each poll.
 
This interface is meant for users where local state must be built off of the entirety of a venice data set (i.e.
A map implementation with a bounded size.
This type of specialized ScatterGatherRequest does not use keys for routing, and carry some extra state.
Enums of the policies used to decide how buffer replay start timestamps are calculated.
 
A low overhead immutable container of byte[] suitable for use as a map key.
This class is a wrapper of BinaryDecoder with the following optimization: When deserializing byte array field, instead of copying the bytes into a new ByteBuffer, this class will create a ByteBuffer, which is wrapping the original array.
This class overrides JsonEncoder.writeBytes(byte[], int, int), in order to convert all bytes schema data to hexadecimal strings in the output stream, since hexadecimal strings are more readable.
This class encapsulates a value from venice storage accompanied by the schema id that was used to serialize the value.
 
Utility functions for munging on bytes N.B.: Most functions taken from Voldemort's ByteUtils class.
CachedDnsResolver caches the dns entries for hosts, which matches the specified host pattern.
 
Listener used to monitor zk connection state change and refresh venice resource once zk connection is reconnected.
 
 
 
 
 
A utility class to chain two completable futures together.
 
 
 
ChangelogClientConfig<T extends org.apache.avro.specific.SpecificRecord>
 
ChannelInitializer<C extends io.netty.channel.Channel>
A special ChannelInboundHandler which offers an easy way to initialize a Channel once it was registered to its EventLoop.
The purpose of this class is to execute tasks in series without overlap, even if the ChannelTaskSerializer.executeTask(Consumer, ChannelFutureListener) method is called before the completion if the previous tasks.
Properties used by VeniceChangelogConsumer to seek checkpoints.
Parent class for various running checksum implementations.
Types of checksum algorithms supported by Venice's Data Ingest Validation.
 
 
This class accumulates all mapper values and assemble them to provide assembled complete large values or regular message.
 
The VeniceWriter, upon detecting an instance of this class being passed to it, will always call ChunkAwareCallback.setChunkingInfo(byte[], ByteBuffer[], ChunkedValueManifest, ByteBuffer[], ChunkedValueManifest, ChunkedValueManifest, ChunkedValueManifest) whenever processing a MessageType.PUT, whether it is chunked or not.
 
This record is appended to the end of keys in a store-version where chunking is enabled.
 
This class contains both chunked results and manifest for a specific payload.
 
This InputStream implementation allows us to pass a value made up of many chunks into the Avro decoder without stitching it up into a single big byte array.
This record maintains chunking information in order to re-assemble a value that was split in many chunks.
 
 
This record contains chunking information to uniquely identify each chunk.
This interface allows each code path which needs to interact with chunked values to optimize the way the re-assembly is handled so that the final form in which the is consumed is generated as efficiently as possible, via the use of a temporary ChunkingAdapter.
 
This class and the rest of this package encapsulate the complexity of assembling chunked values from the storage engine.
This interface provides methods to split a key into raw key/value byte array and ChunkedKeySuffix.
 
 
Utility class to help in implementing Measurable.getHeapSize().
Utilities for Classes / reflection.
 
 
ClientConfig<T extends org.apache.avro.specific.SpecificRecord>
 
ClientConfig<K,V,T extends org.apache.avro.specific.SpecificRecord>
 
ClientConfig.ClientConfigBuilder<K,V,T extends org.apache.avro.specific.SpecificRecord>
 
 
 
 
 
Different routing strategy types for client side routing: 1.
 
 
 
A wrapper of ThreadLocal for AutoCloseable objects.
 
 
Takes care of executing each routine once per cluster.
This class encapsulates business logic which needs to be executed once per cluster the first time a controller becomes leader of that cluster.
A centralized place to control the locking behavior, such as lock order and lock granularity.
 
 
Per-route request metrics.
 
This class includes the metrics in the cluster-level.
Protobuf type com.linkedin.venice.protocols.controller.ClusterStoreGrpcInfo
Protobuf type com.linkedin.venice.protocols.controller.ClusterStoreGrpcInfo
 
This class handles all operations on a collection field with replication metadata.
This class centralizes the logic of creating a collection replication metadata schema and providing a POJO representation upon a collection replication metadata generic record.
This class handles merges with replication metadata with collection metadata timestamp CollectionRmdTimestamp for collection field.
 
 
 
TODO: Merge this with ControllerRoute COMMAND ( "long-name", "short-name", "description", "required-args", "optional-args" )
 
 
 
Common config keys shared by venice-backend and venice-thin-client.
Handler for compacting Espresso headers to a single header entry.
A set that uses complement representation, which is useful when universal set cardinality is unknown.
Compose a CompletableFuture and Callback together to be a CompletableFutureCallback type.
Created by acurtis on 4/4/17.
One or more variables that together maintain an accumulation.
A PollStrategy implementation which takes a queue of many poll strategies.
Enums of the strategies used to compress/decompress Record's value
 
A simple wrapper to throw exception when retrieving failed computation.
An interface to abstract executing and monitoring a compute job running on any batch compute engine
 
 
 
 
This record only contains the operations and result schema, and keys will be appended after during serialization
 
Keep track of the progress of a compute request .
This record only contains the operations and result schema, and keys will be appended after during serialization
This record only contains the operations and result schema, and keys will be appended after during serialization
This record only contains the operations and result schema, and keys will be appended after during serialization
This record only contains the operations and result schema, and keys will be appended after during serialization
This class is used by the client to encapsulate the information it needs about a compute request.
This record used to store the response per key
 
 
 
 
This field will store all the related info for one key
ComputeRouterRequestWrapper encapsulates a POST request for read-compute from routers.
This class provides utilities for float-vector operations, and it also handles PrimitiveFloatList transparently to the user of this class.
 
One or more variables that together maintain an accumulation.
 
 
A BlockingQueue implementation which uses a collection of concurrent Queues per thread in order to reduce the hotspot of a highly contended mutex or atomic reference.
 
 
 
Thrown when a config property is invalid or missing
 
A command to migrate all incremental push stores in a cluster to a specific incremental push policy.
 
A ChannelHandler which regulates the number of open connections by disabling the parent socket auto-read config when the number of active connections exceeds the limit value.
 
 
An abstraction of a receiver of data consumed from a message queue.
The class is used to asynchronously trigger behavior changes in the StoreIngestionTask.
An Enum enumerating all valid types of ConsumerAction.
 
Encapsulates the logic for deciding if a given topic-partition should be unsubscribed and, in that case, executing the unsubscription.
Type describes all the version attributes
A POJO to store the response content and additional compression related metadata.
 
 
 
 
Protobuf enum com.linkedin.venice.protocols.controller.ControllerGrpcErrorType
Dependencies for VeniceControllerRequestHandler
 
 
Extend this class to create response objects for the controller Any fields that must be in all responses can go here.
 
 
 
ControlMessage payloads contain metadata about the stream of data, for validation and debuggability purposes.
 
A simple enum to map the values of ControlMessage.controlMessageType N.B.: We maintain this enum manually because Avro's auto-generated enums do not support evolution (i.e.: adding values) properly.
This exception is thrown when we detect corrupt data in the Kafka stream.
 
 
 
An interface to retrieve metrics related to data writer tasks after a MapReduce job has completed.
 
CRC32 algorithm as used by Deflate.
A pure-java implementation of the CRC32 checksum that uses the CRC32-C polynomial, the same polynomial used by iSCSI and implemented on many Intel chipsets supporting SSE4.2.
Running checksum implementations based on CRC32.
 
 
Protobuf type com.linkedin.venice.protocols.controller.CreateStoreGrpcRequest
Protobuf type com.linkedin.venice.protocols.controller.CreateStoreGrpcRequest
 
Protobuf type com.linkedin.venice.protocols.controller.CreateStoreGrpcResponse
Protobuf type com.linkedin.venice.protocols.controller.CreateStoreGrpcResponse
 
This class will add a new version to the given store.
This type of store metadata describes the current states of the store in a given fabric and Venice cluster.
 
 
This type of store metadata describes the current version states of the store in a given fabric and Venice cluster.
 
 
 
 
Runs a daemon thread that consistently sends HTTP GET request to admin healthcheck endpoint.
 
This class provides a server manager with start/shutdown methods to be invoked to manage the lifecycle of the server announcer.
This class is used to find the proper d2 service name for the given store through default D2 service d2://VeniceRouter.
 
Deprecated.
D2Client based TransportClient implementation.
A thread factory that sets the threads to run as daemons.
DataModelBackedStructure<T extends org.apache.avro.specific.SpecificRecord>
 
This class gathers all common data provider patterns in test cases.
DataRecoveryClient is the central class to manage data recovery feature.
 
 
DataRecoveryExecutor is the engine to run tasks in data recovery.
DataRecoveryExecutor is the engine to run tasks in data recovery.
This class contains all the logics to: 1.
 
 
A Task is wrapper class that designed to execute multiple commands in sequence.
 
 
This interface defines all the public APIs, and if you need to add accessors to some new fields, this interface needs to be changed accordingly.
 
 
The data replication policy determines which fabric to use for sending real-time writes from clients and where to direct servers for real-time data consumption.
This class encompasses all error conditions which are related to the quality of the data consumed from Kafka.
All the AccumulatorV2 objects that are used in the Spark DataWriter jobs.
An abstraction for executing and monitoring a data writer compute job running on any batch compute engine
The implementation of DataWriterComputeJob for MapReduce engine.
The default implementation of AbstractDataWriterSparkJob for Avro and Vson file input formats.
An interface to report and retrieve metrics related to data writer tasks.
 
 
DvcBlobFinder discovers live DaVinci peer nodes to facilitate blob transfers necessary for bootstrapping the database
Da Vinci Client to provide key-value lookups in embedded mode
 
 
This is a scheduler for sending batching push status in DaVinci.
This abstract class can be extended in order to transform records stored in the Da Vinci Client, or write to a custom storage of your choice.
Configuration class for DaVinciRecordTransformer, which is passed into DaVinciConfig.
This describes the implementation for the functional interface of DaVinciRecordTransformer
 
Utility class for DaVinciRecordTransformer.
This class is used to systematically copy OffsetRecord instances rather than passing them as is.
This class is used to systematically copy OffsetRecord instances rather than passing them as is.
An implementation of AsyncFuture where much of the implementation is based upon netty's org.jboss.netty.channel.DefaultChannelFuture.
Wraps a Callable in a FutureTask and make it notify a DefaultAsyncFuture.
A specialised instance of @{link AsyncFuture} for collecting the results of one or more futures of lists into a single list.
 
 
The default ingestion backend implementation.
 
 
 
 
Default Kafka VeniceKafkaSerializer
Cloned from com.linkedin.security.ssl.access.control.SSLEngineComponentFactoryImpl; Changes: 1.
 
Default implementation of the VenicePartitioner class.
Delegating layer for DaVinciClient.
Inside Fast-Client, we choose to use n-tier architecture style to build a pipeline to separate different types of logic in different layer.
 
 
 
Delete payloads contain fields related to replication metadata of the record.
 
This is a simple container class to hold replication metadata related fields together to be passed on to the Delete api in VeniceWriter VeniceWriter#delete(Object, Callback, LeaderMetadataWrapper, long, Optional).
 
 
 
 
 
Derived schemas are designed for write compute operations.
This class is registered in ZKClient to handle derive schema serialization.
 
This interface provides method to write map entries in a deterministic order.
An Avro generic datum writer that sorts the map entries by keys before serializing the map.
An Avro specific datum writer that sorts the map entries by keys before serializing the map.
DictionaryFetchRequest encapsulates a GET request to storage/storename/version on the storage node to fetch the compression dictionary for that version.
DictionaryRetrievalService runs in a producer-consumer pattern.
 
 
 
 
 
The callback method to disable leader replicas whenever they are in ERROR state so that helix can elect a new leader.
 
Protobuf type com.linkedin.venice.protocols.controller.DiscoverClusterGrpcRequest
Protobuf type com.linkedin.venice.protocols.controller.DiscoverClusterGrpcRequest
 
Protobuf type com.linkedin.venice.protocols.controller.DiscoverClusterGrpcResponse
Protobuf type com.linkedin.venice.protocols.controller.DiscoverClusterGrpcResponse
 
DiskHealthCheckService will wake up every 10 seconds by default and run a health check in the disk by writing 64KB random data, read them back and verify the content; if there is any error within the process, an in-memory state variable "diskHealthy" will be updated to false; otherwise, "diskHealthy" will be kept as true.
DiskHealthStats measures the disk health conditions based on the periodic tests ran by the DiskHealthCheckService.
 
We define a threshold where we consider the disk to be full.
This class is in charge of routing and serialization/de-serialization.
DispatchingAvroSpecificStoreClient<K,V extends org.apache.avro.specific.SpecificRecord>
 
Deprecated.
This class contains stats for DIV.
This class is the reporting class for stats class DIVStats.
 
 
 
 
 
Linked list implementation of the List interface.
In order to maintain a doubly-linked list, each element needs to establish links to its adjacent elements.
The following class has capability to send dual read requests via Fast-client and Thin-client.
DualReadAvroSpecificStoreClient<K,V extends org.apache.avro.specific.SpecificRecord>
 
 
The design consideration to consume in admin tool directly instead of letting controller to consume the required admin message and return them back to admin tool: 1.
 
This is a benign DataValidationException.
The source and destination topic for topic switching are the same topic
A PollStrategy implementation which can introduce duplicates.
This class is used as the return type of the CompletableFuture that is returned by the Venice producer.
A DynamicAccessController is an AccessController with a mutable resource list.
A POJO containing an index and a timestamp for a collection element.
 
 
 
This ControlMessage is sent per partition by each offline incremental push job, once per partition, at the end of a incremental push
This ControlMessage is sent once per partition, at the end of a bulk load, after all of the data producers come online.
This ControlMessage is sent at least once per partition per producer.
An interface to get information about a running job and task on any batch computing engine.
 
 
Expects a GetRequestObject which has store name, key, and partition Queries the local store for the associated value writes the value (as a byte[]) back down the stack
A task that iterates over store version resources and reset error partitions if they meet the following criteria: The store version resource is the current version. The error partition only has exactly one error replica. For L/F model, using EV to check error replicas misses internal error replica states.
 
ExceptionType enum did not have a default deserialization annotation and this makes it non-evolvable till all clients upgrade to newer versions.
An extension of the Http2FrameCodec which provides a facility to be able to block stream creation from clients without incurring any cost of stream creation on the server side.
 
Utility class to expose package private members.
Forked from Netty's Http2MultiplexHandler (4.1.42) (https://github.com/netty/netty/blob/4.1/codec-http2/src/main/java/io/netty/handler/codec/http2/Http2MultiplexHandler.java) In HTTP/1.1, the requests from the client are distributed across 48 workers threads of router.
Forked from Netty 4 Http2StreamChannelBootstrap to allow using EspressoHttp2MultiplexHandler in the pipeline.
 
 
 
 
This interface defines all the public APIs, and if you need to add accessors to some new fields, this interface needs to be changed accordingly.
 
A container of ETL Enabled Store related configurations.
 
 
In ETL jobs, when a "delete" record is seen, "value" is set to "null" and "DELETED_TS" is set to the offset where this delete record was seen.
A class to throttle Events to a certain rate This class takes a maximum rate in events/sec and a minimum interval over which to check the rate.
This interface is used to abstract the strategy to handle the quota exceeding case.
 
 
 
Deprecated.
since v0.2.620 Previously, this enum didn't have the @JsonEnumDefaultValue annotation.
 
Static utility functions to deal with exceptions.
 
 
Accessor user to get/update the last execution id and the generated id from a storage.
Status of executing off-line push.
 
Factory and utility methods for Executor, ExecutorService, ScheduledExecutorService, ThreadFactory, and Callable classes defined in this package.
An analog to ExecutorService except that the futures returned are AsyncFutures.
 
Created by acurtis on 6/12/17.
 
This record is only used to training a ZSTD dictionary on a small amount of generated avro data
* This class is leveraging linkedin-avro-fastserde-impl to speed up deserialization.
This class is leveraging linkedin-avro-fastserde-impl to speed up serialization, and so far it doesn't work for Avro-1.4, and it will fail back to regular Avro.
FastAvroSpecificDeserializer<T extends org.apache.avro.specific.SpecificRecord>
This class is leveraging linkedin-avro-fastserde-impl to speed up deserialization.
 
Only for the sake of accessing a package-private method...
 
This class encompasses all error conditions which: 1.
The FilterChain class takes a list of AbstractVeniceFilter to assemble a filter chain to manage the life cycles' of filters and perform filtering based on the order of filters.
A PollStrategy implementation which can pluck out records from the stream.
Forked from com.linkedin.databus.core.util @ r293057
If a remote peer is unresponsive due to CPU throttling, GC or otherwise, ForceCloseOnWriteTimeoutHandler closes the connection on write timeout, the time out by default is 30 sec.
Adapted from https://stackoverflow.com/a/723914/791758
 
Created by acurtis on 3/22/17.
Created by acurtis on 3/27/17.
An implementation of SslHandler which limits the number of objects held within the pendingUnencryptedWrites queue.
 
Created by acurtis on 2/1/17.
This is an abstract class which represents a generic class associating a generated schema id with it's corresponding value schema id
In Venice, some schemas are generated, by deriving them from value schemas.
GenericChunkingAdapter<V extends org.apache.avro.generic.GenericRecord>
Read compute and write compute chunking adapter
Just for the sake of casting the generic type to GenericRecord...
 
GetRouterRequest encapsulates a GET request to storage/resourcename/partition/key on the storage node for a single-get operation.
 
 
 
 
 
We need to keep track of each request as it goes through the pipeline so that we can record the necessary metrics and separate different parts of the logic for the response.
 
 
 
This class is an incomplete copypasta of the logic in StorageReadRequestHandler.channelRead(ChannelHandlerContext, Object).
 
 
A wrapper around Guava's RateLimiter to provide a common interface for rate limiting.
 
Utility class for generating GUIDs.
 
 
 
 
Hadoop-specific utils.
 
Forked from com.linkedin.databus.core.util @ r293057
 
HDFSSchemaSource caches the Value schemes and RMD schemas for a given store on the HDFS and serve them during the Mapper stage.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
HealthCheckRequest encapsulates information of a health-check request from routers.
 
 
 
A Guid generator which uses the UUID.nameUUIDFromBytes(byte[]).
This service monitors heartbeats.
 
 
 
 
 
Serializer to adapt venice serializer to PathBasedZkSerializer.
Interface and wrapper for Helix related admin operations needed by Venice when running Helix as a service.
Get routing data from Helix and convert it to our Venice partition and replica objects.
 
Extend HelixBaseRoutingRepository to leverage customized view data for offline push.
An implementation of AbstractPushMonitor that listens to Helix Customized View to determine the push status
Extend HelixBaseRoutingRepository to leverage external view data.
 
This strategy is tried to distribute more load to Helix Groups with more capacity.
This simple strategy will try to distribute the load evenly to every Helix Group.
 
 
This class is a thin wrapper of HelixInstanceConfigRepository and HelixGroupSelectionStrategy, and it will delegate all the related API calls to the corresponding objects.
 
Extend RoutingTableChangeListener to leverage customized view data for hybrid store quota.
This repository is used to store the instance config per instance.
Convert between ZNRecord in Helix instanceConfig and Venice instance.
This class is used to monitor the current live instances(Helix participators).
 
Venice Participation Service wrapping Helix Participant.
An Enum enumerating all valid types of HelixPartitionState.
A parent class to access Helix customized partition state, which is different from the states defined in the state model.
A class for accessing partition offline push and hybrid quota status in Helix customized state (per Helix instance) on Zookeeper
This class is used to cache live cluster configs.
This class is used to cache store schema and provide various query operations.
This repository provides the read-only access to both system stores and regular stores.
This class is used to fetch all store configs from ZK and cached locally, then provide the way to read those configs.
 
This repository provides an read only interface to access both system store and regular venice store.
This repository is used to retrieve all the zk shared system store schemas from the system store cluster.
This repository provides an interface to access zk shared system stores only.
This class is used to modify live cluster configs.
This class is used to add schema entries for stores.
This repository supports the following operations: 1.
Read write store repository which uses Helix as storage.
This repository provides an read-write interface to access both system store and regular venice store.
This strategy will assign an ordering of helix groups via round-robin for even distribution.
 
States of Venice node in Helix.
The control message changed built on Helix message service.
 
Helper functions for Helix.
Aggregate all essentials resources required by controller to manage a Venice cluster.
 
Provides methods to locate the storage node or nodes for database partitions.
This is an interface for monitoring the healthiness of storage nodes.
This class is used to monitor the various reasons for unhealthy hosts.
This class contains stats for stats on the storage node host level.
An implementation of BasicFullHttpRequest which always uses a Http1Headers for holding the headers.
An implementation of BasicFullHttpResponse which always uses a Http1Headers for holding the headers.
 
An implementation of BasicHttpRequest which always uses a Http1Headers for holding the headers.
An implementation of BasicHttpResponse which always uses a Http1Headers for holding the headers.
This class is ExceptionHandler for all HTTP/2.
Initializes HTTP/2 Pipeline.
 
 
Logs the HTTP/2 Settings Frames.
 
 
Client-facing exception for HTTP/2 too many active streams exception with a clearer message for the clients.
HTTP/2 utility methods.
Created by acurtis on 3/23/17.
 
This utility class could construct a HttpClient5 based R2 Client.
 
This class is used to provide a facility to initialize httpclient5 lib based Http/2 Client.
 
 
 
 
 
 
 
Created by acurtis on 3/22/17.
This _only_ knows how to aggregate HttpMultiPart messages and "bad things"(tm) will occur when it encounters other types of messages.
Created by acurtis on 4/26/18.
Maps the provided HTTP response status HttpResponseStatus to one of 1xx, 2xx, 3xx, 4xx, 5xx categories.
An abstract handler which may be used to examine latencies in the Netty4 HTTP request processing.
 
Created by mwise on 3/11/16.
 
Created by acurtis on 3/23/17.
CloseableHttpAsyncClient based TransportClient implementation.
This will be used to stash reoccurring common bits of code.
Created by acurtis on 9/28/17.
This interface defines all the public APIs, and if you need to add accessors to some new fields, this interface needs to be changed accordingly.
 
A container of Hybrid Store related configurations.
 
 
Hybrid store quota status response for a resource; this is a response that will be returned by Router.
An interface for implementation of IC( invocation-context) provider class for calls between various deployable services.
An interface that provides a method to parse identity information from a certificate.
This class is used as an identity function to return the same bytes that were passed in the input to the output.
A record reader that returns the input key and value as is.
 
A Pair which may be used as a Map.Entry.
 
This exception is thrown when we detect a new producer which did not start with a ControlMessageType.START_OF_SEGMENT.
In order to dump the HTTP headers when there is a decode failure, surround the HTTP decode handler with these handles...example:
 
This exception is thrown when the server reads data from same segment after the segment is ended.
 
 
This is a fork of the standard HashMap which implements IndexedMap.
The IndexedMap interface implements Map and several functions for dealing with its content via the index corresponding to their insertion order.
The IndexedMap interface implements Map and several functions for dealing with its content via the index corresponding to their insertion order.
 
 
This class is used to process the AA/WC messages in parallel to address the long-tail partition lagging issues.
IngestionCommandType is an Enum class for specifying different commands for ingestion isolation.
IngestionComponentType is an Enum class for specifying different components for graceful shutdown in forked ingestion process.
 
 
 
 
IngestionMode is an Enum class that contains all modes for ingestion.
IngestionReportType is an Enum class for specifying different ingestion reports for ingestion isolation.
This class contains stats for store ingestion.
This class is the reporting class for stats class IngestionStats.
 
 
 
 
This throttler has the following functionality: 1.
In memory implementation of AdminTopicMetadataAccessor, should only be used for testing purposes.
In memory implementation of ExecutionIdAccessor, should only be used for testing purposes.
Used in unit tests in order to avoid spinning a full Kafka broker with network stack and disk IO.
A single Kafka message, strongly typed for the types that Venice uses.
 
 
In memory implementation of OffsetManager, should really only be used for tests
A simple non-persistent, in-memory store.
 
In memory implementation of StorageMetadataService, should really only be used for tests
An in-memory hashmap implementation of a storage partition Assumptions: 1.No need to worry about synchronizing write/deletes as the model is based on a single writer.
This interface lets users get input data information
A POJO that contains input data information (schema information and input data file size)
This class is used to keep track of store storage quota and storage overhead ratio and check whether the total input data size exceeds the quota
Class defines the Instance in Venice.
The class is used to measure the healthiness about the cluster the store belongs to.
 
 
This utility class provides functions to measure the heap size of objects for a limited number of classes.
Status of the instance in Venice cluster.
This class is not Thread-safe.
 
InternalAvroSpecificSerializer<SPECIFIC_RECORD extends org.apache.avro.specific.SpecificRecord>
Serializer for translating a versioned protocol of Avro records.
This class includes some necessary functions to deal with certain metric-handling activities that only the client implementation can be aware of.
All the internal implementations of different tiers should extend this class.
Abstract class for Fast-Client transport layer implementation, and this layer is mostly decided which methods need to be implemented at different stages.
 
Implementation of the VenicePartitioner class for test purpose.
 
This class is the implementation of ingestion backend designed for ingestion isolation.
 
 
IsolatedIngestionProcessStats is a metrics collecting class that aims to collect metrics from isolated ingestion process.
This class sends requests to monitor service in main process and retrieves responses.
This class is the server service of the isolated ingestion.
 
 
This is the handler class for IsolatedIngestionServer.
This class contains methods used for communication between ingestion client and server.
Utility methods for Iterators.
 
A Guid generator which internally uses UUID.randomUUID().
Utility methods for JDBC.
Hash algorithm by Bob Jenkins, 1996.
 
 
Response for querying job status.
 
This class is used to throttle records consumed per Kafka cluster
Custom Exception for Venice Kafka Consumer Related Issues.
KafkaConsumerService is used to manage a pool of consumption-related resources connected to a specific Kafka cluster.
This consumer assignment strategy specify how consumers from consumer pool are allocated.
This delegator impl is used to distribute different partition requests into different consumer service.
 
This class provides the stats for Kafka consumer service per region or per store.
This class is a library that can validate the Kafka message during consumption, which can be used in Venice-Server/Da-Vinci and ETL.
 
Zstd dict trainer for Kafka Repush.
 
 
We borrowed some idea from the open-sourced attic-crunch lib: https://github.com/apache/attic-crunch/blob/master/crunch-kafka/src/main/java/org/apache/crunch/kafka/record/KafkaInputFormat.java This InputFormat implementation is used to read data off a Kafka topic.
This class is a Combiner, which is a functionality of the MR framework where we can plug a Reducer implementation to be executed within the Mapper task, on its output.
This class is used to support secondary sorting for KafkaInput Repush.
The protocol between Mapper and Reducer with KafkaInputFormat for key field
The data protocol between Mapper and Reducer for KafkaInputFormat
This class is used for KafkaInput Repush, and it only considers the key part of the composed key (ignoring the offset).
We borrowed some idea from the open-sourced attic-crunch lib: https://github.com/apache/attic-crunch/blob/master/crunch-kafka/src/main/java/org/apache/crunch/kafka/record/KafkaRecordReader.java This class is used to read data off a Kafka topic partition.
We borrowed some idea from the open-sourced attic-crunch lib: https://github.com/apache/attic-crunch/blob/master/crunch-kafka/src/main/java/org/apache/crunch/kafka/record/KafkaInputSplit.java InputSplit that represent retrieving data from a single TopicPartition between the specified start and end offsets.
 
This class together with KafkaInputKeyComparator supports secondary sorting of KafkaInput Repush.
Class which stores the components of a Kafka Key, and is the format specified in the KafkaKeySerializer.
VeniceKafkaSerializer to encode/decode KafkaKey for Venice customized kafka message Used by Kafka to convert to/from byte arrays.
 
 
Assumes: One to One mapping between a Venice Store and Kafka Topic.
This class contains logic to dump Venice Kafka topics.
Serializer for the Avro-based kafka protocol defined in: KafkaMessageEnvelope This class needs to be defined explicitly, rather than just instantiating a InternalAvroSpecificSerializer because it is used as a config passed to the Kafka producer.
 
A helper class to return the same lock for the same raw key bytes.
 
 
 
 
 
Deprecated.
, use Gauge instead.
A dummy implementation which simply allocates a new object on every ObjectPool.get() and lets the garbage collector dispose of it rather than reusing.
 
 
Created by acurtis on 3/29/17.
This utility provides lazy initialization for a wrapped object.
Fork of the LazyInitializer with some additional APIs.
This, like Lazy, implements the same API as Optional and provides late initialization for its content.
 
 
Completion state of the leader partition.
Protobuf type com.linkedin.venice.protocols.controller.LeaderControllerGrpcRequest
Protobuf type com.linkedin.venice.protocols.controller.LeaderControllerGrpcRequest
 
Protobuf type com.linkedin.venice.protocols.controller.LeaderControllerGrpcResponse
Protobuf type com.linkedin.venice.protocols.controller.LeaderControllerGrpcResponse
 
 
This class used to override VeniceNotifier.catchUpVersionTopicOffsetLag(String, int) to release the latch when the resource is the current version.
Venice partition's state model to manage Leader/Follower(Standby) state transitions.
A leader session id checker will be created for each consumer action; server checks whether the session id is still valid before processing the consumer action.
 
State Transition Handler factory for handling Leader/Follower resources in the storage node.
The leader follower state thread pool strategy specifies how thread pools are allocated for Helix state transition.
 
This class contains the state transition work between leader and follower; both leader and follower will keep track of information like which topic leader is consuming from and the corresponding offset as well as the latest successfully consumed or produced offset in the version topic (VT).
 
This is a simple container class to hold Leader metadata related fields together to be passed on to the Put and Delete api in VeniceWriter Caller should construct an instance of this object by properly filling up all the fields of this object.
This class holds all the necessary context information which is passed from kafka consumer thread -> kafka producer callback thread -> drainer thread.
 
General purpose object reference leak detector with recovery.
LeakedPushStatusCleanUpService will wake up regularly (interval is determined by controller config ConfigKeys.LEAKED_PUSH_STATUS_CLEAN_UP_SERVICE_SLEEP_INTERVAL_MS), get all existing push status ZNodes on Zookeeper that belong to the specified cluster, without scanning through the replica statuses, find all leaked push status and delete them on Zookeeper.
LeakedResourceCleaner is a background thread which wakes up regularly to release leaked resources from local disk.
LeakedResourceCleanerStats records the occurrences of store resources get removed by LeakedResourceCleaner.
This strategy is trying to route the request to the least-loaded instances.
 
This class provides the unified way to manager venice listener.
Service that listens on configured port to accept incoming GET requests
 
 
 
 
This is a wrapper class on top of InternalBootstrappingVeniceChangelogConsumerImpl.
LocalThreadEventLoopGroup<E extends io.netty.channel.MultithreadEventLoopGroup>
A specialised EventLoopGroup which the MultithreadEventLoopGroup.next() method prefers to return an EventLoop for the current thread, if the current thread is a thread of the wrapped MultithreadEventLoopGroup.
Provides a configured VeniceChangelogConsumer instance.
 
 
 
Logs the Notification received.
This metric class is to optimize for high write throughput, low read throughput measurement use case instead of real-time measurement.
 
This class is a stats generator for dds router framework to record scattering/gathering related metrics.
 
This class is the listener service in main process which handles various kinds of reports sent from isolated ingestion service.
 
This class is the handler class for MainIngestionMonitorService.
This class sends requests to isolated ingestion process and retrieves responses.
MainIngestionStorageMetadataService is an in-memory storage metadata service for IsolatedIngestionBackend.
 
This class manages the ingestion status for each partition for a given topic when isolated ingestion is enabled.
MapOrderPreservingDatumReader converts map type serialized items into instances with a consistent ordering of entries.
MapOrderPreservingDeserializer is a Avro deserializer using MapOrderPreservingDatumReader.
 
Factory to create fast serializer/deserializer of supporting map ordering.
 
MapOrderPreservingGenericDatumWriter converts map type instances to in-memory serialized format * with a consistent ordering of map entries.
This class is a factory that creates MapOrderPreservingSerializer and MapOrderPreservingDeserializer with given schemas and cache them.
MapOrderPreservingSerializer is AvroSerializer that maintains a consistent ordering of map entries by using MapOrderPreservingGenericDatumWriter and MapOrderPreservingSpecificDatumWriter.
MapOrderPreservingSpecificDatumWriter converts map type instances to in-memory serialized format with a consistent ordering of map entries.
 
An implementation of EngineTaskConfigProvider to get information about a running job on the MapReduce engine.
 
Running checksum implementation based on BouncyCastle's implementation of MD5.
 
A subclass of LinkedHashMap which does a best-effort attempt at guessing its size on heap + the size of its values.
This class is a generic implementation of a memory bound blocking queue.
 
API for merging existing data with incoming writes.
This class handles byte-level merge.
The workflow is Query old replication metadata.
 
An object to encapsulate the results of conflict resolution to denote how the operation and value that should be applied or if the current update should be ignored.
This wrapper is used to keep the context after handling Active/Active messages.
Implementations of the API defined in Merge based on V1 metadata timestamp Schema generated by RmdSchemaGeneratorV1.
The purpose of this interface is to extract common merge logic.
 
 
A simple enum to map the values of KafkaMessageEnvelope.messageType N.B.: We maintain this enum manually because Avro's auto-generated enums do not support evolution (i.e.: adding values) properly.
 
MetadataAggregator is a custom HttpObjectAggregator that only aggregated HttpResponse messages for metadata.
MetadataFetchRequest encapsulates a GET request to /metadata/storename on the storage node to fetch metadata for that node.
This MetaDataHandle is used to handle the following meta data requests: 1.
 
This class stores all the information required for answering a server metadata fetch request.
This record will store version properties, key & value schemas, and routing information
 
MetadataUpdateStats records metrics related to storage metadata update via MainIngestionStorageMetadataService
All the data types supported in meta system store.
 
This class serves as the shadow read handler of the meta system store.
This class is to write metadata: store properties/key schema/value schemas/replica statuses to meta system store.
 
Collection of all available methods that a Principal can perform.
Metric entity class to define a metric with all its properties
Operational state of a metric.
create the metrics/Sensors
 
 
 
Utility functions to help create common metrics repository.
Utility functions to help test metrics
Metric type enum to define the type of metrics Venice supports via OpenTelemetry
Metric Unit enum to define list of Units supported for metrics
 
 
 
 
 
 
This exception is thrown when we detect missing data in the Kafka stream.
 
A mock time that takes a list of synthetic timestamps in MS and return them one by one in a circular manner.
 
A PubSubConsumerAdapter implementation which reads messages from the InMemoryKafkaBroker.
A PubSubProducerAdapter implementation which interacts with the InMemoryKafkaBroker in order to make unit tests more lightweight.
 
 
 
A mock AuthorizerService implementation to help in unit test.
 
 
 
 
All MapReducer job counters (identified by a group name + a counter name) are defined here and this util class provides method to increment counters and read counters
Bundle counter group name and counter name in this POJO
Message holder object for use with lazy logging.
 
This field will store all the related info for one record
 
 
 
This field will store all the related info for one key
MultiGetRouterRequestWrapper encapsulates a POST request to storage/resourcename on the storage node for a multi-get operation.
Keep track of the progress of a multi-key requests - batch get and compute.
represents a key requested on a route and tracks info related to it but specific to the route
 
 
MultiKeyRouterRequestWrapper is a base class for a multi-key operation.
 
 
 
 
 
 
 
 
 
 
 
 
 
Created by mwise on 5/3/16.
 
ThreadFactory which creates named threads for use in an ExecutorService.
A thread factory that sets the threads to create threads with a specific prefix name.
A repository of StoreName and StoreVersionName, which are intended to be used as shared instances.
Venice in-house implementation of a read only metadata repository where callers can retrieve various metadata such as Store objects and their corresponding schemas.
 
 
 
This class was copied from ChunkedWriteHandler, and the only change we made here is to using ConcurrentLinkedQueue for NettyChunkedWriteHandler.queue since NettyChunkedWriteHandler.resumeTransfer() or NettyChunkedWriteHandler.discard(Throwable) could happen in any sequence with any parallelism because of scatter/gather logic in Venice Router.
 
Netty based P2P blob transfer manager implementation.
Created by acurtis on 3/30/17.
 
 
 
 
 
 
 
 
 
Response to represent the status of a node
 
 
Locking is not necessary for NoopCompressor, so this class overrides all the public APIs to avoid locking.
 
 
 
 
No Op validator.
 
Warns that a class is not threadsafe.
This iterator traverses the entire backing iterator, while skipping over null entries.
This class is in charge of managing access and maintenance of a VeniceStoreCache.
 
 
An interface to get and give back objects that are intended to be long-lived and recycled, but where the location in the code for beginning to use the object is far from the location where we stop using it, e.g.
Accessor to execute the CURD operations for offline push and its replicas statuses.
Class stores all the statuses and history of one offline push.
Serializer used to convert the data between OfflinePushStatus and json.
 
 
Enum of strategies used to decide the when the data is ready to serve in off-line push.
This class records the offset for every pair(topic,partition) this node is responsible for.
If OffsetRecord is initialized with a serializer that contains SchemaReader, old version of server codes will be able to deserialize OffsetRecord that is serialized with a newer protocol version, which can happen after rolling back a server release with new protocol version to an old server release with old protocol version.
Look up online instances related to a topic's partition.
 
An implementation of a VeniceProducer suitable for online applications.
 
This class is used to optimize bytes field decoding.
 
This class is to reuse the original array for put payload of each message to reduce the unnecessary byte array allocation.
wraps raw bytes into an HTTP response object that HttpServerCodec expects
 
Peer-to-Peer (P2P) Blob Transfer Manager.
 
The client-side Netty handler to process responses for P2P file transfer.
The server-side Netty handler to process requests for P2P file transfer.
The client-side Netty handler to process responses for P2P metadata transfer.
A simple container for a pair of (2) values.
Deprecated.
Instead of this, please create a dedicated class with well-named, non-generic (potentially primitive) properties and getters.
 
This class is a utility class for Parent Controller store update logics.
Enum representing the state of the region where the parent controller resides.
Interceptor to verify that the parent controller is active before processing requests within its region.
 
 
 
 
This class is used to track the thread pool stats for the state transitions of the participant.
A helper class that wraps the readers and writers for the participant stores.
 
 
Class defines the partition in Venice.
This class represent the assignments of one resource including all of assigned partitions and expected number of partitions.
This class is used to maintain internal state for consumption of each partition.
This immutable class holds a association between a key and value and the source offset of the consumed message.
 
 
 
This interface defines all the public APIs, and if you need to add accessors to some new fields, this interface needs to be changed accordingly.
 
 
 
 
Provides methods to find the partition name for a piece of data.
Resource level partition health stats.
This class is for wrapping the information about the role of the partition replica on that host to AggKafkaConsumerService to achieve finer granularity of consumer assignment.
 
 
 
Sort the rows based on the key and value in ascending order using unsigned byte comparison.
This record holds the state necessary for a consumer to checkpoint its progress when consuming a Venice partition.
Class stores the status of one partition including all the replicas statuses in this partition.
An implementation of AbstractPushMonitor that listens to ZK PartitionStatus to determine the push status.
Serializer used to convert the data between PartitionStatus and json.
 
 
Listener used to listen the data change of partition status.
This class maintains state about all the upstream producers for a given partition.
 
 
 
PartitionWiseKafkaConsumerService is used to allocate share consumer from consumer pool at partition granularity.
 
 
The implementation of this method is mostly similar to DefaultChannelFuture except that we store the Channel inside a CompletableFuture because it is not yet known at the time of construction.
The implementation of this method is mostly similar to DefaultChannelFuture except that we store the Channel inside a CompletableFuture because it is not yet known at the time of construction.
This class maintains a simple counter for capacity management.
 
A list of permission that will determine what Method a Principal can perform on a Resource.
Thrown by the StorageEngine or Storage Partitions if storage fails
Enums of persistence types in Venice.
A class to represent a Persona, a relationship between stores and a quota system that is enforced.
This interface is used inside of the MockInMemoryConsumer in order to mess around with the way messages are delivered to the consuming code.
This interface exposes the commonalities between the Apache httpasyncclient and Netty and R2 client.
Utility methods for precondition checks.
Functor interface for performing a predicate test on GenericRecord
 
Represents an actor entry.
 
Utility methods for Processes.
 
A record containing the state pertaining to the data sent by one upstream producer into one partition.
 
PropertyBuilder is the suggested Way to construct VeniceProperties.
An adapter for PubSubAdmin to create/delete/list/update topics.
Generic admin factory interface.
 
 
A wrapper around pub-sub producer, consumer, and admin adapter factories This will be passed as one of the arguments to the component which depends on the pub-sub APIs.
Constants used by pub-sub components.
The purpose of this class is only to make the package private test-only functions of PubSubConstants available to all packages within tests.
An adapter for consuming messages from a Pub-Sub topic.
Generic consumer factory interface.
 
 
 
Uses AvroCoder to encode/decode PubSubMessages.
The class for deserializing messages from the pubsub specific message format to PubSubMessage
A key-value pair that is associated with a message
Set of key-value pairs to tagged with messages produced to a topic.
 
 
Used when an operation against PubSub failed to complete in time.
Represents a position of a message in a partition of a topic.
Factory class for creating PubSubPosition objects from wire format
Constants for the different types of positions Do not change the values of these constants.
 
The pub-sub producer interface with which venice writer's interact to send messages to pub-sub topic.
This class will spin up multiple producer instances to speed up the producing.
 
 
 
Generic producer factory interface.
A callback interface that users of PubSubProducerAdapter should implement if they want to execute some code once PubSubProducerAdapter#sendMessage request is completed.
A simple implementation of PubSubProducerCallback interface for testing purposes.
An interface implemented by specific PubSubProducerAdapters to return the result of a produce action.
This enum is equivalent to Kafka's SecurityProtocol enum.
 
Class for all Venice exceptions that are triggered by Kafka topic authorization related issues.
 
Represents a PubSubTopic configuration.
The source or destination topic for the replication request does not exit
 
 
 
 
 
 
 
 
 
 
This class is a simple schema builder to generate Venice Push Job input file schema.
Different successful checkpoints and known error scenarios of the VPJ flow.
 
 
0 => STARTED, 1 => COMPLETED, 2 => ERROR, 3 => NOT_CREATED, 4 => UNKNOWN, 5 => TOPIC_CREATED, 6 => WRITE_TO_KAFKA_COMPLETED, 7 => KILLED, 8 => END_OF_PUSH_RECEIVED, 9 => START_OF_INCREMENTAL_PUSH_RECEIVED, 10 => END_OF_INCREMENTAL_PUSH_RECEIVED
This interface provides methods to send push job heartbeats
This interface provides methods to let users create a heartbeat provider
This class carries the state for the duration of the VenicePushJob.
 
 
 
Response for uploading a job status record.
 
In Venice, Push is a task that asynchronously writes data to Venice.
This class maintains the mapping of Kafka topic to each PushMonitor instance and delegates calls to the correct instance.
This enum controls the behavior of how PushMonitorDelegator chooses proper PushMonitor for resources.
This class contains some common util methods for push monitoring purpose.
 
 
This class serves as a collector of offline push status for both Venice Server and Da Vinci clients.
Decide the offline push status by checking all replicas statuses under different offline push strategies.
 
Notifies both OfflinePushStatus and Helix Customized View
Push job status response for a resource; this is a response that will be returned by Router.
This class is a helper class for Venice controller to read PushStatus / Heartbeat messages.
 
Helper class for PushStatusStore related logic.
 
Wrapper class for acquiring VeniceWriter for push status stores.
PushStatusStoreWriter is a helper class for Da Vinci to write PushStatus and heartbeat message into PushStatus store real-time topic.
 
 
Put payloads contain a record value, and information on how to deserialize it.
This is a simple container class to hold replication metadata related fields together to be passed on to the Put api in VeniceWriter VeniceWriter.put(Object, Object, int, Callback, LeaderMetadataWrapper, long, PutMetadata).
 
A simple container with builder functions to sugar-coat the code a bit.
Imported this implementation from netty 3.2.4 because the implementation within 3.5.11 will corrupt uri paths, converting ';' semicolons into '&' ampersands.
A tool use thin client to query the value from a store by the specified key.
 
R2 based Transport Client Interface.
 
 
Created by acurtis on 12/12/16.
 
A simple PollStrategy which delivers messages from any partition, while respecting the ordering guarantee of individual partitions.
A ChunkingAdapter to be used when we want to read the value from storage engine as raw bytes.
 
A POJO containing a byte array and a serialized ChunkedKeySuffix or an object of ChunkedKeySuffix
TDOO: Consider merging with AvroProtocolDefinition
 
This class provides support to deserialize customized records even the input doesn't contain the full record.
Exception when the deserialization hits partial record.
This class contains two fields: 1.
 
 
 
 
A read-only wrapper of Store, and all the modification to delegated instance will throw UnsupportedOperationException.
A read-only wrapper of Version
Interface defined the way to retrieve the store config from a repository.
Interface defined readonly operations to access stores.
 
A type of Persona that enforces a read (bandwidth) limit across multiple stores.
 
 
This class define the throttler on reads request.
This is used to store common fields shared by various read responses.
This class is used to accumulate stats associated with a read response.
This class is used to record stats associated with a read response.
Enums of the strategies used to read data from multiple replications in Venice.
 
 
Interface defined read and write operations to access stores.
 
Class which implements the process of switching from a batch topic (e.g.
Schema which describes a record change event in a Venice store.
TODO: This class is a bit of a hack.
 
 
 
 
 
 
 
Utilities for reflection TODO This class may not be needed if we decide to proceed with Guice library for reflections and need to be discarded then
 
 
 
 
RelayNotifier is a VeniceNotifier which takes in a target VeniceNotifier and forward all actions to the target notifier.
Singleton class meant to keep track of subscription failures for ingestion tasks where the ingestion task needs to keep running despite cluster failures.
Replica is the basic unit to distribute data, replica is belong to a partition and running in a instance.
 
 
 
 
Class stores replica status and history.
This ReplicationMetadataRocksDBStoragePartition is built to store key value pair along with the timestamp metadata.
This class is registered in ZKClient to handle Replication metadata schema serialization.
 
 
An interface to report and retrieve metrics related to data writer tasks during the execution of a MapReduce job.
 
 
 
 
Store metadata class that uses the server's endpoint to fetch metadata and keep the local cache up to date.
Created by mwise on 3/16/16.
This class is used to include all the intermediate fields required for the communication between the different tiers.
Provide the appropriate headers based on the request type for all remote Venice clients.
 
 
 
 
 
 
 
Represents a resource identified by "name".
Assignments for all of resources.
 
This class maintains a map from resource of a certain type to its lock.
 
 
This interface defines the API to track the read usage per resource.
A registry to manage Shutdownable resources.
Factories must implement this interface.
Resources which implement ResourceRegistry.ShutdownFirst will be shut down during the first phase.
Resources which implement ResourceRegistry.ShutdownLast will be shut down during the last phase.
Resources which implement ResourceRegistry.Sync will be shut down after preceding resources have finished shutting down and further resources will shut down after it has completed shutting down.
 
 
Created by acurtis on 6/19/17.
 
 
This class is mostly used to trigger retry in the following scenarios: 1.
RetriableAvroSpecificStoreClient<K,V extends org.apache.avro.specific.SpecificRecord>
 
TODO: make retry work for compute request.
A non-thread safe counter that should be used and increased in a single thread env when the increment is called
 
 
 
 
This class implements a stream filter for writing compressed data in the GZIP file format.
This is an util class that contains constants related to replication metadata.
Container for the deserializers of a single store.
This holds the schema to perform conflict resolution on records when samza produces data in active active replication mode.
This is simply a wrapper class to delegate the actual schema generation to version specific adapter.
This class helps in generating a active-active metadata schema to hold update timestamps and offsets from a value schema of a store .
This class is responsible for serialization and deserialization related tasks.
Type of the replication metadata timestamp.
Ths class is responsible for deserializing RMDs and extract some information from it.
 
A POJO class that contains 3 things/fields: 1.
 
Class that exposes RocksDB memory consumption stats based on all properties that are made available in https://github.com/facebook/rocksdb/blob/master/include/rocksdb/db.h#L870 Properties exist on a per RockDB database basis (equivalent to a Venice partition).
 
 
Check TickerType to find more details about RocksDB metrics.
 
 
 
In RocksDBStoragePartition, it assumes the update(insert/delete) will happen sequentially.
This class is used to throttle RocksDB operations.
 
 
 
 
 
This class stores the stats for combined all type(SINGLE/MULTI/COMPUTE etc) of Router requests to the HttpAsync client compared to RouteHttpStats which stores only per type stats.
 
 
 
 
Router4<C extends io.netty.channel.Channel>
 
Router4Impl<C extends io.netty.channel.Channel>
 
Router4PipelineFactory<C extends io.netty.channel.Channel>
 
 
This push monitor is able to query hybrid store quota status from routers
 
 
This push monitor is able to query push job status from routers; it only works for stores running in Leader/Follower mode and it will be built for STREAM_REPROCESSING job.
Router based implementation of StoreSchemaFetcher.
Serializer used to convert data between RouterClusterConfig Object and JSON string
 
Exception class for checked exceptions within the Espresso Router.
Right now, there is no simple way to capture all the exception cases with DDS Router.
 
RouterHeartbeat is a service that monitors and reports the health of current live instances in the Venice cluster.
 
RouterKey encapsulates the required information for a key in a router request.
List all Metric entities for router
 
RouterRequest is an abstract base class for single-get and multi-get operations.
Monitors the stream, when it gets enough bytes that form a genuine object, it deserializes the object and passes it along the stack.
 
Cluster level metadata for all routers.
 
 
This interface define the operations used to manage all routers in a cluster.
 
 
 
 
Store-level access control handler, which is being used by both Router and Server.
 
 
 
 
 
Repository to access routing data like Partition and replica.
 
Enums of strategies used to route the key to partition in Venice.
A utility class to convert Spark SQL Row to an Avro GenericRecord with the specified schema.
 
A simple filter Runnable which will only invoke the contained runnable no more than once.
A wrapper around HelixDataAccessor which disables the unsafe functionality.
A wrapper around HelixManager which disables the unsafe functionality.
Various methods for stopping/exiting a Samza task.
Collection of ScatterGatherRequests for a request to the router.
 
 
Created by acurtis on 6/20/17.
Routing information for part of a request which the router has mapped to a single storage node.
 
 
ScatterGatherRequestHandlerImpl<H,P extends ResourcePath<K>,K,R,CHC,BHS extends BasicRequest,HR,HRS extends io.netty.handler.codec.http.HttpResponseStatus,SCATTER_GATHER_HELPER extends ScatterGatherHelper<H,P,K,R,BHS,HR,HRS>>
 
 
 
An analog to ScheduledExecutorService which extends ExecutorService.
 
Try to adapt the to the .
Used to cache the toString of a given schema, since it is expensive to compute.
Evaluate the compatibility between a reader schema and a writer schema.
Identifies the type of a schema compatibility result.
Provides information about the compatibility of a single reader and writer schema pair.
This class is used to store all the schemas related to a given store: 1.
 
SchemaEntry is composed of a schema and its corresponding id.
This class to use to serialize/deserialize Zookeeper node
Throw this exception when the schema is not fully compatible with the previous ones.
 
This class helps checks presence of a schema version in ZK.
This interface is considered as an internal interface to Venice codebase.
 
 
 
A high-level interface as a source specifically for RmdSchema.
 
 
 
This is a port of the com.linkedin.security.netty.ssl.access.control.SecureClientHandler class as a netty 4 implementation.
 
A segment is a sequence of messages sent by a single producer into a single partition.
 
Interface of class that is used to keep track of push job details sent to the Venice controller.
This store buffer services maintains two separate drainer queues for store ingestions.
 
For one given store, the client only needs the followings: 1.
 
Together with ServerStoreAclHandler, Server will allow two kinds of access pattern: 1.
 
 
 
 
 
 
 
ServerHttpRequestStats contains a list of counters in order to mainly measure the performance of handling requests from Routers.
 
A wrapper that holds reference for various repositories responsible for constructing metadata responses upon request.
For measuring quota requests and rejections for a given store.
We need to be able to record server side statistics for gRPC requests.
Together with ServerAclHandler, Server will allow two kinds of access pattern: 1.
This interface is used to announce the service to a Service Discovery system on startup and de-announce the service on graceful shutdown.
This class is responsible for retrying the registration of a ServiceDiscoveryAnnouncer in the case of registration failure.
 
 
 
 
This class is intended to only be used in Controller, which is sharing one instance across all the clusters.
This class is intended to only be used in Controller, which is sharing one instance across all the clusters.
 
A class containing shared kafka producer related configurations.
A factory class to create shared objects that need to release resources cleanly.
 
 
ShutdownableEventLoopGroup<E extends io.netty.channel.EventLoopGroup>
Created by acurtis on 3/30/17.
An Executor factory to be used with ResourceRegistry instead of using Executors
Example:
An ExecutorService interface which also extends the Shutdownable interface.
 
 
 
 
A ScheduledExecutorService interface which also extends the Shutdownable interface.
 
SimpleChannelInitializer<C extends io.netty.channel.Channel>
Created by acurtis on 4/27/18.
A VERY simple JSON mapper.
This VenicePartitioner implementation takes the first byte of the key, and computes the partition as such: firstByte % numPartitions It is meant to be used in tests where we want to control the partition that a key belongs to, but without needing to work out a proper input that yields the desired partition when hashed by the DefaultVenicePartitioner.
A simple implementation of PubSubProduceResult interface for testing purposes.
 
Single get chunking adapter
 
 
Created by acurtis on 3/30/17.
Created by acurtis on 3/30/17.
 
Field accessor using Reflection.
Helper class for the SlowFieldAccessor implementation.
This class provides thread-local variables with SoftReference semantics.
 
 
 
 
An implementation of AbstractInputRecordProcessor for Spark that processes input records from the dataframe and emits an Iterator of Row with DEFAULT_SCHEMA as the schema.
A Factory class to create individual SparkInputRecordProcessor for processing records in a Spark Dataframe.
Spark partitioning functionality in Dataframe and Dataset APIs is not very flexible.
 
 
Utility methods for restricting Scala-ism in Spark code in a single class.
 
A List implementation with some usability improvements around resizing.
A very simple subclass of SparseConcurrentList which adds some immutable offset to all index.
SpecificRecordChunkingAdapter<V extends org.apache.avro.specific.SpecificRecord>
 
SpecificRetriableStoreClient<K,V extends org.apache.avro.specific.SpecificRecord>
This class is necessary because Venice needs to maintain a separate interface: AvroSpecificStoreClient.
SpecificStatTrackingStoreClient<K,V extends org.apache.avro.specific.SpecificRecord>
This class is necessary because Venice needs to maintain a separate interface: AvroSpecificStoreClient.
Created by acurtis on 9/7/17.
 
 
 
 
 
Builds a new instance of SSLContext object from given key store and trust store parameters.
Created by acurtis on 4/19/18.
Factory interface to create SSLEngine and get SSLSessionContext objects for Netty4 pipelines
 
 
SSL Factory interface to get common SSL objects.
Venice SSL Factory interface.
Created by acurtis on 9/7/17.
 
This class is used to configure TLS for Venice components in integration tests.
 
This handler closes the connection when a ReadTimeoutException occurs.
This enum is used to denote why a particular store version was found to be stale
 
This ControlMessage is sent by the Controller, once per partition, after the EndOfPush ControlMessage, in Hybrid Stores that ingest from both offline and nearline sources.
This ControlMessage is sent per partition by each offline incremental push job, once per partition, at the beginning of a incremental push.
This ControlMessage is sent once per partition, at the beginning of a bulk load, before any of the data producers come online.
This ControlMessage is sent at least once per partition per producer.
This class notifies the Helix State Models (SM) about corresponding ingestion progress.
A StaticAccessController is an AccessController with an immutable resource list.
 
Currently, we only expose metrics for single-get and batch-get requests, and if there is a need to have metrics for other request types, we can add them later.
This class is in charge of all the metric emissions per request.
StatsAvroSpecificDaVinciClient<K,V extends org.apache.avro.specific.SpecificRecord>
 
StatsAvroSpecificStoreClient<K,V extends org.apache.avro.specific.SpecificRecord>
 
This enum tracks the error codes that we use to report anomalies in metrics.
 
 
 
This class is used to handle all the metric related logic.
Venice control message which is used to transfer command and status between storage node and controller, so that controller could control the whole cluster and make the global decision.
Channel used to send and receive control message.
Handler to deal with incoming StatusMessage instances.
The snapshot of status change.
Created by acurtis on 12/12/16.
 
 
 
An abstraction that represents the shared resources of a persistence engine.
This exception indicates that the server was unable to initialize on or more storage services or stores within a service.
StorageEngineMetadataService is wrapper service on top of storageEngineRepository to serve read/write to storage metadata.
 
A wrapper class that holds all the server's storage engines.
thrown by StorageEngineFactory and AbstractStorageEngine when storage creation fails
This is a superset of the OffsetManager APIs, which also provide functions for storing store-version level state.
 
 
Class used to represent the status of storage node.
Storage partition level config, which could be used to specify partition specific config when initializing/opening the corresponding AbstractStoragePartition.
This class maintains in-memory partition usage.
This exception indicates that the server was unable to initialize on or more storage services or stores within a service.
A type of Persona that enforces a storage (size) limit across multiple stores.
Zookeeper accessor for StoragePersona objects, allowing for read/write to persona paths.
Serializer used to convert the data between Persona and json.
This class annotates the constructor for StoragePersona and serves as a property-based creator.
Cached wrapper for the StoragePersonaAccessor class, which keeps a map of store names to personas that contain them, as well as caches Zookeeper personas.
Helper class which listens to changes from a ReadOnlyStoreRepository and updates the caches and ZK of the StoragePersonaRepository accordingly.
 
 
StorageReadRequestHandler will take the incoming read requests from routerRouterRequest, and delegate the lookup request to a thread pool StorageReadRequestHandler.executor, which is being shared by all the requests.
Storage interface to Venice Server, Da Vinci and Isolated Ingestion Service.
This class has the following responsibilities: 1.
This interface defines all the public APIs, and if you need to add accessors to some new fields, this interface needs to be changed accordingly.
This type of metadata contains various store properties/configs and describes the clusters that store is materialized in.
StoreAwarePartitionWiseKafkaConsumerService is used to allocate share consumer from consumer pool at partition granularity.
 
 
 
This service is in charge of cleaning up backup versions based on retention policy defined on store basis.
 
This class is serving as a PubSubMessage buffer with an accompanying pool of drainer threads.
 
 
 
 
This type describes the various Venice cluster information for a store
 
 
Configurations of a store which are non-cluster specified.
 
 
 
This is a class used to manage multiple replicas of the same store.
Interface used to register into metadata repository to listen the change of store data.
 
 
 
The graveyard used to keep all deleted stores.
This service is in charge of removing stores under /venice//StoreGraveyard zk path.
 
 
Json-serializable class for sending store information to the controller client
An interface for Store Ingestion Service for Venice.
A runnable Kafka Consumer consuming messages from all the partition assigned to current node for a Kafka Topic.
 
Builder class for StoreIngestionTaskFactory; once the build() function is invoked, no modification is allowed on any parameters.
Serializer used to convert the data between Store and json.
 
Mixin used to add the annotation to figure out the constructor used by Jackson lib when deserialize the version
 
Mixin used to add the annotation to figure out the constructor used by Jackson lib when deserialize the store.
Mixin used to add the annotation to figure out the constructor used by Jackson lib when deserialize the version
 
Throw this exception when try to add key schema to a store, whose key schema already exists.
This type of store metadata describes the key schemas of the store
This type describes the key schemas of the store
 
This interface defines a set of lifecycle events for stores and their store-versions.

Pre-hooks and post-hooks

Most events have a pre- and post- variant, all pre-hooks provide an option to control the outcome of the event (e.g.
Interface used to register into metadata repository to listen the change of store list.
This interface defines the APIs to retrieve store metadata and routing data, and it also includes the feedback APIs: StoreMetadata.trackHealthBasedOnRequestToInstance(java.lang.String, int, int, java.util.concurrent.CompletableFuture<com.linkedin.venice.client.store.transport.TransportClientResponse>) to decide the healthiness of each replica.
Modes that control how fast client will fetch store metadata
A composite key that divides the key space to retrieve different types of metadata for a Venice store.
 
 
A composite key that divides the key space to retrieve different types of metadata for a Venice store.
 
 
This class is a handle to refer to a store.
 
 
 
This type contains all the store configs and the corresponding versions
MetadataFetchRequest encapsulates a GET request to /store_properties/storename on the storage node to fetch metadata for that node.
This class stores all the information required for answering a server metadata fetch request.
This record will store store properties
This structure will contain all kinds of info related to one replica
StoreRepushCommand contains the details of executing/processing repush command.
 
 
 
 
This class is the public interface for Venice store user to fetch store schemas.
 
This class retrieves the corresponding Store object from a router endpoint which contains the latest store state.
 
 
 
Control description used to notify controller that the status of Offline push in Storage node.
 
This type describes a single version of the value schema of the store.
This type of store metadata describes the value schemas of the store.
This type describes the value schemas of the store.
This class implements the fast value schema/latest value schema lookup with acceptable delay.
Type describes all the version attributes
A hook which returns this enum has the option of proceeding, aborting, waiting or rolling back.
This class is a handle to refer to a store-version.
 
This record maintains store-version level state, such as the StartOfBufferReplay Control Message, in the case of Hybrid Stores.
 
A configuration for a particular view.
A configuration for a particular view.
 
This class handles Write Compute operations related to a specific store.
 
 
 
This record will store all the additional info after sending out streaming response
This class provides percentile tracking and stats emission for streaming response
 
This class serves as the annotated schema cache for merge conflict resolution purpose in Active/Active replication.
 
 
 
 
Interface defined subscription based readonly operations to access stores.
 
A simple implementation of AsyncFuture which behaves as if the Future has already completed successfully.
Full response to indicate that all the sub responses are with good status: HttpResponseStatus.OK for streaming request.
 
 
This class would copy the specified SupersetSchemaGeneratorWithCustomProp.customProp from the latest value schema to the generated superset schema and in the meantime, the compare method in this impl will consider this extra property as well.
 
 
 
 
This SystemStore class is a wrapper of the corresponding zk shared system store and the regular venice store to provide the exact same interface as the regular Venice Store for System Stores.
Currently, acl creation is still managed by Nuage but Nuage is not aware about the system stores.
 
 
This class is the metric class for SystemStoreRepairService
 
This class contains the logic to set up system stores.
 
 
This type describes all the distinct properties
This class is the system store repair service in Venice parent controller.
This class tries to scan all cluster which current parent controller is the leader controller.
Time implementation that just reads from the system clock
This type of store metadata contains the target version states announced by the parent controller leader for a given store that each fabric should be bootstrapping towards.
An interface to report task health and track task progress for different execution engines
Interface for creating metric names enum for tehuti metrics
Utils for venice metrics
Generate a ratio stat that is based on two arbitrary Rate.
Generate a ratio stat that is based on two arbitrary MeasurableStat.
 
A task that periodically polls the job state for existing version topics in the parent fabric.
 
A test log appender that can be used to verify log messages in unit tests.
A Time implementation which abstracts logical time away from real time, so that it can be deterministically controlled.
 
General-purpose utility functions for tests.
 
 
 
 
 
 
 
 
An ExecutorService that executes each submitted task using one of possibly several pooled threads, normally configured using Executors factory methods.
 
Stats used to collect the usage of a thread pool including: 1.
Claims that a class is threadsafe or admonishes implementors of an interface to ensure that their implementation is threadsafe.
Utility methods for time travelling and freezing time.
The interface to time itself.
 
This utility class provides methods to format and parse a timespan in a human readable form.
More efficient handling of cancellable schedulable events.
 
 
 
A ThreadPoolExecutor that can additionally schedule commands to run after a given delay, or to execute periodically.
A class to encapsulate time values along with their measurement units Works for both timestamps as well as time durations.
 
The topic cleanup in Venice adopts the following strategy: 1.
In parent controller, TopicCleanupServiceForParentController will remove all the deprecated topics: topic with low retention policy.
 
 
Topic manager is responsible for creating, deleting, and updating topics.
A context object that contains all the dependencies needed by TopicManager.
 
 
A repository of TopicManager instances, each associated with a specific PubSub region and cluster.
This class is used to find messages for a given key from a specified topic.
 
This class is a wrapper of pair of <Version Topic, Version /Real time topic partition>.
 
 
This class tracks consumed topic partitions' offsets
This ControlMessage is sent by the Controller, once per partition; it will only be used in leader/follower state transition model; this control message will indicate the leader to switch to a new source topic and start consuming from offset with a specific timestamp.
This class precomputes the conversion from Avro's CharSequence to String, and also hangs on to the correct instance of PubSubTopic, so that the lookup need not be performed in many locations.
TopicWiseKafkaConsumerService is used to allocate share consumer from consumer pool at topic granularity.
Extend this class to create response objects for async command.
This interface exposes one more function: TrackingStreamingCallback.onDeserializationCompletion(Optional, int, int), which will be used for metric tracking.
This PubSubProducerAdapter implementation allows tests to perform arbitrary transformations on the messages that are about to be written to Kafka.
 
 
Define interfaces for TransportClient, and the target customer is the sub-classes of AvroGenericStoreClient
Define the common functions for call back of TransportClient
This class is used to capture the result returned by TransportClient
 
Callback to support streaming in TransportClient
This class implements a generic trie-based lookup tree.
The policy controls the TTL behavior regarding how batch writes are treated.
Interface for a generic typed field accessor.
Helper class to determine at run time which implementation to use.
 
 
Thrown when a required property is not present
This class makes use of UnresolvedUnionException, which is not present in all versions of Avro.
This is thrown when some code encounters a message type it does not know about.
This service runs in the parent controller to delete historical unused value schemas.
Partial update operation, which merges the update value with the existing value.
This class applies the builder pattern to build a partial update record.
 
 
This enum represents status of something (a record or a field) after being updated.
 
 
 
 
Copies of the implementation of URLEncoder.encode(String, String) and URLDecoder.decode(String, String) except with the following changes: Accept a Charset argument for the character set instead of a string which would cause Charset.forName(String) to be called. Internally use StringBuilder instead of StringBuffer. These methods no longer throw a checked exception because it no longer calls Charset.forName(String). These methods are annotated with Nonnull annotations.
 
This class is responsible for automatically create and delete per user store system store resources when the corresponding user store is created or deleted.
Helper functions
This class encapsulates config entity information such as config name, default value, config document
 
Controller API version 1 HTTP GET routes
 
 
 
Mapper only MR to Validate Schema, Build compression dictionary if needed and persist some data (total file size and compression dictionary) in HDFS to be used by the VPJ Driver Note: processing all the files in this split are done sequentially and if it results in significant increase in the mapper time or resulting in timeouts, this needs to be revisited to be done via a thread pool.
The data protocol between ValidateSchemaAndBuildDictMapper and VenicePushJob
This class reads the data(total input size in bytes and zstd dictionary) persisted in HDFS by ValidateSchemaAndBuildDictMapper based on the schema ValidateSchemaAndBuildDictMapperOutput
This class provides a way to: 1.
Interface for targeted region push validation.
 
This class holds a value of type and its corresponding replication metadata.
Optional field which describes the previous state of a given row.
 
This class provides the following functionalities: 1.
 
 
 
A runtime exception which may be thrown by implementation of AuthorizerService interface.
 
 
 
This class is a thin wrapper of AvroSerializer and AvroGenericDeserializer.
 
A record reader that reads records from Avro file input into Avro-serialized keys and values.
Classes for P2P use case that are triggered by blob transfer file not found in target host
 
Venice change capture consumer to provide value change callback.
 
 
 
Beam Connector for Venice Change data capture IO.
 
 
Interface to provide a configured VeniceChangelogConsumer instance.
A Runnable that throws checked exceptions.
A Supplier that throws checked exceptions.
 
 
 
This class is responsible to filter records based on the RMD information and the ttl config by taking chunked records ChunkAssembler.ValueBytesAndSchemaId.
This is the class to talk to VeniceChunkedWriteHandler to send back partial response.
This class is the wrapper of ChunkedWriteHandler, and essentially it will introduce the following logic: 1.
 
Thrown by the venice thin client if something goes wrong with the request Parent class to VeniceClientHttpException
VeniceClientException with a http code
 
Protobuf type com.linkedin.venice.protocols.VeniceClientRequest
Protobuf type com.linkedin.venice.protocols.VeniceClientRequest
 
This class copies most of the logic from DefaultClientTlsStrategy and AbstractClientTlsStrategy to get rid of the cipher check to be backward compatible.
This class copies most of the logic from ClientTlsStrategyBuilder to get rid of the cipher check while using http/2 in TlsCiphers to be backward compatible.
class that maintains config very specific to a Venice cluster
 
 
 
 
VeniceConfigService loads the static configuration that does not change once the Server is started.
 
Venice Controller to manage the cluster.
Configuration which is specific to a Venice controller.
VeniceControllerContext contains dependencies required by VeniceController
 
Protobuf type com.linkedin.venice.protocols.controller.VeniceControllerGrpcErrorInfo
Protobuf type com.linkedin.venice.protocols.controller.VeniceControllerGrpcErrorInfo
 
 
 
 
 
 
This class is a gRPC service implementation for the VeniceController public API.
 
 
The core handler for processing incoming requests in the VeniceController.
A service venice controller.
State model used to handle the change of leader-standby relationship for controllers.
 
This class contains all the ScatterGatherMode being used in Venice Router.
 
Factory to create VeniceDistClusterControllerStateModel and provide some utility methods to get state model by given cluster.
N.B.: Although there is no way to force this via Java interfaces, the convention is that all enums implementing this interface should have static "valueOf" function to return the correct enum value from a given numeric value, i.e.: Note that VeniceEnumValueTest makes it easy to test the above, and we should have a subclass of that test for all implementations of this interface.
Abstract class which makes it as easy as possible to generically test all the assumptions for enums which implement the VeniceEnumValue interface.
Base exception that all other Venice exceptions extend
Custom Input Format with the following specs to be used for the feature PushJobSetting.useMapperToBuildDict with ValidateSchemaAndBuildDictMapper 1.
A specialized DefaultFullHttpResponse object to record the decompression time of the records in the response.
 
 
 
 
 
 
 
 
A table format that is used by Spark to read Avro files from HDFS for use in VenicePushJob.
This is the entrypoint of the Avro input source.
Helix Admin based on 0.8.4.215 APIs.
 
{code VeniceHostFinder} finds a set of hosts that can serve the given Venice resource partition.
VeniceHostHealth the aggregate statistics for HostHealthStats.
This utility is used to manage all the HTTP/2 related configs and offer a way to spin up a pipeline to support both HTTP/1.1 and HTTP/2.
 
Customized exception for inconsistent Avro schema error of input directory in VenicePushJob
This exception is thrown when we detect inconsistent or possibly corrupted store metadata on storage nodes.
 
 
Simple class that spins JVM platform stats into Venice stats.
 
This class is designed specifically for KafkaInputFormat, and right now, it is doing simple pass-through.
This class is designed specifically for KafkaInputFormat, and right now, it will pick up the latest entry according to the associated offset, and produce it to Kafka.
This class is responsible to filter records based on the RMD information and the ttl config by taking KIF input KafkaInputMapperValue.
Map objects to byte arrays and back again
 
Venice wrapper around a Lock object to record metrics and emit logs when lock acquisition is taking too long.
 
Uses KvCoder to encode/decode PubSubMessage key and value.
Custom Exception for Venice messages.
VeniceMetadataRepositoryBuilder is a centralized builder class for constructing a variety of metadata components including store repository, schema repository, ZK-shared schema repository, ZK client and cluster info provider for Da Vinci, Venice Service and Isolated Ingestion Service.
 
Configuration for metrics emitted by Venice: Holds OpenTelemetry as well as Tehuti configs
Configs starting with "otel.venice." are venice specific configs for OpenTelemetry metrics
other configs starting with "otel.exporter." are generic OpenTelemetry exporter configs but are parsed in this class and used setters to configure otel exporter.
 
 
This is used to initialize a Metrics for each request.
Repository to hold both tehuti and OpenTelemetry metrics.
VeniceMRPartitioner will use the exact same partitioner as the one that will be used inside VeniceWriter (e.g.
 
 
This enum is used to define various routing strategies for multi-key requests.
 
 
Thrown when an operation should return information about a store, but the store does not exist
Interface for listening to Notifications for Store consumption.
This class defines a wrapper object to wrap up the actual object and the corresponding update timestamp.
Venice implementation of OfflinePushAccessor.
 
 
Handler for checking the state of the region of the parent controller to handle requests in its region.
This class is a wrapper of VeniceHelixAdmin, which will be used in parent controller.
Determines partitioning, which is used for producing messages into the right Kafka partitions and routing reads to the correct Venice storage nodes.
VenicePartitionFinder provides methods to find the partition name or number for the given data.
 
Inbound single get request to the router will look like: GET /storage/storeName/key?f=fmt 'storage' is a literal, meaning we will request the value for a single key storeName will be the name of the requested store key is the key being looked up fmt is an optional format parameter, one of 'string' or 'b64'.
Created by mwise on 4/25/16.
Classes for P2P use case that are triggered by founded peers cannot connect
Classes for P2P usecase that are triggered by peers not found
The API for online applications to write to Venice.
 
 
A Pulsar Sink that sends messages to Venice.
Configuration class for the Sink Connector.
This class sets up the Hadoop job used to push data to Venice.
 
 
 
Enumeration of response status codes for Venice read requests.
 
 
 
 
 
 
 
 
VeniceReducer will be in charge of producing the messages to Kafka broker.
This class simply exposes publicly the protected methods of its parent, for debugging purposes.
 
Venice resource that could be refreshed to retrieve the newest data or clear the current data in memory.
Class for all Venice exceptions that are triggered by Kafka topic authorization related issues.
 
VeniceResponseAggregator aggregates the sub-responses returned by storage node for a multi-get request.
CompletableFuture implementation to handle partial response.
This class is used to handle all the decompression related logic in Router, and it will be used in VeniceResponseAggregator for regular requests and VeniceDispatcher for streaming requests.
Venice customized map, which could contains either a full response or a partial response.
 
How Venice categorizes the response status of a request: We are emitting both HttpResponseStatusCodeCategory and this enum to capture the http standard as well as the Venice specific categorization.
Interface for all Venice exceptions that are retriable.
This class is responsible to filter records based on the RMD information and the ttl config.
 
 
Common route handler implement the common part of error handling and writing response to json.
Configuration for Venice Router.
 
 
Customized exception for non-existing key/value field error in Avro schema in VenicePushJob
 
Interface defines how to serialize and deserialize the venice object.
Class that represents the Venice server.
VeniceServerConfig maintains configs specific to Venice Server, Da Vinci client and Isolated Ingestion Service.
 
VeniceServerContext contains dependencies required by VeniceServer
 
 
 
Protobuf type com.linkedin.venice.protocols.VeniceServerResponse
Protobuf type com.linkedin.venice.protocols.VeniceServerResponse
 
 
Creates socket connectors.
Spark server implementation
VeniceSparkPartitioner will use the exact same partitioner as the one that will be used inside VeniceWriter (e.g.
Creates instances of embedded jetty containers.
 
 
Thrown when an operation should create a store, but the store already exists
Interface for a cache on a venice store.
 
 
 
 
 
Customized exception for receiving invalid VersionCreationResponse in VenicePushJob
Thrown when an operation should return information about a store, but the store is migrated to another cluster
class that maintains all properties that are not specific to a venice server and cluster.
 
Samza jobs talk to either parent or child controller depending on the aggregate mode config.
VeniceSystemProducer defines the interfaces for Samza jobs to send data to Venice stores.
Enum used to differentiate the different types of Venice system stores when access their metadata.
 
 
Used when timeout happens in Venice.
 
Enums that lists most Venice user store types.
Customized exception for validation error for targeted colo push in VenicePushJob
 
 
VeniceVersionFinder provides methods to find the version for a given store.
This is the abstract implementation of the 'management' component for a venice view.
This class is kept separate from the VeniceView class to not leak certain concepts that currently exclusively reside in the server to other components.
 
 
Mapper that reads Vson input and deserializes it as Avro object and then Avro binary
A record reader that reads records from VSON SequenceFile file input into Avro-serialized keys and values.
Class which acts as the primary writer API.
 
An interface which enables the key to contain parts of the within it, which is useful for control messages and chunked values.
Factory used to create VeniceWriter.
VeniceWriterOptions is used to pass arguments to VeniceWriter constructor.
 
A host level metric to track number of active venice writers.
This class contains constants that represent Venice-managed ZooKeeper paths.
Currently this VerifySslHandler is only used in servers or the isolated ingestion process.
This interface defines all the public APIs, and if you need to add accessors to some new fields, this interface needs to be changed accordingly.
Producer type for writing data to Venice
 
 
Class defines the version of Venice store.
 
 
 
Enums of status of verion.
This controlMessage is written to the real-time topic by the controller or to the store-version topic by the current version's leader server.
 
 
 
 
 
 
 
This class contains some utils methods for VPJ jobs to obtain SSL-related configs.
Deprecated.
This class supports the special byte[] check.
This class supports the special byte[] check.
Deprecated.
 
 
Deprecated.
Deprecated.
Deprecated.
Deprecated.
Deprecated.
Deprecated.
This class was originally from Voldemort.
 
Deprecated.
Wait all strategy stands for Venice do not allow one replica to be failed.
Wait N-1 strategy stands for Venice could tolerate one of replica to be failed for each partition.
This utility class contains all String literal constants used in write compute.
This interface provides methods to execute write-compute operation a value
Write compute V1 handles value records that do not have replication metadata.
Write compute V2 handles value records with replication metadata.
This enum describe the possible write compute operations Venice supports.
This class is able to read write-computed value and apply it to original value.
This result wrapper is for non-Active-Active stores.
This class converts a Avro Schema.Type.RECORD schema to its write compute schema.
validate if a write-compute schema can be pair with value schema
 
This class serves as a latency sensor for write path that contains two types of stats: Avg and Max.
This class is a helper class that contains writer side chunking logics.
 
 
Class used to access to the allowlist stored in ZK.
 
The stats keep track of ZK Client status changes.
This class contains methods to 1)migrate Venice-specific metadata from a source ZooKeeper (ZK) to a destination ZK and 2)extract Venice-specific paths from an input text file containing ZK paths to an output text file.
 
 
The purpose of this class is to abstract Helix operations out of the VeniceHelixAdmin and eventually rename it to VeniceAdmin.
Manage live routers through Zookeeper.
Class defines the store of Venice.
The class is used to access the store configs in Zookeeper.