All Classes and Interfaces
Class
Description
Created by acurtis on 5/1/18.
Read compute and write compute chunking adapter
This abstract class contains all the common field and APIs for compute request builder;
for each new compute request version, there will be a new builder class that extends
this class and the new builder will include customized fields and APIs.
An abstraction for a record reader that reads records from input into Avro-serialized keys and values.
Forked from Netty's AbstractCoalescingBufferQueue (4.1.42)
Includes fix to guard readableBytes from re-entry.
The implementation of
DataWriterComputeJob
for Spark engine.Class for commonalities between
AbstractInputRecordProcessor
, AbstractPartitionWriter
,
KafkaInputFormatCombiner
and ValidateSchemaAndBuildDictMapper
.A copy of the netty Http2StreamFrameToHttpObjectCodec except with the
methods which actually return the converted objects have been made abstract.
An abstraction of the task that processes each record from the input, and returns serialized, and potentially
compressed, Avro key/value pairs.
An abstraction of Helix State model behavior registry that defines how participants
(Storage node) fetch data from Kafka.
An abstraction of the task that processes all key/value pairs, checks for duplicates and emits the final key/value
pairs to Venice's PubSub.
Using Avro Json encoder to print duplicate keys
in case there are tons of duplicate keys, only print first
AbstractPartitionWriter.DuplicateKeyPrinter.MAX_NUM_OF_LOG
of them so that it won't pollute Reducer's log.A base class which encapsulates the common plumbing needed by all
PollStrategy
implementations.AbstractPushMonitor is a high level abstraction that manages
OfflinePushStatus
.This class is used to store common fields shared by various read responses.
This abstract class is the container for response stats.
Streaming callback for batch-get/compute.
A simplified implementation of a
ShutdownableResource
.ModelFactory manages Helix state model creation.
A base storage abstract class which is actually responsible for data persistence.
An abstract implementation of a storage partition.
This is an abstraction of metadata maintained per Store.
Store-level access control handler, which is being used by both Router and Server.
Abstract class capturing the responsibilities of drainers threads doing store ingestion.
This class is an aggregate place that keeps stats objects for multiple stores.
An abstraction to filter given data type.
An abstraction of the mapper that would return serialized, and potentially
compressed, Avro key/value pairs.
A generic implementation of the
VeniceProducer
interfaceAn abstraction for a record reader that reads records from the configured input into Avro-serialized keys and values.
Blueprint for all Services initiated from Venice Server
A base class which users of
VeniceWriter
can leverage in order to
make unit tests easier.Deprecated.
An AccessController allows a request to be checked against an Access Control List (ACL).
Represents an Access Control Entry.
Represents an Access Control Binding.
Update access controller resource list when a store is created/deleted
This class contains logic that SNs must perform if a store-version is running in Active/Active mode.
Counts the number of the active HTTP/2 Streams
This class AdHash is used to keep track of the checksum of the data incrementally.
Running checksum implementations based on Adler32.
The execution object is a kind of context for a admin command including essential information used to track the
status of this command.
This class is used to track the execution of the async admin command.
One consumer service for one cluster.
This class is used to create a task, which will consume the admin messages from the special admin topics.
This class is used to create
Callable
that execute AdminOperation
s for a given store.AdminRequest
encapsulates an admin request from server admin tools.This class stores all the information required for answering a server admin request.
Response record for admin request, version 1
Controller admin API leveraging sparkjava: http://sparkjava.com/documentation.html
This class provides aggregate initialization support for host level ingestion stats class
HostLevelIngestionStats
AggKafkaConsumerService
supports Kafka consumer pool for multiple Kafka clusters from different data centers;
for each Kafka bootstrap server url, AggKafkaConsumerService
will create one KafkaConsumerService
.This class is an aggregate place that keeps stats objects for multiple stores and total stats for each region for
AggKafkaConsumerService.
Monitor the change of Helix's external view and warn in case that any partition is unhealthy.
Right now, Venice SN only reports aggregated metrics for RocksDB.
AggServerHttpRequestStats
is the aggregate statistics for ServerHttpRequestStats
corresponding to
the type of requests defined in RequestType
.AggServerQuotaUsageStats
is the aggregate statistics for ServerQuotaUsageStats
which
measures requests and quota rejections of each store.The store level stats for blob transfer
The store level stats or the total stats will be unpopulated because there is no easy and reliable way to aggregate
gauge stats such as rt topic offset lag.
Interface defines the ways to access to the allowlist of Helix NodeId.
An implementation of
PubSubAdminAdapter
for Apache Kafka.Implementation of
PubSubProducerAdapterFactory
used to create Apache Kafka admin clients.This class is not thread safe because of the internal
KafkaConsumer
is not thread safe.Configuration for Apache Kafka consumer.
Offset position for Apache Kafka topics
A wrapper over Apache Kafka producer which implements
PubSubProducerAdapter
Implementation of
PubSubProducerAdapterFactory
used to create Apache Kafka producers.A Kafka specific callback which wraps generic
PubSubProducerCallback
This class holds all properties used to construct
ApacheKafkaProducerAdapter
(This class could be refactored to hold consumer properties as well).Converts RecordMetadata to
PubSubProduceResult
CompletableFuture implementation, which is able to track the timeout behavior when happening.
A
PollStrategy
implementation which delivers messages in the order specified
at construction time.TODO: Merge this with
ControllerApiConstants
This iterator traverses the entire backing array, while skipping over null entries.
Expose an array of
ArrayCollection
as an immutable Collection<E>
Changes to the array are reflected in the collection.A thread-safe
SortedSet
implementation which uses an array as a backing store.A
Future
which has very netty-style semantics where AsyncFutureListener
can
be added to the future to be executed upon condition of Success or Failure has been satisfied.An interface (lambda) which is fired upon completion of an
AsyncFuture
.Created by acurtis on 1/26/18.
A completable
AsyncFuture
interface.This class is responsible for announcing and de-announcing a
ServiceDiscoveryAnnouncer
to a Service Discovery system
and taking care of failed registrations by retrying them asynchronously.An interface to provide functionality to manage ACL's for a
Resource
and provide access permission Permission
to principals Principal
when performing Method
.Concrete implementations are expected to lock their inner lock(s) in their own constructor.
This class should be used for experiments only.
This comparator is used to compare GenericRecord collection field elements which must have schemas.
This class is used to build a
ComputeRequestWrapper
object according to the specification,
and this class will invoke AbstractAvroStoreClient
to send the 'compute' request to
backend.Deprecated.
Venice avro generic client to provide read compute operations.
Venice avro generic client to communicate with Venice backend for key-value lookup.
AvroGenericStoreClient
implementation for Avro generic type.This enum lays out the basic specs of the various stateful protocols used in Venice.
AvroSerializer
provides the functionality to serialize objects by using Avro.AvroSpecificDeserializer<T extends org.apache.avro.specific.SpecificRecord>
AvroSpecificStoreClient for type safety purpose.
AvroSpecificStoreClient
implementation for Avro SpecificRecord.AvroSpecificStoreDeserializerCache<V extends org.apache.avro.specific.SpecificRecord>
Container for the deserializers of a single store.
Utility intended to convert Avro -> SQL, including DDL and DML statements.
Created by acurtis on 5/4/17.
Enums of the strategies used to backup older store versions in Venice.
This class is used to periodically scan inactive store versions and perform optimization if the inactive period
of some store version meets the pre-configured threshold, this class will trigger the database reopen action to
unload unnecessary RAM usage, which was built up when there were active reads coming.
BackupVersionOptimizationServiceStats
record the statistics for the database optimization done by the
BackupVersionOptimizationService
including both successes and failures.Created by acurtis on 9/28/17.
This class offers very basic metrics for client, and right now, it is directly used by DaVinci.
Created by acurtis on 3/22/17.
Created by acurtis on 3/27/17.
Created by acurtis on 4/19/17.
A combination of
HttpRequestEncoder
and BasicHttpResponseDecoder
which enables easier client side HTTP implementation.A replacement for the Netty HttpContentEncoder which can handle use in client and server pipelines.
Created by acurtis on 3/22/17.
Created by acurtis on 3/24/17.
Created by acurtis on 3/23/17.
Decodes
ByteBuf
s into HttpResponse
s and
HttpContent
s.BasicServerChannelInitializer<C extends io.netty.channel.Channel,FACTORY extends BasicServerChannelInitializer<C,FACTORY>>
This API controls the behavior of the user payload deserialization phase of the
batch get response handling.
This enum controls the behavior of the user payload deserialization phase of the batch get response handling.
Batch get chunking adapter
Keep track of the progress of a batch get request .
Key part of the heartbeat sent by a push job.
Value part of the heartbeat sent by a push job
Read-through cache for items retrievable by two positive integer IDs.
This
BatchDeserializer
does nothing, and always returns an empty list.This class will manage the snapshot creation, for batch store and hybrid store.
A BlobTransferManager is responsible for transferring blobs between two entities, either that Peer to Peer or node to
a blob store and vice versa.
This class is the metadata of a partition in the blob transfer client
POJO to hold the payload for blob transfer related informartion
Class that exposes stats related to blob transfers
This class is the reporting class for stats class
BlobTransferStats
Metrics reporting logics are registered into MetricsRepository
here and send out to external metrics
collection/visualization system.This is an implementation of
DaVinciRecordTransformer
that implements blocking.This
BatchDeserializer
does not do anything asynchronously.Created by acurtis on 10/16/17.
This
PollStrategy
delegates polling to another implementation, and also executes
an arbitrary function during each poll.This interface is meant for users where local state must be built off of the entirety of a venice data set
(i.e.
A map implementation with a bounded size.
This type of specialized
ScatterGatherRequest
does not use keys for routing, and carry some extra state.Enums of the policies used to decide how buffer replay start timestamps are calculated.
A low overhead immutable container of byte[] suitable for use as a map key.
This class is a wrapper of
BinaryDecoder
with the following optimization:
When deserializing byte array field, instead of copying the bytes into a new ByteBuffer
,
this class will create a ByteBuffer, which is wrapping the original array.This class overrides
JsonEncoder.writeBytes(byte[], int, int)
, in order to convert
all bytes schema data to hexadecimal strings in the output stream, since hexadecimal strings
are more readable.This class encapsulates a value from venice storage accompanied by the schema
id that was used to serialize the value.
Utility functions for munging on bytes
N.B.: Most functions taken from Voldemort's ByteUtils class.
CachedDnsResolver
caches the dns entries for hosts, which matches the specified host pattern.Listener used to monitor zk connection state change and refresh venice resource once zk connection is reconnected.
A utility class to chain two completable futures together.
ChangelogClientConfig<T extends org.apache.avro.specific.SpecificRecord>
ChannelInitializer<C extends io.netty.channel.Channel>
A special
ChannelInboundHandler
which offers an easy way to initialize a Channel
once it was
registered to its EventLoop
.The purpose of this class is to execute tasks in series without overlap, even if the
ChannelTaskSerializer.executeTask(Consumer, ChannelFutureListener)
method is called before the completion
if the previous tasks.Properties used by
VeniceChangelogConsumer
to seek
checkpoints.Parent class for various running checksum implementations.
Types of checksum algorithms supported by Venice's Data Ingest Validation.
This class accumulates all mapper values and assemble them to provide assembled complete large values or regular
message.
The
VeniceWriter
, upon detecting an instance of this class being passed to it, will always call
ChunkAwareCallback.setChunkingInfo(byte[], ByteBuffer[], ChunkedValueManifest, ByteBuffer[], ChunkedValueManifest, ChunkedValueManifest, ChunkedValueManifest)
whenever
processing a MessageType.PUT
, whether it is chunked or not.This record is appended to the end of keys in a store-version where chunking is enabled.
This class contains both chunked results and manifest for a specific payload.
This
InputStream
implementation allows us to pass a value made up of many chunks into the Avro decoder
without stitching it up into a single big byte array.This record maintains chunking information in order to re-assemble a value that was split in many chunks.
This record contains chunking information to uniquely identify each chunk.
This interface allows each code path which needs to interact with chunked values
to optimize the way the re-assembly is handled so that the final form in which
the is consumed is generated as efficiently as possible, via the
use of a temporary
ChunkingAdapter
.This class and the rest of this package encapsulate the complexity of assembling chunked values
from the storage engine.
This interface provides methods to split a key into raw key/value byte array and
ChunkedKeySuffix
.Utility class to help in implementing
Measurable.getHeapSize()
.Utilities for Classes / reflection.
ClientConfig<T extends org.apache.avro.specific.SpecificRecord>
Different routing strategy types for client side routing:
1.
A wrapper of
ThreadLocal
for AutoCloseable
objects.Takes care of executing each routine once per cluster.
This class encapsulates business logic which needs to be executed once per cluster the
first time a controller becomes leader of that cluster.
A centralized place to control the locking behavior, such as lock order and lock granularity.
Per-route request metrics.
This class includes the metrics in the cluster-level.
Protobuf type
com.linkedin.venice.protocols.controller.ClusterStoreGrpcInfo
Protobuf type
com.linkedin.venice.protocols.controller.ClusterStoreGrpcInfo
This class handles all operations on a collection field with replication metadata.
This class centralizes the logic of creating a collection replication metadata schema and providing a POJO representation
upon a collection replication metadata generic record.
This class handles merges with replication metadata with collection metadata timestamp
CollectionRmdTimestamp
for collection field.TODO: Merge this with
ControllerRoute
COMMAND (
"long-name",
"short-name",
"description",
"required-args",
"optional-args"
)Common config keys shared by venice-backend and venice-thin-client.
Handler for compacting Espresso headers to a single header entry.
A set that uses complement representation, which is useful when universal set cardinality is unknown.
Compose a CompletableFuture and Callback together to be a
CompletableFutureCallback
type.Created by acurtis on 4/4/17.
One or more variables that together maintain an accumulation.
A
PollStrategy
implementation which takes a queue of many poll strategies.Enums of the strategies used to compress/decompress Record's value
A simple wrapper to throw exception when retrieving failed computation.
An interface to abstract executing and monitoring a compute job running on any batch compute engine
This record only contains the operations and result schema, and keys will be appended after during serialization
Keep track of the progress of a compute request .
This record only contains the operations and result schema, and keys will be appended after during serialization
This record only contains the operations and result schema, and keys will be appended after during serialization
This record only contains the operations and result schema, and keys will be appended after during serialization
This record only contains the operations and result schema, and keys will be appended after during serialization
This class is used by the client to encapsulate the information it needs about a compute request.
This record used to store the response per key
This field will store all the related info for one key
ComputeRouterRequestWrapper
encapsulates a POST request for read-compute from routers.This class provides utilities for float-vector operations, and it also handles
PrimitiveFloatList
transparently to the user of this class.One or more variables that together maintain an accumulation.
A BlockingQueue implementation which uses a collection of concurrent Queues per thread
in order to reduce the hotspot of a highly contended mutex or atomic reference.
Thrown when a config property is invalid or missing
A command to migrate all incremental push stores in a cluster to a specific incremental push policy.
A
ChannelHandler
which regulates the number of open connections by disabling
the parent socket auto-read config when the number of active connections
exceeds the limit value.An abstraction of a receiver of data consumed from a message queue.
The class is used to asynchronously trigger behavior changes in the
StoreIngestionTask
.An Enum enumerating all valid types of
ConsumerAction
.Encapsulates the logic for deciding if a given topic-partition should be unsubscribed and, in that case, executing
the unsubscription.
Type describes all the version attributes
A POJO to store the response content and additional compression related metadata.
Protobuf enum
com.linkedin.venice.protocols.controller.ControllerGrpcErrorType
Dependencies for VeniceControllerRequestHandler
Extend this class to create response objects for the controller
Any fields that must be in all responses can go here.
ControlMessage payloads contain metadata about the stream of data, for validation and debuggability purposes.
A simple enum to map the values of
ControlMessage.controlMessageType
N.B.: We maintain this enum manually because Avro's auto-generated enums do
not support evolution (i.e.: adding values) properly.This exception is thrown when we detect corrupt data in the Kafka stream.
An interface to retrieve metrics related to data writer tasks after a MapReduce job has completed.
CRC32 algorithm as used by Deflate.
A pure-java implementation of the CRC32 checksum that uses
the CRC32-C polynomial, the same polynomial used by iSCSI
and implemented on many Intel chipsets supporting SSE4.2.
Running checksum implementations based on CRC32.
Protobuf type
com.linkedin.venice.protocols.controller.CreateStoreGrpcRequest
Protobuf type
com.linkedin.venice.protocols.controller.CreateStoreGrpcRequest
Protobuf type
com.linkedin.venice.protocols.controller.CreateStoreGrpcResponse
Protobuf type
com.linkedin.venice.protocols.controller.CreateStoreGrpcResponse
This class will add a new version to the given store.
This type of store metadata describes the current states of the store in a given fabric and Venice cluster.
This type of store metadata describes the current version states of the store in a given fabric and Venice cluster.
Runs a daemon thread that consistently sends HTTP GET request to
admin healthcheck endpoint.
This class provides a server manager with start/shutdown methods to be invoked to manage the lifecycle of the server
announcer.
This class is used to find the proper d2 service name for the given store through default D2 service
d2://VeniceRouter.
Deprecated.
D2Client
based TransportClient implementation.A thread factory that sets the threads to run as daemons.
DataModelBackedStructure<T extends org.apache.avro.specific.SpecificRecord>
This class gathers all common data provider patterns in test cases.
DataRecoveryClient is the central class to manage data recovery feature.
DataRecoveryExecutor is the engine to run tasks in data recovery.
DataRecoveryExecutor is the engine to run tasks in data recovery.
This class contains all the logics to:
1.
A Task is wrapper class that designed to execute multiple commands in sequence.
This interface defines all the public APIs, and if you need to add accessors to
some new fields, this interface needs to be changed accordingly.
The data replication policy determines which fabric to use for sending real-time writes from clients and
where to direct servers for real-time data consumption.
This class encompasses all error conditions which are related to the quality of
the data consumed from Kafka.
All the
AccumulatorV2
objects that are used in the Spark DataWriter jobs.An abstraction for executing and monitoring a data writer compute job running on any batch compute engine
The implementation of
DataWriterComputeJob
for MapReduce engine.The default implementation of
AbstractDataWriterSparkJob
for Avro and Vson file input formats.An interface to report and retrieve metrics related to data writer tasks.
DvcBlobFinder discovers live DaVinci peer nodes to facilitate blob transfers necessary for bootstrapping the database
Da Vinci Client to provide key-value lookups in embedded mode
This is a scheduler for sending batching push status in DaVinci.
This abstract class can be extended in order to transform records stored in the Da Vinci Client,
or write to a custom storage of your choice.
Configuration class for
DaVinciRecordTransformer
, which is passed into DaVinciConfig
.This describes the implementation for the functional interface of
DaVinciRecordTransformer
This class encapsulates the result of
DaVinciRecordTransformer.transform(com.linkedin.venice.utils.lazy.Lazy<K>, com.linkedin.venice.utils.lazy.Lazy<V>)
Utility class for
DaVinciRecordTransformer
.This class is used to systematically copy
OffsetRecord
instances rather than
passing them as is.This class is used to systematically copy
OffsetRecord
instances rather than
passing them as is.An implementation of
AsyncFuture
where much of the implementation is based upon
netty's org.jboss.netty.channel.DefaultChannelFuture
.A specialised instance of @{link AsyncFuture} for collecting the results of one or more
futures of lists into a single list.
The default ingestion backend implementation.
Default Kafka VeniceKafkaSerializer
Cloned from
com.linkedin.security.ssl.access.control.SSLEngineComponentFactoryImpl
;
Changes:
1.Default implementation of the
VenicePartitioner
class.Delegating layer for
DaVinciClient
.Inside Fast-Client, we choose to use n-tier architecture style to build a pipeline to separate different
types of logic in different layer.
Delete payloads contain fields related to replication metadata of the record.
This is a simple container class to hold replication metadata related fields together to be passed on to the Delete api in VeniceWriter
VeniceWriter#delete(Object, Callback, LeaderMetadataWrapper, long, Optional)
.Derived schemas are designed for write compute operations.
This class is registered in ZKClient to handle derive schema serialization.
This interface provides method to write map entries in a deterministic order.
An Avro generic datum writer that sorts the map entries by keys before
serializing the map.
An Avro specific datum writer that sorts the map entries by keys before
serializing the map.
DictionaryFetchRequest
encapsulates a GET request to storage/storename/version on the storage node to
fetch the compression dictionary for that version.DictionaryRetrievalService runs in a producer-consumer pattern.
The callback method to disable leader replicas whenever they are in ERROR state so that helix
can elect a new leader.
Protobuf type
com.linkedin.venice.protocols.controller.DiscoverClusterGrpcRequest
Protobuf type
com.linkedin.venice.protocols.controller.DiscoverClusterGrpcRequest
Protobuf type
com.linkedin.venice.protocols.controller.DiscoverClusterGrpcResponse
Protobuf type
com.linkedin.venice.protocols.controller.DiscoverClusterGrpcResponse
DiskHealthCheckService will wake up every 10 seconds by default and run a health check
in the disk by writing 64KB random data, read them back and verify the content; if there
is any error within the process, an in-memory state variable "diskHealthy" will be updated
to false; otherwise, "diskHealthy" will be kept as true.
DiskHealthStats
measures the disk health conditions based on the periodic tests ran by the DiskHealthCheckService
.We define a threshold where we consider the disk to be full.
This class is in charge of routing and serialization/de-serialization.
Deprecated.
This class contains stats for DIV.
This class is the reporting class for stats class
DIVStats
.Linked list implementation of the List interface.
In order to maintain a doubly-linked list, each element needs to establish links to its adjacent elements.
The following class has capability to send dual read requests via Fast-client and Thin-client.
The design consideration to consume in admin tool directly instead of letting controller to
consume the required admin message and return them back to admin tool:
1.
This is a benign
DataValidationException
.The source and destination topic for topic switching are the same topic
A
PollStrategy
implementation which can introduce duplicates.This class is used as the return type of the
CompletableFuture
that is returned
by the Venice producer.A DynamicAccessController is an AccessController with a mutable resource list.
A POJO containing an index and a timestamp for a collection element.
This ControlMessage is sent per partition by each offline incremental push job, once per partition, at the end of a incremental push
This ControlMessage is sent once per partition, at the end of a bulk load, after all of the data producers come online.
This ControlMessage is sent at least once per partition per producer.
An interface to get information about a running job and task on any batch computing engine.
Expects a GetRequestObject which has store name, key, and partition
Queries the local store for the associated value
writes the value (as a byte[]) back down the stack
A task that iterates over store version resources and reset error partitions if they meet the following criteria:
The store version resource is the current version.
The error partition only has exactly one error replica.
For L/F model, using EV to check error replicas misses internal error replica states.
ExceptionType
enum did not have a default deserialization annotation and this makes it non-evolvable till all
clients upgrade to newer versions.An extension of the
Http2FrameCodec
which provides a facility to
be able to block stream creation from clients without incurring any cost
of stream creation on the server side.Utility class to expose package private members.
Forked from Netty's Http2MultiplexHandler (4.1.42)
(https://github.com/netty/netty/blob/4.1/codec-http2/src/main/java/io/netty/handler/codec/http2/Http2MultiplexHandler.java)
In HTTP/1.1, the requests from the client are distributed across 48 workers threads of router.
Forked from Netty 4
Http2StreamChannelBootstrap
to allow using EspressoHttp2MultiplexHandler
in the pipeline.This interface defines all the public APIs, and if you need to add accessors to
some new fields, this interface needs to be changed accordingly.
A container of ETL Enabled Store related configurations.
In ETL jobs, when a "delete" record is seen, "value" is set to "null" and "DELETED_TS" is set to the offset where
this delete record was seen.
A class to throttle Events to a certain rate
This class takes a maximum rate in events/sec and a minimum interval over
which to check the rate.
This interface is used to abstract the strategy to handle the quota exceeding case.
Deprecated.
since v0.2.620
Previously, this enum didn't have the @JsonEnumDefaultValue annotation.
Static utility functions to deal with exceptions.
Accessor user to get/update the last execution id and the generated id from a storage.
Status of executing off-line push.
Factory and utility methods for
Executor
, ExecutorService
,
ScheduledExecutorService
, ThreadFactory
, and Callable
classes defined in this package.An analog to
ExecutorService
except that the futures
returned are AsyncFuture
s.Created by acurtis on 6/12/17.
This record is only used to training a ZSTD dictionary on a small amount of generated avro data
* This class is leveraging linkedin-avro-fastserde-impl to speed up deserialization.
This class is leveraging linkedin-avro-fastserde-impl to speed up serialization,
and so far it doesn't work for Avro-1.4, and it will fail back to regular Avro.
FastAvroSpecificDeserializer<T extends org.apache.avro.specific.SpecificRecord>
This class is leveraging linkedin-avro-fastserde-impl to speed up deserialization.
Only for the sake of accessing a package-private method...
This class encompasses all error conditions which:
1.
The FilterChain class takes a list of
AbstractVeniceFilter
to assemble a filter chain to manage the life cycles'
of filters and perform filtering based on the order of filters.A
PollStrategy
implementation which can pluck out records from the stream.Forked from com.linkedin.databus.core.util @ r293057
If a remote peer is unresponsive due to CPU throttling, GC or otherwise, ForceCloseOnWriteTimeoutHandler closes
the connection on write timeout, the time out by default is 30 sec.
Adapted from https://stackoverflow.com/a/723914/791758
Created by acurtis on 3/22/17.
Created by acurtis on 3/27/17.
An implementation of SslHandler which limits the number of objects held within the
pendingUnencryptedWrites queue.
Created by acurtis on 2/1/17.
This is an abstract class which represents a generic class associating a generated schema id with it's
corresponding value schema id
In Venice, some schemas are generated, by deriving them from value schemas.
GenericChunkingAdapter<V extends org.apache.avro.generic.GenericRecord>
Read compute and write compute chunking adapter
Just for the sake of casting the generic type to
GenericRecord
...GetRouterRequest
encapsulates a GET request to storage/resourcename/partition/key on the storage node for a single-get operation.We need to keep track of each request as it goes through the pipeline so that we can record the necessary metrics
and separate different parts of the logic for the response.
This class is an incomplete copypasta of the logic in
StorageReadRequestHandler.channelRead(ChannelHandlerContext, Object)
.A wrapper around Guava's RateLimiter to provide a common interface for rate limiting.
Utility class for generating GUIDs.
Hadoop-specific utils.
Forked from com.linkedin.databus.core.util @ r293057
HDFSSchemaSource caches the Value schemes and RMD schemas for a given store on the HDFS and serve them during the Mapper
stage.
HealthCheckRequest
encapsulates information of a health-check request from routers.A Guid generator which uses the
UUID.nameUUIDFromBytes(byte[])
.This service monitors heartbeats.
Serializer to adapt venice serializer to PathBasedZkSerializer.
Interface and wrapper for Helix related admin operations needed by Venice when running Helix as a service.
Get routing data from Helix and convert it to our Venice partition and replica objects.
Extend
HelixBaseRoutingRepository
to leverage customized view data for offline push.An implementation of
AbstractPushMonitor
that listens to Helix
Customized View to determine the push statusExtend HelixBaseRoutingRepository to leverage external view data.
This strategy is tried to distribute more load to Helix Groups with more capacity.
This simple strategy will try to distribute the load evenly to every Helix Group.
This class is a thin wrapper of
HelixInstanceConfigRepository
and HelixGroupSelectionStrategy
, and it
will delegate all the related API calls to the corresponding objects.Extend RoutingTableChangeListener to leverage customized view data for hybrid store quota.
This repository is used to store the instance config per instance.
Convert between ZNRecord in Helix instanceConfig and Venice instance.
This class is used to monitor the current live instances(Helix participators).
Venice Participation Service wrapping Helix Participant.
An Enum enumerating all valid types of
HelixPartitionState
.A parent class to access Helix customized partition state, which is different from the states
defined in the state model.
A class for accessing partition offline push and hybrid quota status in Helix customized state (per Helix
instance) on Zookeeper
This class is used to cache live cluster configs.
This class is used to cache store schema and provide various query operations.
This repository provides the read-only access to both system stores and regular stores.
This class is used to fetch all store configs from ZK and cached locally, then provide the way to read those
configs.
This repository provides an read only interface to access both system store and regular venice store.
This repository is used to retrieve all the zk shared system store schemas from the system store cluster.
This repository provides an interface to access zk shared system stores only.
This class is used to modify live cluster configs.
This class is used to add schema entries for stores.
This repository supports the following operations:
1.
Read write store repository which uses Helix as storage.
This repository provides an read-write interface to access both system store and regular venice store.
This strategy will assign an ordering of helix groups via round-robin for even distribution.
States of Venice node in Helix.
The control message changed built on Helix message service.
Helper functions for Helix.
Aggregate all essentials resources required by controller to manage a Venice cluster.
Provides methods to locate the storage node or nodes for database partitions.
This is an interface for monitoring the healthiness of storage nodes.
This class is used to monitor the various reasons for unhealthy hosts.
This class contains stats for stats on the storage node host level.
An implementation of
BasicFullHttpRequest
which always uses a Http1Headers
for
holding the headers.An implementation of
BasicFullHttpResponse
which always uses a Http1Headers
for
holding the headers.An implementation of
BasicHttpRequest
which always uses a Http1Headers
for
holding the headers.An implementation of
BasicHttpResponse
which always uses a Http1Headers
for
holding the headers.This class is ExceptionHandler for all HTTP/2.
Initializes HTTP/2 Pipeline.
Logs the HTTP/2 Settings Frames.
Client-facing exception for HTTP/2 too many active streams exception with a clearer message for the clients.
HTTP/2 utility methods.
Created by acurtis on 3/23/17.
This utility class could construct a HttpClient5 based R2 Client.
This class is used to provide a facility to initialize httpclient5 lib based Http/2 Client.
Created by acurtis on 3/22/17.
This _only_ knows how to aggregate HttpMultiPart messages and "bad things"(tm) will occur when it encounters other
types of messages.
Created by acurtis on 4/26/18.
Maps the provided HTTP response status
HttpResponseStatus
to one of
1xx, 2xx, 3xx, 4xx, 5xx categories.An abstract handler which may be used to examine latencies in the Netty4 HTTP request processing.
Created by mwise on 3/11/16.
Created by acurtis on 3/23/17.
CloseableHttpAsyncClient
based TransportClient implementation.This will be used to stash reoccurring common bits of code.
Created by acurtis on 9/28/17.
This interface defines all the public APIs, and if you need to add accessors to
some new fields, this interface needs to be changed accordingly.
A container of Hybrid Store related configurations.
Hybrid store quota status response for a resource; this is a response
that will be returned by Router.
An interface for implementation of IC( invocation-context) provider class for calls between various deployable services.
An interface that provides a method to parse identity information from a certificate.
This class is used as an identity function to return the same bytes that were passed in the input to the output.
A record reader that returns the input key and value as is.
This exception is thrown when we detect a new producer which did not start with a
ControlMessageType.START_OF_SEGMENT
.In order to dump the HTTP headers when there is a decode failure, surround the HTTP decode handler with these
handles...example:
This exception is thrown when the server reads data from same segment after the segment is ended.
This is a fork of the standard
HashMap
which implements
IndexedMap
.The IndexedMap interface implements Map and several functions for dealing with its
content via the index corresponding to their insertion order.
The IndexedMap interface implements Map and several functions for dealing with its
content via the index corresponding to their insertion order.
This class is used to process the AA/WC messages in parallel to address the long-tail partition lagging issues.
IngestionCommandType is an Enum class for specifying different commands for ingestion isolation.
IngestionComponentType is an Enum class for specifying different components for graceful shutdown in forked ingestion process.
IngestionMode is an Enum class that contains all modes for ingestion.
IngestionReportType is an Enum class for specifying different ingestion reports for ingestion isolation.
This class contains stats for store ingestion.
This class is the reporting class for stats class
IngestionStats
.This throttler has the following functionality:
1.
In memory implementation of
AdminTopicMetadataAccessor
, should only be used for testing purposes.In memory implementation of
ExecutionIdAccessor
, should only be used for testing purposes.Used in unit tests in order to avoid spinning a full Kafka broker with network stack
and disk IO.
A single Kafka message, strongly typed for the types that Venice uses.
In memory implementation of OffsetManager, should really only be used for tests
A simple non-persistent, in-memory store.
In memory implementation of StorageMetadataService, should really only be used for tests
An in-memory hashmap implementation of a storage partition
Assumptions:
1.No need to worry about synchronizing write/deletes as the model is based on a single writer.
This interface lets users get input data information
A POJO that contains input data information (schema information and input data file size)
This class is used to keep track of store storage quota and storage overhead ratio and check whether the total
input data size exceeds the quota
Class defines the Instance in Venice.
The class is used to measure the healthiness about the cluster the store belongs to.
This utility class provides functions to measure the heap size of objects for a limited number of classes.
Status of the instance in Venice cluster.
This class is not Thread-safe.
InternalAvroSpecificSerializer<SPECIFIC_RECORD extends org.apache.avro.specific.SpecificRecord>
Serializer for translating a versioned protocol of Avro records.
This class includes some necessary functions to deal with certain metric-handling activities that only
the client implementation can be aware of.
All the internal implementations of different tiers should extend this class.
Abstract class for Fast-Client transport layer implementation, and this layer is mostly decided which methods
need to be implemented at different stages.
Implementation of the
VenicePartitioner
class for test purpose.This class is the implementation of ingestion backend designed for ingestion isolation.
IsolatedIngestionProcessStats is a metrics collecting class that aims to collect metrics from isolated ingestion process.
This class sends requests to monitor service in main process and retrieves responses.
This class is the server service of the isolated ingestion.
This is the handler class for
IsolatedIngestionServer
.This class contains methods used for communication between ingestion client and server.
Utility methods for
Iterator
s.A Guid generator which internally uses
UUID.randomUUID()
.Utility methods for JDBC.
Hash algorithm by Bob Jenkins, 1996.
Response for querying job status.
This class is used to throttle records consumed per Kafka cluster
Custom Exception for Venice Kafka Consumer Related Issues.
KafkaConsumerService
is used to manage a pool of consumption-related resources connected to a specific Kafka
cluster.This consumer assignment strategy specify how consumers from consumer pool are allocated.
This delegator impl is used to distribute different partition requests into different consumer service.
This class provides the stats for Kafka consumer service per region or per store.
This class is a library that can validate the Kafka message during consumption, which can be used in
Venice-Server/Da-Vinci and ETL.
Zstd dict trainer for Kafka Repush.
We borrowed some idea from the open-sourced attic-crunch lib:
https://github.com/apache/attic-crunch/blob/master/crunch-kafka/src/main/java/org/apache/crunch/kafka/record/KafkaInputFormat.java
This
InputFormat
implementation is used to read data off a Kafka topic.This class is a Combiner, which is a functionality of the MR framework where we can plug a
Reducer
implementation to be executed within the Mapper
task, on its output.This class is used to support secondary sorting for KafkaInput Repush.
The protocol between Mapper and Reducer with KafkaInputFormat for key field
The data protocol between Mapper and Reducer for KafkaInputFormat
This class is used for KafkaInput Repush, and it only considers the key part of the composed key (ignoring the offset).
We borrowed some idea from the open-sourced attic-crunch lib:
https://github.com/apache/attic-crunch/blob/master/crunch-kafka/src/main/java/org/apache/crunch/kafka/record/KafkaRecordReader.java
This class is used to read data off a Kafka topic partition.
We borrowed some idea from the open-sourced attic-crunch lib:
https://github.com/apache/attic-crunch/blob/master/crunch-kafka/src/main/java/org/apache/crunch/kafka/record/KafkaInputSplit.java
InputSplit that represent retrieving data from a single
TopicPartition
between the specified start
and end offsets.This class together with
KafkaInputKeyComparator
supports secondary sorting of KafkaInput Repush.Class which stores the components of a Kafka Key, and is the format specified in the
KafkaKeySerializer
.VeniceKafkaSerializer
to encode/decode KafkaKey
for Venice customized kafka message
Used by Kafka to convert to/from byte arrays.Assumes: One to One mapping between a Venice Store and Kafka Topic.
This class contains logic to dump Venice Kafka topics.
Serializer for the Avro-based kafka protocol defined in:
KafkaMessageEnvelope
This class needs to be defined explicitly, rather than just instantiating a
InternalAvroSpecificSerializer
because it is used as a config passed to
the Kafka producer.A helper class to return the same lock for the same raw key bytes.
Deprecated.
, use
Gauge
instead.A dummy implementation which simply allocates a new object on every
ObjectPool.get()
and lets the garbage collector
dispose of it rather than reusing.Created by acurtis on 3/29/17.
This utility provides lazy initialization for a wrapped object.
Fork of the
LazyInitializer
with some additional APIs.Completion state of the leader partition.
Protobuf type
com.linkedin.venice.protocols.controller.LeaderControllerGrpcRequest
Protobuf type
com.linkedin.venice.protocols.controller.LeaderControllerGrpcRequest
Protobuf type
com.linkedin.venice.protocols.controller.LeaderControllerGrpcResponse
Protobuf type
com.linkedin.venice.protocols.controller.LeaderControllerGrpcResponse
This class used to override
VeniceNotifier.catchUpVersionTopicOffsetLag(String, int)
to
release the latch when the resource is the current version.Venice partition's state model to manage Leader/Follower(Standby) state transitions.
A leader session id checker will be created for each consumer action;
server checks whether the session id is still valid before processing
the consumer action.
State Transition Handler factory for handling Leader/Follower resources in the storage node.
The leader follower state thread pool strategy specifies how thread pools are allocated for Helix state transition.
This class contains the state transition work between leader and follower; both leader and follower
will keep track of information like which topic leader is consuming from and the corresponding offset
as well as the latest successfully consumed or produced offset in the version topic (VT).
This is a simple container class to hold Leader metadata related fields together to be passed on to the Put and Delete api in VeniceWriter
Caller should construct an instance of this object by properly filling up all the fields of this object.
This class holds all the necessary context information which is passed from
kafka consumer thread -> kafka producer callback thread -> drainer thread.
General purpose object reference leak detector with recovery.
LeakedPushStatusCleanUpService will wake up regularly (interval is determined by controller config
ConfigKeys.LEAKED_PUSH_STATUS_CLEAN_UP_SERVICE_SLEEP_INTERVAL_MS
), get all existing push
status ZNodes on Zookeeper that belong to the specified cluster, without scanning through the replica statuses, find
all leaked push status and delete them on Zookeeper.LeakedResourceCleaner is a background thread which wakes up regularly
to release leaked resources from local disk.
LeakedResourceCleanerStats
records the occurrences of store resources get removed by LeakedResourceCleaner
.This strategy is trying to route the request to the least-loaded instances.
This class provides the unified way to manager venice listener.
Service that listens on configured port to accept incoming GET requests
This is a wrapper class on top of InternalBootstrappingVeniceChangelogConsumerImpl.
LocalThreadEventLoopGroup<E extends io.netty.channel.MultithreadEventLoopGroup>
A specialised
EventLoopGroup
which the MultithreadEventLoopGroup.next()
method prefers to return
an EventLoop
for the current thread, if the current thread is a thread of the wrapped
MultithreadEventLoopGroup.Provides a configured
VeniceChangelogConsumer
instance.Logs the Notification received.
This metric class is to optimize for high write throughput, low read throughput measurement use case instead of real-time
measurement.
This class is a stats generator for dds router framework to record scattering/gathering
related metrics.
This class is the listener service in main process which handles various kinds of reports sent from
isolated ingestion service.
This class is the handler class for
MainIngestionMonitorService
.This class sends requests to isolated ingestion process and retrieves responses.
MainIngestionStorageMetadataService is an in-memory storage metadata service for
IsolatedIngestionBackend
.This class manages the ingestion status for each partition for a given topic when isolated ingestion is enabled.
MapOrderPreservingDatumReader
converts map type serialized items into instances with
a consistent ordering of entries.MapOrderPreservingDeserializer
is a Avro deserializer using MapOrderPreservingDatumReader
.Factory to create fast serializer/deserializer of supporting map ordering.
MapOrderPreservingGenericDatumWriter
converts map type instances to in-memory serialized format
* with a consistent ordering of map entries.This class is a factory that creates
MapOrderPreservingSerializer
and MapOrderPreservingDeserializer
with given schemas and cache them.MapOrderPreservingSerializer
is AvroSerializer
that maintains a consistent ordering of map entries by
using MapOrderPreservingGenericDatumWriter
and MapOrderPreservingSpecificDatumWriter
.MapOrderPreservingSpecificDatumWriter
converts map type instances to in-memory serialized format
with a consistent ordering of map entries.An implementation of
EngineTaskConfigProvider
to get information about a running job on the MapReduce engine.Running checksum implementation based on BouncyCastle's implementation of MD5.
A subclass of
LinkedHashMap
which does a best-effort attempt at guessing its size on heap + the size of its
values.This class is a generic implementation of a memory bound blocking queue.
API for merging existing data with incoming writes.
This class handles byte-level merge.
The workflow is
Query old replication metadata.
An object to encapsulate the results of conflict resolution to denote how the operation and value that should be
applied or if the current update should be ignored.
This wrapper is used to keep the context after handling Active/Active messages.
Implementations of the API defined in
Merge
based on V1 metadata timestamp Schema generated by
RmdSchemaGeneratorV1
.The purpose of this interface is to extract common merge logic.
A simple enum to map the values of
KafkaMessageEnvelope.messageType
N.B.: We maintain this enum manually because Avro's auto-generated enums do
not support evolution (i.e.: adding values) properly.MetadataAggregator is a custom HttpObjectAggregator that
only aggregated HttpResponse messages for metadata.
MetadataFetchRequest
encapsulates a GET request to /metadata/storename on the storage node to fetch metadata
for that node.This MetaDataHandle is used to handle the following meta data requests:
1.
This class stores all the information required for answering a server metadata fetch request.
This record will store version properties, key & value schemas, and routing information
MetadataUpdateStats records metrics related to storage metadata update via
MainIngestionStorageMetadataService
All the data types supported in meta system store.
This class serves as the shadow read handler of the meta system store.
This class is to write metadata: store properties/key schema/value schemas/replica statuses to meta system store.
Collection of all available methods that a
Principal
can perform.Metric entity class to define a metric with all its properties
Operational state of a metric.
create the metrics/Sensors
Utility functions to help create common metrics repository.
Utility functions to help test metrics
Metric type enum to define the type of metrics Venice supports via OpenTelemetry
Metric Unit enum to define list of Units supported for metrics
This exception is thrown when we detect missing data in the Kafka stream.
A mock time that takes a list of synthetic timestamps in MS and return them one by one in a circular manner.
A
PubSubConsumerAdapter
implementation which reads messages from the InMemoryKafkaBroker
.A
PubSubProducerAdapter
implementation which interacts with the
InMemoryKafkaBroker
in order to make unit tests more lightweight.A mock
AuthorizerService
implementation to help in unit test.All MapReducer job counters (identified by a group name + a counter name) are defined here and this util class
provides method to increment counters and read counters
Bundle counter group name and counter name in this POJO
Message holder object for use with lazy logging.
This field will store all the related info for one record
This field will store all the related info for one key
MultiGetRouterRequestWrapper
encapsulates a POST request to storage/resourcename on the storage node for a multi-get operation.Keep track of the progress of a multi-key requests - batch get and compute.
represents a key requested on a route and tracks info related to it but specific to the route
MultiKeyRouterRequestWrapper
is a base class for a multi-key operation.Created by mwise on 5/3/16.
ThreadFactory which creates named threads for use in an ExecutorService.
A thread factory that sets the threads to create threads with a specific prefix name.
A repository of
StoreName
and StoreVersionName
, which are intended to be used as shared instances.Venice in-house implementation of a read only metadata repository where callers can retrieve various metadata such as
Store objects and their corresponding schemas.
This class was copied from
ChunkedWriteHandler
, and the only change we made here is to
using ConcurrentLinkedQueue
for NettyChunkedWriteHandler.queue
since NettyChunkedWriteHandler.resumeTransfer()
or NettyChunkedWriteHandler.discard(Throwable)
could happen in any sequence with any parallelism because of scatter/gather logic in Venice Router.Netty based P2P blob transfer manager implementation.
Created by acurtis on 3/30/17.
Response to represent the status of a node
Locking is not necessary for
NoopCompressor
, so this class overrides all the public APIs to avoid locking.No Op validator.
Warns that a class is not threadsafe.
This iterator traverses the entire backing iterator, while skipping over null entries.
This class is in charge of managing access and maintenance of a VeniceStoreCache.
An interface to get and give back objects that are intended to be long-lived and recycled, but where the
location in the code for beginning to use the object is far from the location where we stop using it, e.g.
Accessor to execute the CURD operations for offline push and its replicas statuses.
Class stores all the statuses and history of one offline push.
Serializer used to convert the data between
OfflinePushStatus
and json.Enum of strategies used to decide the when the data is ready to serve in off-line push.
This class records the offset for every pair(topic,partition) this node is responsible for.
If OffsetRecord is initialized with a serializer that contains SchemaReader, old version of server codes
will be able to deserialize OffsetRecord that is serialized with a newer protocol version, which can happen
after rolling back a server release with new protocol version to an old server release with old protocol version.
Look up online instances related to a topic's partition.
An implementation of a VeniceProducer suitable for online applications.
This class is used to optimize bytes field decoding.
This class is to reuse the original array for put payload of each message
to reduce the unnecessary byte array allocation.
wraps raw bytes into an HTTP response object that HttpServerCodec expects
Peer-to-Peer (P2P) Blob Transfer Manager.
The client-side Netty handler to process responses for P2P file transfer.
The server-side Netty handler to process requests for P2P file transfer.
The client-side Netty handler to process responses for P2P metadata transfer.
A simple container for a pair of (2) values.
Deprecated.
Instead of this, please create a dedicated class with well-named, non-generic (potentially primitive) properties and getters.
This class is a utility class for Parent Controller store update logics.
Enum representing the state of the region where the parent controller resides.
Interceptor to verify that the parent controller is active before processing requests within its region.
This class is used to track the thread pool stats for the state transitions of the participant.
A helper class that wraps the readers and writers for the participant stores.
Class defines the partition in Venice.
This class represent the assignments of one resource including all of assigned partitions and expected number of
partitions.
This class is used to maintain internal state for consumption of each partition.
This immutable class holds a association between a key and value and the source offset of the consumed message.
PartitionDispatchHandler<H,P extends ResourcePath<K>,K,HTTP_REQUEST extends BasicRequest,HTTP_RESPONSE,HTTP_RESPONSE_STATUS>
This interface defines all the public APIs, and if you need to add accessors to
some new fields, this interface needs to be changed accordingly.
Provides methods to find the partition name for a piece of data.
Resource level partition health stats.
This class is for wrapping the information about the role of the partition replica on that host to
AggKafkaConsumerService
to achieve finer granularity of consumer assignment.Sort the rows based on the key and value in ascending order using unsigned byte comparison.
This record holds the state necessary for a consumer to checkpoint its progress when consuming a Venice partition.
Class stores the status of one partition including all the replicas statuses in this partition.
An implementation of
AbstractPushMonitor
that listens to ZK PartitionStatus
to determine the push status.Serializer used to convert the data between
PartitionStatus
and json.Listener used to listen the data change of partition status.
This class maintains state about all the upstream producers for a given partition.
PartitionWiseKafkaConsumerService
is used to allocate share consumer from consumer pool at partition granularity.The implementation of this method is mostly similar to DefaultChannelFuture except that
we store the Channel inside a CompletableFuture because it is not yet known at the time
of construction.
The implementation of this method is mostly similar to DefaultChannelFuture except that
we store the Channel inside a CompletableFuture because it is not yet known at the time
of construction.
This class maintains a simple counter for capacity management.
Thrown by the StorageEngine or Storage Partitions if storage fails
Enums of persistence types in Venice.
A class to represent a Persona, a relationship between stores and a quota system that is enforced.
This interface is used inside of the
MockInMemoryConsumer
in order to mess around with the way messages are delivered to the consuming code.This interface exposes the commonalities between the Apache httpasyncclient and Netty and R2 client.
Utility methods for precondition checks.
Functor interface for performing a predicate test on
GenericRecord
Represents an actor entry.
Utility methods for Processes.
A record containing the state pertaining to the data sent by one upstream producer into one partition.
PropertyBuilder is the suggested Way to construct VeniceProperties.
An adapter for PubSubAdmin to create/delete/list/update topics.
Generic admin factory interface.
A wrapper around pub-sub producer, consumer, and admin adapter factories
This will be passed as one of the arguments to the component which depends on the pub-sub APIs.
Constants used by pub-sub components.
The purpose of this class is only to make the package private test-only functions of
PubSubConstants
available to all packages within tests.An adapter for consuming messages from a Pub-Sub topic.
Generic consumer factory interface.
Uses
AvroCoder
to encode/decode PubSubMessage
s.The class for deserializing messages from the pubsub specific message format to
PubSubMessage
A key-value pair that is associated with a message
Set of key-value pairs to tagged with messages produced to a topic.
Used when an operation against PubSub failed to complete in time.
Represents a position of a message in a partition of a topic.
Factory class for creating PubSubPosition objects from wire format
Constants for the different types of positions
Do not change the values of these constants.
The pub-sub producer interface with which venice writer's interact to send messages to pub-sub topic.
This class will spin up multiple producer instances to speed up the producing.
Generic producer factory interface.
A callback interface that users of PubSubProducerAdapter should implement if they want
to execute some code once PubSubProducerAdapter#sendMessage request is completed.
A simple implementation of PubSubProducerCallback interface for testing purposes.
An interface implemented by specific PubSubProducerAdapters to return the result of a produce action.
This enum is equivalent to Kafka's SecurityProtocol enum.
Class for all Venice exceptions that are triggered by Kafka topic authorization related issues.
Represents a
PubSubTopic
configuration.The source or destination topic for the replication request does not exit
This class is a simple schema builder to generate Venice Push Job input file schema.
Different successful checkpoints and known error scenarios of the VPJ flow.
0 => STARTED, 1 => COMPLETED, 2 => ERROR, 3 => NOT_CREATED, 4 => UNKNOWN, 5 => TOPIC_CREATED, 6 => WRITE_TO_KAFKA_COMPLETED, 7 => KILLED, 8 => END_OF_PUSH_RECEIVED, 9 => START_OF_INCREMENTAL_PUSH_RECEIVED, 10 => END_OF_INCREMENTAL_PUSH_RECEIVED
This interface provides methods to send push job heartbeats
This interface provides methods to let users create a heartbeat provider
This class carries the state for the duration of the VenicePushJob.
Response for uploading a job status record.
In Venice, Push is a task that asynchronously writes data to Venice.
This class maintains the mapping of Kafka topic to each
PushMonitor
instance and delegates calls to the
correct instance.This enum controls the behavior of how
PushMonitorDelegator
chooses
proper PushMonitor for resources.This class contains some common util methods for push monitoring purpose.
This class serves as a collector of offline push status for both Venice Server and Da Vinci clients.
Decide the offline push status by checking all replicas statuses under different offline push strategies.
Notifies both OfflinePushStatus and Helix Customized View
Push job status response for a resource; this is a response
that will be returned by Router.
This class is a helper class for Venice controller to read PushStatus / Heartbeat messages.
Helper class for PushStatusStore related logic.
Wrapper class for acquiring VeniceWriter for push status stores.
PushStatusStoreWriter is a helper class for Da Vinci to write PushStatus and heartbeat message into PushStatus store
real-time topic.
Put payloads contain a record value, and information on how to deserialize it.
This is a simple container class to hold replication metadata related fields together to be passed on to the Put api in VeniceWriter
VeniceWriter.put(Object, Object, int, Callback, LeaderMetadataWrapper, long, PutMetadata)
.A simple container with builder functions to sugar-coat the code a bit.
Imported this implementation from netty 3.2.4 because the implementation within 3.5.11 will corrupt
uri paths, converting ';' semicolons into '&' ampersands.
A tool use thin client to query the value from a store by the specified key.
R2 based Transport Client Interface.
Created by acurtis on 12/12/16.
A simple
PollStrategy
which delivers messages from any partition, while respecting the
ordering guarantee of individual partitions.A ChunkingAdapter to be used when we want to read the value from storage engine as raw bytes.
A POJO containing a byte array and a serialized
ChunkedKeySuffix
or an object of ChunkedKeySuffix
TDOO: Consider merging with AvroProtocolDefinition
This class provides support to deserialize customized records even the input doesn't contain
the full record.
Exception when the deserialization hits partial record.
This class contains two fields:
1.
A read-only wrapper of
Store
, and all the modification to delegated instance
will throw UnsupportedOperationException
.A read-only wrapper of
Version
Interface defined the way to retrieve the store config from a repository.
Interface defined readonly operations to access stores.
A type of
Persona
that enforces a read (bandwidth) limit across multiple stores.This class define the throttler on reads request.
This is used to store common fields shared by various read responses.
This class is used to accumulate stats associated with a read response.
This class is used to record stats associated with a read response.
Enums of the strategies used to read data from multiple replications in Venice.
Interface defined read and write operations to access stores.
Class which implements the process of switching from a batch topic (e.g.
Schema which describes a record change event in a Venice store.
TODO: This class is a bit of a hack.
Utilities for reflection
TODO This class may not be needed if we decide to proceed with Guice library
for reflections and need to be discarded then
RelayNotifier is a VeniceNotifier which takes in a target VeniceNotifier and forward all actions to the target notifier.
Singleton class meant to keep track of subscription failures for ingestion tasks where the ingestion
task needs to keep running despite cluster failures.
Replica is the basic unit to distribute data, replica is belong to a partition and running in a instance.
Class stores replica status and history.
This
ReplicationMetadataRocksDBStoragePartition
is built to store key value pair along with the timestamp
metadata.This class is registered in ZKClient to handle Replication metadata schema serialization.
An interface to report and retrieve metrics related to data writer tasks during the execution of a MapReduce job.
Store metadata class that uses the server's endpoint to fetch metadata and keep the local cache up to date.
Created by mwise on 3/16/16.
This class is used to include all the intermediate fields required for the communication between the different tiers.
Provide the appropriate headers based on the request type for all remote Venice clients.
Represents a resource identified by "name".
Assignments for all of resources.
This class maintains a map from resource of a certain type to its lock.
This interface defines the API to track the read usage per resource.
A registry to manage
Shutdownable
resources.Factories must implement this interface.
Resources which implement
ResourceRegistry.ShutdownFirst
will be shut down during the first phase.Resources which implement
ResourceRegistry.ShutdownLast
will be shut down during the last phase.Resources which implement
ResourceRegistry.Sync
will be shut down after preceding resources have finished shutting down
and further resources will shut down after it has completed shutting down.Created by acurtis on 6/19/17.
This class is mostly used to trigger retry in the following scenarios:
1.
TODO: make retry work for compute request.
A non-thread safe counter that should be used and increased in a single thread env when the increment is called
This class implements a stream filter for writing compressed data in the GZIP file format.
This is an util class that contains constants related to replication metadata.
Container for the deserializers of a single store.
This holds the schema to perform conflict resolution on records when samza produces data in
active active replication mode.
This is simply a wrapper class to delegate the actual schema generation to version specific adapter.
This class helps in generating a active-active metadata schema to hold update timestamps and offsets from a value schema of a store .
This class is responsible for serialization and deserialization related tasks.
Type of the replication metadata timestamp.
Ths class is responsible for deserializing RMDs and extract some information from it.
A POJO class that contains 3 things/fields:
1.
Class that exposes RocksDB memory consumption stats based on all properties
that are made available in https://github.com/facebook/rocksdb/blob/master/include/rocksdb/db.h#L870
Properties exist on a per RockDB database basis (equivalent to a Venice partition).
Check
TickerType
to find more details about RocksDB metrics.In
RocksDBStoragePartition
, it assumes the update(insert/delete) will happen sequentially.This class is used to throttle RocksDB operations.
This class stores the stats for combined all type(SINGLE/MULTI/COMPUTE etc) of Router requests to the HttpAsync client compared
to
RouteHttpStats
which stores only per type stats.Router4Impl<C extends io.netty.channel.Channel>
Router4PipelineFactory<C extends io.netty.channel.Channel>
This push monitor is able to query hybrid store quota status from routers
This push monitor is able to query push job status from routers; it only works for
stores running in Leader/Follower mode and it will be built for STREAM_REPROCESSING job.
Router based implementation of
StoreSchemaFetcher
.Serializer used to convert data between RouterClusterConfig Object and JSON string
Exception class for checked exceptions within the Espresso Router.
Right now, there is no simple way to capture all the exception cases with DDS Router.
RouterHeartbeat
is a service that monitors and reports the health of current live instances in the Venice cluster.RouterKey
encapsulates the required information for a key in a router request.List all Metric entities for router
RouterRequest
is an abstract base class for single-get and multi-get operations.Monitors the stream, when it gets enough bytes that form a genuine object,
it deserializes the object and passes it along the stack.
Cluster level metadata for all routers.
This interface define the operations used to manage all routers in a cluster.
Store-level access control handler, which is being used by both Router and Server.
Repository to access routing data like Partition and replica.
Enums of strategies used to route the key to partition in Venice.
A utility class to convert Spark SQL Row to an Avro GenericRecord with the specified schema.
A simple filter
Runnable
which will only invoke the contained runnable no more than once.A wrapper around
HelixDataAccessor
which disables the unsafe functionality.A wrapper around
HelixManager
which disables the unsafe functionality.Various methods for stopping/exiting a Samza task.
Collection of ScatterGatherRequests for a request to the router.
ScatterGatherHelper<H,P extends ResourcePath<K>,K,R,BASIC_HTTP_REQUEST extends BasicRequest,HTTP_RESPONSE,HTTP_RESPONSE_STATUS>
ScatterGatherHelper.Builder<H,P extends ResourcePath<K>,K,R,HTTP_REQUEST extends BasicRequest,HTTP_RESPONSE,HTTP_RESPONSE_STATUS>
Created by acurtis on 6/20/17.
Routing information for part of a request which the router has mapped to a single storage node.
ScatterGatherRequestHandlerImpl<H,P extends ResourcePath<K>,K,R,CHC,BHS extends BasicRequest,HR,HRS extends io.netty.handler.codec.http.HttpResponseStatus,SCATTER_GATHER_HELPER extends ScatterGatherHelper<H,P,K,R,BHS,HR,HRS>>
An analog to
ScheduledExecutorService
which extends ExecutorService
.Try to adapt the to the .
Used to cache the toString of a given schema, since it is expensive to compute.
Evaluate the compatibility between a reader schema and a writer schema.
Identifies the type of a schema compatibility result.
Provides information about the compatibility of a single reader and writer schema pair.
This class is used to store all the schemas related to a given store:
1.
SchemaEntry
is composed of a schema and its corresponding id.This class to use to serialize/deserialize Zookeeper node
Throw this exception when the schema is not fully compatible with the previous ones.
This class helps checks presence of a schema version in ZK.
This interface is considered as an internal interface to Venice codebase.
A high-level interface as a source specifically for RmdSchema.
This is a port of the com.linkedin.security.netty.ssl.access.control.SecureClientHandler class as
a netty 4 implementation.
A segment is a sequence of messages sent by a single producer into a single partition.
Interface of class that is used to keep track of push job details sent to the Venice controller.
This store buffer services maintains two separate drainer queues for store ingestions.
For one given store, the client only needs the followings:
1.
Together with
ServerStoreAclHandler
, Server will allow two kinds of access pattern:
1.ServerHttpRequestStats
contains a list of counters in order to mainly measure the performance of
handling requests from Routers.A wrapper that holds reference for various repositories responsible for constructing metadata responses upon request.
For measuring quota requests and rejections for a given store.
We need to be able to record server side statistics for gRPC requests.
Together with
ServerAclHandler
, Server will allow two kinds of access pattern:
1.This interface is used to announce the service to a Service Discovery system on startup and de-announce the service
on graceful shutdown.
This class is responsible for retrying the registration of a
ServiceDiscoveryAnnouncer
in the case of registration failure.This class is intended to only be used in Controller, which is sharing one instance across
all the clusters.
This class is intended to only be used in Controller, which is sharing one instance across
all the clusters.
A class containing shared kafka producer related configurations.
A factory class to create shared objects that need to release resources cleanly.
ShutdownableEventLoopGroup<E extends io.netty.channel.EventLoopGroup>
Created by acurtis on 3/30/17.
An
ExecutorService
interface which also extends the
Shutdownable
interface.A
ScheduledExecutorService
interface which also extends the
Shutdownable
interface.SimpleChannelInitializer<C extends io.netty.channel.Channel>
Created by acurtis on 4/27/18.
A VERY simple JSON mapper.
This
VenicePartitioner
implementation takes the first byte of the key,
and computes the partition as such:
firstByte % numPartitions
It is meant to be used in tests where we want to control the partition that a
key belongs to, but without needing to work out a proper input that yields the
desired partition when hashed by the
DefaultVenicePartitioner
.A simple implementation of PubSubProduceResult interface for testing purposes.
Single get chunking adapter
Created by acurtis on 3/30/17.
Created by acurtis on 3/30/17.
Field accessor using Reflection.
Helper class for the
SlowFieldAccessor
implementation.This class provides thread-local variables with
SoftReference
semantics.An implementation of
AbstractInputRecordProcessor
for Spark that processes input records from the dataframe
and emits an Iterator
of Row
with DEFAULT_SCHEMA
as the schema.A Factory class to create individual
SparkInputRecordProcessor
for processing records in a Spark Dataframe.Spark partitioning functionality in Dataframe and Dataset APIs is not very flexible.
Utility methods for restricting Scala-ism in Spark code in a single class.
A
List
implementation with some usability improvements around resizing.A very simple subclass of
SparseConcurrentList
which adds some immutable offset to all index.SpecificRecordChunkingAdapter<V extends org.apache.avro.specific.SpecificRecord>
This class is necessary because Venice needs to maintain a separate interface:
AvroSpecificStoreClient
.This class is necessary because Venice needs to maintain a separate interface:
AvroSpecificStoreClient
.Created by acurtis on 9/7/17.
Builds a new instance of
SSLContext
object from given key store and trust store parameters.Created by acurtis on 4/19/18.
Factory interface to create
SSLEngine
and get SSLSessionContext
objects for Netty4 pipelinesSSL Factory interface to get common SSL objects.
Venice SSL Factory interface.
Created by acurtis on 9/7/17.
This class is used to configure TLS for Venice components in integration tests.
This handler closes the connection when a ReadTimeoutException occurs.
This enum is used to denote why a particular store version was found to be stale
This ControlMessage is sent by the Controller, once per partition, after the EndOfPush ControlMessage, in Hybrid Stores that ingest from both offline and nearline sources.
This ControlMessage is sent per partition by each offline incremental push job, once per partition, at the beginning of a incremental push.
This ControlMessage is sent once per partition, at the beginning of a bulk load, before any of the data producers come online.
This ControlMessage is sent at least once per partition per producer.
This class notifies the Helix State Models (SM) about corresponding ingestion progress.
A StaticAccessController is an AccessController with an immutable resource list.
Currently, we only expose metrics for single-get and batch-get requests, and if there
is a need to have metrics for other request types, we can add them later.
This class is in charge of all the metric emissions per request.
This enum tracks the error codes that we use to report anomalies in metrics.
This class is used to handle all the metric related logic.
Venice control message which is used to transfer command and status between storage node and controller, so that
controller could control the whole cluster and make the global decision.
Channel used to send and receive control message.
Handler to deal with incoming
StatusMessage
instances.The snapshot of status change.
Created by acurtis on 12/12/16.
An abstraction that represents the shared resources of a persistence engine.
This exception indicates that the server was unable to initialize on or more
storage services or stores within a service.
StorageEngineMetadataService is wrapper service on top of storageEngineRepository to serve read/write to storage metadata.
A wrapper class that holds all the server's storage engines.
thrown by StorageEngineFactory and AbstractStorageEngine when storage creation fails
This is a superset of the OffsetManager APIs, which also provide functions for storing store-version level state.
Class used to represent the status of storage node.
Storage partition level config, which could be used to specify partition specific config when
initializing/opening the corresponding
AbstractStoragePartition
.This class maintains in-memory partition usage.
This exception indicates that the server was unable to initialize on or more
storage services or stores within a service.
A type of
Persona
that enforces a storage (size) limit across multiple stores.Zookeeper accessor for
StoragePersona
objects, allowing for read/write to persona paths.Serializer used to convert the data between
Persona
and json.This class annotates the constructor for
StoragePersona
and serves as a property-based creator.Cached wrapper for the
StoragePersonaAccessor
class, which keeps a map of store names to personas that
contain them, as well as caches Zookeeper personas.Helper class which listens to changes from a
ReadOnlyStoreRepository
and updates the caches and ZK of the
StoragePersonaRepository accordingly.StorageReadRequestHandler
will take the incoming read requests from routerRouterRequest
, and delegate
the lookup request to a thread pool StorageReadRequestHandler.executor
, which is being shared by all the requests.Storage interface to Venice Server, Da Vinci and Isolated Ingestion Service.
This class has the following responsibilities:
1.
This interface defines all the public APIs, and if you need to add accessors to
some new fields, this interface needs to be changed accordingly.
This type of metadata contains various store properties/configs and describes the clusters that store is materialized in.
StoreAwarePartitionWiseKafkaConsumerService
is used to allocate share consumer from consumer pool at partition
granularity.This service is in charge of cleaning up backup versions based on retention policy defined on store basis.
This class is serving as a
PubSubMessage
buffer with an accompanying pool of drainer threads.This type describes the various Venice cluster information for a store
Configurations of a store which are non-cluster specified.
This is a class used to manage multiple replicas of the same store.
Interface used to register into metadata repository to listen the change of store data.
The graveyard used to keep all deleted stores.
This service is in charge of removing stores under /venice//StoreGraveyard zk path.
Json-serializable class for sending store information to the controller client
An interface for Store Ingestion Service for Venice.
A runnable Kafka Consumer consuming messages from all the partition assigned to current node for a Kafka Topic.
This enum represents all potential results after calling
StoreIngestionTask.delegateConsumerRecord(PubSubMessageProcessedResultWrapper, int, String, int, long, long)
.Builder class for
StoreIngestionTaskFactory
; once the build() function is invoked,
no modification is allowed on any parameters.Serializer used to convert the data between Store and json.
Mixin used to add the annotation to figure out the constructor used by Jackson lib when deserialize the version
Mixin used to add the annotation to figure out the constructor used by Jackson lib when deserialize the store.
Mixin used to add the annotation to figure out the constructor used by Jackson lib when deserialize the version
Throw this exception when try to add key schema to a store, whose key schema already exists.
This type of store metadata describes the key schemas of the store
This type describes the key schemas of the store
This interface defines a set of lifecycle events for stores and their store-versions.
Pre-hooks and post-hooks
Most events have a pre- and post- variant, all pre-hooks provide an option to control the outcome of the event (e.g.
Pre-hooks and post-hooks
Most events have a pre- and post- variant, all pre-hooks provide an option to control the outcome of the event (e.g.
Interface used to register into metadata repository to listen the change of store list.
This interface defines the APIs to retrieve store metadata and routing data,
and it also includes the feedback APIs:
StoreMetadata.trackHealthBasedOnRequestToInstance(java.lang.String, int, int, java.util.concurrent.CompletableFuture<com.linkedin.venice.client.store.transport.TransportClientResponse>)
to decide the healthiness of each replica.Modes that control how fast client will fetch store metadata
A composite key that divides the key space to retrieve different types of metadata for a Venice store.
A composite key that divides the key space to retrieve different types of metadata for a Venice store.
This class is a handle to refer to a store.
This type contains all the store configs and the corresponding versions
MetadataFetchRequest
encapsulates a GET request to /store_properties/storename on the storage node to fetch metadata
for that node.This class stores all the information required for answering a server metadata fetch request.
This record will store store properties
This structure will contain all kinds of info related to one replica
StoreRepushCommand contains the details of executing/processing repush command.
This class is the public interface for Venice store user to fetch store schemas.
This class retrieves the corresponding Store object from a router endpoint which contains the latest store state.
Control description used to notify controller that the status of Offline push in Storage node.
This type describes a single version of the value schema of the store.
This type of store metadata describes the value schemas of the store.
This type describes the value schemas of the store.
This class implements the fast value schema/latest value schema lookup with acceptable delay.
Type describes all the version attributes
A hook which returns this enum has the option of proceeding, aborting, waiting or rolling back.
This class is a handle to refer to a store-version.
This record maintains store-version level state, such as the StartOfBufferReplay Control Message, in the case of Hybrid Stores.
A configuration for a particular view.
A configuration for a particular view.
This class handles Write Compute operations related to a specific store.
This record will store all the additional info after sending out streaming response
This class provides percentile tracking and stats emission for streaming response
This class serves as the annotated schema cache for merge conflict resolution purpose in Active/Active replication.
Interface defined subscription based readonly operations to access stores.
A simple implementation of
AsyncFuture
which behaves as if the Future has already
completed successfully.Full response to indicate that all the sub responses are with good status:
HttpResponseStatus.OK
for streaming request.This class would copy the specified
SupersetSchemaGeneratorWithCustomProp.customProp
from the latest value schema to the generated
superset schema and in the meantime, the compare method in this impl will consider this extra property as well.This SystemStore class is a wrapper of the corresponding zk shared system store and the regular venice store to provide
the exact same interface as the regular Venice Store for System Stores.
Currently, acl creation is still managed by Nuage but Nuage is not aware about the system stores.
This class is the metric class for
SystemStoreRepairService
This class contains the logic to set up system stores.
This type describes all the distinct properties
This class is the system store repair service in Venice parent controller.
This class tries to scan all cluster which current parent controller is the leader controller.
Time
implementation that just reads from the system clockThis type of store metadata contains the target version states announced by the parent controller leader for a given store that each fabric should be bootstrapping towards.
An interface to report task health and track task progress for different execution engines
Interface for creating metric names enum for tehuti metrics
Utils for venice metrics
Generate a ratio stat that is based on two arbitrary
Rate
.Generate a ratio stat that is based on two arbitrary
MeasurableStat
.A task that periodically polls the job state for existing version topics in the parent fabric.
A test log appender that can be used to verify log messages in unit tests.
A
Time
implementation which abstracts logical time away from real
time, so that it can be deterministically controlled.General-purpose utility functions for tests.
An
ExecutorService
that executes each submitted task using
one of possibly several pooled threads, normally configured
using Executors
factory methods.Stats used to collect the usage of a thread pool including: 1.
Claims that a class is threadsafe or admonishes implementors of an interface
to ensure that their implementation is threadsafe.
Utility methods for time travelling and freezing time.
The interface to time itself.
This utility class provides methods to format and parse a timespan in a human readable form.
More efficient handling of cancellable schedulable events.
A
ThreadPoolExecutor
that can additionally schedule
commands to run after a given delay, or to execute
periodically.A class to encapsulate time values along with their measurement units
Works for both timestamps as well as time durations.
The topic cleanup in Venice adopts the following strategy:
1.
In parent controller,
TopicCleanupServiceForParentController
will remove all the deprecated topics:
topic with low retention policy.Topic manager is responsible for creating, deleting, and updating topics.
A context object that contains all the dependencies needed by
TopicManager
.A repository of
TopicManager
instances, each associated with a specific PubSub region and cluster.This class is used to find messages for a given key from a specified topic.
This class is a wrapper of pair of <Version Topic, Version /Real time topic partition>.
This class tracks consumed topic partitions' offsets
This ControlMessage is sent by the Controller, once per partition; it will only be used in leader/follower state transition model; this control message will indicate the leader to switch to a new source topic and start consuming from offset with a specific timestamp.
This class precomputes the conversion from Avro's
CharSequence
to String, and also hangs on to the
correct instance of PubSubTopic
, so that the lookup need not be performed in many locations.TopicWiseKafkaConsumerService
is used to allocate share consumer from consumer pool at topic granularity.Extend this class to create response objects for async command.
This interface exposes one more function:
TrackingStreamingCallback.onDeserializationCompletion(Optional, int, int)
, which
will be used for metric tracking.This
PubSubProducerAdapter
implementation allows tests to perform
arbitrary transformations on the messages that are about to be written to
Kafka.Define interfaces for TransportClient, and the target customer is the sub-classes of
AvroGenericStoreClient
Define the common functions for call back of
TransportClient
This class is used to capture the result returned by
TransportClient
Callback to support streaming in
TransportClient
This class implements a generic trie-based lookup tree.
The policy controls the TTL behavior regarding how batch writes are treated.
Interface for a generic typed field accessor.
Helper class to determine at run time which implementation to use.
Thrown when a required property is not present
This class makes use of
UnresolvedUnionException
, which is not present in all versions of Avro.This is thrown when some code encounters a message type it does not know about.
This service runs in the parent controller to delete historical unused value schemas.
Partial update operation, which merges the update value with the existing value.
This class applies the builder pattern to build a partial update record.
This enum represents status of something (a record or a field) after being updated.
Copies of the implementation of
URLEncoder.encode(String, String)
and
URLDecoder.decode(String, String)
except with the following changes:
Accept a Charset
argument for the character set instead of a string
which would cause Charset.forName(String)
to be called.
Internally use StringBuilder
instead of StringBuffer
.
These methods no longer throw a checked exception because it
no longer calls Charset.forName(String)
.
These methods are annotated with Nonnull
annotations.
This class is responsible for automatically create and delete per user store system store resources when the
corresponding user store is created or deleted.
Helper functions
This class encapsulates config entity information such as config name, default value, config document
Controller API version 1 HTTP GET routes
Mapper only MR to Validate Schema, Build compression dictionary if needed and persist
some data (total file size and compression dictionary) in HDFS to be used by the VPJ Driver
Note: processing all the files in this split are done sequentially and if it
results in significant increase in the mapper time or resulting in timeouts,
this needs to be revisited to be done via a thread pool.
The data protocol between ValidateSchemaAndBuildDictMapper and VenicePushJob
This class reads the data(total input size in bytes and zstd dictionary) persisted in HDFS
by
ValidateSchemaAndBuildDictMapper
based on the schema ValidateSchemaAndBuildDictMapperOutput
This class provides a way to:
1.
Interface for targeted region push validation.
This class holds a value of type and its corresponding replication metadata.
Optional field which describes the previous state of a given row.
This class provides the following functionalities:
1.
A runtime exception which may be thrown by implementation of AuthorizerService interface.
This class is a thin wrapper of
AvroSerializer
and AvroGenericDeserializer
.A record reader that reads records from Avro file input into Avro-serialized keys and values.
Classes for P2P use case that are triggered by blob transfer file not found in target host
Venice change capture consumer to provide value change callback.
Beam Connector for Venice Change data capture IO.
Interface to provide a configured
VeniceChangelogConsumer
instance.A Runnable that throws checked exceptions.
A Supplier that throws checked exceptions.
This class is responsible to filter records based on the RMD information and the ttl config by taking chunked records
ChunkAssembler.ValueBytesAndSchemaId
.This is the class to talk to
VeniceChunkedWriteHandler
to send back partial response.This class is the wrapper of
ChunkedWriteHandler
, and essentially it will introduce the following logic:
1.Thrown by the venice thin client if something goes wrong with the request
Parent class to VeniceClientHttpException
VeniceClientException
with a http codeProtobuf type
com.linkedin.venice.protocols.VeniceClientRequest
Protobuf type
com.linkedin.venice.protocols.VeniceClientRequest
This class copies most of the logic from
DefaultClientTlsStrategy
and AbstractClientTlsStrategy
to
get rid of the cipher check to be backward compatible.This class copies most of the logic from
ClientTlsStrategyBuilder
to get rid of the cipher check while using
http/2 in TlsCiphers
to be backward compatible.class that maintains config very specific to a Venice cluster
VeniceConfigService loads the static configuration that does not change
once the Server is started.
Venice Controller to manage the cluster.
Configuration which is specific to a Venice controller.
VeniceControllerContext contains dependencies required by
VeniceController
Protobuf type
com.linkedin.venice.protocols.controller.VeniceControllerGrpcErrorInfo
Protobuf type
com.linkedin.venice.protocols.controller.VeniceControllerGrpcErrorInfo
This class is a gRPC service implementation for the VeniceController public API.
The core handler for processing incoming requests in the VeniceController.
A service venice controller.
State model used to handle the change of leader-standby relationship for controllers.
This class contains all the
ScatterGatherMode
being used in Venice Router.Factory to create VeniceDistClusterControllerStateModel and provide some utility methods to get state model by given
cluster.
N.B.: Although there is no way to force this via Java interfaces, the convention is that all enums implementing this
interface should have static "valueOf" function to return the correct enum value from a given numeric value, i.e.:
Note that VeniceEnumValueTest makes it easy to test the above, and we should have a subclass of that test for all
implementations of this interface.
Abstract class which makes it as easy as possible to generically test all the assumptions for enums which implement
the
VeniceEnumValue
interface.Base exception that all other Venice exceptions extend
Custom Input Format with the following specs to be used for the feature
PushJobSetting.useMapperToBuildDict
with
ValidateSchemaAndBuildDictMapper
1.A specialized
DefaultFullHttpResponse
object to record the decompression time of the records in the response.A table format that is used by Spark to read Avro files from HDFS for use in VenicePushJob.
This is the entrypoint of the Avro input source.
Helix Admin based on 0.8.4.215 APIs.
{code VeniceHostFinder} finds a set of hosts that can serve the given Venice resource partition.
VeniceHostHealth
the aggregate statistics for HostHealthStats.This utility is used to manage all the HTTP/2 related configs and offer a way to spin up a pipeline to support
both HTTP/1.1 and HTTP/2.
Customized exception for inconsistent Avro schema error of input directory
in
VenicePushJob
This exception is thrown when we detect inconsistent or possibly corrupted store metadata on storage nodes.
Simple class that spins JVM platform stats into Venice stats.
This class is designed specifically for
KafkaInputFormat
, and right now, it is doing simple pass-through.This class is designed specifically for
KafkaInputFormat
, and right now, it will pick up the latest
entry according to the associated offset, and produce it to Kafka.This class is responsible to filter records based on the RMD information and the ttl config by taking KIF input
KafkaInputMapperValue
.Map objects to byte arrays and back again
Venice wrapper around a
Lock
object to record metrics and emit logs when lock acquisition is taking too long.Uses
KvCoder
to encode/decode PubSubMessage
key
and value.Custom Exception for Venice messages.
VeniceMetadataRepositoryBuilder is a centralized builder class for constructing a variety of metadata components
including store repository, schema repository, ZK-shared schema repository, ZK client and cluster info provider
for Da Vinci, Venice Service and Isolated Ingestion Service.
Configuration for metrics emitted by Venice: Holds OpenTelemetry as well as Tehuti configs
Configs starting with "otel.venice." are venice specific configs for OpenTelemetry metrics
other configs starting with "otel.exporter." are generic OpenTelemetry exporter configs but are parsed in this class and used setters to configure otel exporter.
Configs starting with "otel.venice." are venice specific configs for OpenTelemetry metrics
other configs starting with "otel.exporter." are generic OpenTelemetry exporter configs but are parsed in this class and used setters to configure otel exporter.
This is used to initialize a
Metrics
for each request.Repository to hold both tehuti and OpenTelemetry metrics.
VeniceMRPartitioner
will use the exact same partitioner as the one that will be used inside
VeniceWriter
(e.g.This enum is used to define various routing strategies for multi-key requests.
Thrown when an operation should return information about a store, but the store does not exist
Interface for listening to Notifications for Store consumption.
This class defines a wrapper object to wrap up the actual object and the corresponding update timestamp.
Venice implementation of
OfflinePushAccessor
.Handler for checking the state of the region of the parent controller to handle requests in its region.
This class is a wrapper of
VeniceHelixAdmin
, which will be used in parent controller.Determines partitioning, which is used for producing messages into the right
Kafka partitions and routing reads to the correct Venice storage nodes.
VenicePartitionFinder
provides methods to find the partition name or number for the given data.Inbound single get request to the router will look like:
GET /storage/storeName/key?f=fmt
'storage' is a literal, meaning we will request the value for a single key
storeName will be the name of the requested store
key is the key being looked up
fmt is an optional format parameter, one of 'string' or 'b64'.
Created by mwise on 4/25/16.
Classes for P2P use case that are triggered by founded peers cannot connect
Classes for P2P usecase that are triggered by peers not found
The API for online applications to write to Venice.
A Pulsar Sink that sends messages to Venice.
Configuration class for the Sink Connector.
This class sets up the Hadoop job used to push data to Venice.
Enumeration of response status codes for Venice read requests.
VeniceReducer
will be in charge of producing the messages to Kafka broker.This class simply exposes publicly the protected methods of its parent, for debugging purposes.
Venice resource that could be refreshed to retrieve the newest data or clear the current data in memory.
Class for all Venice exceptions that are triggered by Kafka topic authorization related issues.
VeniceResponseAggregator
aggregates the sub-responses returned by storage node for a multi-get request.CompletableFuture
implementation to handle partial response.This class is used to handle all the decompression related logic in Router, and it will be used in
VeniceResponseAggregator
for regular requests and VeniceDispatcher
for streaming requests.Venice customized map, which could contains either a full response or a partial response.
How Venice categorizes the response status of a request:
We are emitting both
HttpResponseStatusCodeCategory
and this enum to capture the http standard as
well as the Venice specific categorization.Interface for all Venice exceptions that are retriable.
This class is responsible to filter records based on the RMD information and the ttl config.
Common route handler implement the common part of error handling and writing response to json.
Configuration for Venice Router.
Customized exception for non-existing key/value field error in Avro schema
in
VenicePushJob
Interface defines how to serialize and deserialize the venice object.
Class that represents the Venice server.
VeniceServerConfig maintains configs specific to Venice Server, Da Vinci client and Isolated Ingestion Service.
VeniceServerContext contains dependencies required by VeniceServer
Protobuf type
com.linkedin.venice.protocols.VeniceServerResponse
Protobuf type
com.linkedin.venice.protocols.VeniceServerResponse
Creates socket connectors.
Spark server implementation
VeniceSparkPartitioner
will use the exact same partitioner as the one that will be used inside
VeniceWriter
(e.g.Creates instances of embedded jetty containers.
Thrown when an operation should create a store, but the store already exists
Interface for a cache on a venice store.
Customized exception for receiving invalid
VersionCreationResponse
in VenicePushJob
Thrown when an operation should return information about a store, but the store is migrated to another cluster
class that maintains all properties that are not specific to a venice server and cluster.
Samza jobs talk to either parent or child controller depending on the aggregate mode config.
VeniceSystemProducer
defines the interfaces for Samza jobs to send data to Venice stores.Enum used to differentiate the different types of Venice system stores when access their metadata.
Used when timeout happens in Venice.
Enums that lists most Venice user store types.
Customized exception for validation error for targeted colo push
in
VenicePushJob
VeniceVersionFinder
provides methods to find the version for a given store.This is the abstract implementation of the 'management' component for a venice view.
This class is kept separate from the VeniceView class to not leak certain concepts that
currently exclusively reside in the server to other components.
Mapper that reads Vson input and deserializes it as Avro object and then Avro binary
A record reader that reads records from VSON SequenceFile file input into Avro-serialized keys and values.
Class which acts as the primary writer API.
An interface which enables the key to contain parts of the within it, which is
useful for control messages and chunked values.
Factory used to create
VeniceWriter
.VeniceWriterOptions is used to pass arguments to VeniceWriter constructor.
A host level metric to track number of active venice writers.
This class contains constants that represent Venice-managed ZooKeeper paths.
Currently this VerifySslHandler is only used in servers or the isolated ingestion process.
This interface defines all the public APIs, and if you need to add accessors to
some new fields, this interface needs to be changed accordingly.
Producer type for writing data to Venice
Class defines the version of Venice store.
Enums of status of verion.
This controlMessage is written to the real-time topic by the controller or to the store-version topic by the current version's leader server.
This class contains some utils methods for VPJ jobs to obtain SSL-related configs.
Deprecated.
This class supports the special byte[] check.
This class supports the special byte[] check.
Deprecated.
Deprecated.
Deprecated.
Deprecated.
Deprecated.
Deprecated.
Deprecated.
This class was originally from Voldemort.
Deprecated.
Wait all strategy stands for Venice do not allow one replica to be failed.
Wait N-1 strategy stands for Venice could tolerate one of replica to be failed for each partition.
This utility class contains all String literal constants used in write compute.
This interface provides methods to execute write-compute operation a value
Write compute V1 handles value records that do not have replication metadata.
Write compute V2 handles value records with replication metadata.
This enum describe the possible write compute operations Venice supports.
This class is able to read write-computed value and apply it to original value.
This result wrapper is for non-Active-Active stores.
This class converts a Avro
Schema.Type.RECORD
schema to its write compute schema.validate if a write-compute schema can be pair with value schema
This class serves as a latency sensor for write path that contains two types of stats: Avg and Max.
This class is a helper class that contains writer side chunking logics.
Class used to access to the allowlist stored in ZK.
The stats keep track of ZK Client status changes.
This class contains methods to 1)migrate Venice-specific metadata from a source ZooKeeper (ZK) to a destination ZK
and 2)extract Venice-specific paths from an input text file containing ZK paths to an output text file.
The purpose of this class is to abstract Helix operations out of the
VeniceHelixAdmin
and eventually rename
it to VeniceAdmin.Manage live routers through Zookeeper.
Class defines the store of Venice.
The class is used to access the store configs in Zookeeper.