AbstractVeniceFilter<INPUT_VALUE> |
An abstraction to filter given data type.
|
DefaultInputDataInfoProvider |
|
FilterChain<INPUT_VALUE> |
The FilterChain class takes a list of AbstractVeniceFilter to assemble a filter chain to manage the life cycles'
of filters and perform filtering based on the order of filters.
|
InputDataInfoProvider.InputDataInfo |
A POJO that contains input data information (schema information and input data file size)
|
InputStorageQuotaTracker |
This class is used to keep track of store storage quota and storage overhead ratio and check whether the total
input data size exceeds the quota
|
KafkaInputDataInfoProvider |
|
NoOpSentPushJobDetailsTracker |
|
PushJobSetting |
This class carries the state for the duration of the VenicePushJob.
|
PushJobZstdConfig |
|
ValidateSchemaAndBuildDictMapper |
Mapper only MR to Validate Schema, Build compression dictionary if needed and persist
some data (total file size and compression dictionary) in HDFS to be used by the VPJ Driver
Note: processing all the files in this split are done sequentially and if it
results in significant increase in the mapper time or resulting in timeouts,
this needs to be revisited to be done via a thread pool.
|
ValidateSchemaAndBuildDictMapperOutputReader |
|
ValidateSchemaAndBuildDictOutputFormat |
This class provides a way to:
1.
|
VeniceFileInputFormat |
|
VenicePushJob |
This class sets up the Hadoop job used to push data to Venice.
|
VsonSequenceFileInputFormat |
This class was originally from Voldemort.
|