Class SparkInputRecordProcessor
java.lang.Object
com.linkedin.venice.hadoop.task.datawriter.AbstractDataWriterTask
com.linkedin.venice.hadoop.task.datawriter.AbstractInputRecordProcessor<ByteBuffer,ByteBuffer>
com.linkedin.venice.spark.datawriter.recordprocessor.SparkInputRecordProcessor
- All Implemented Interfaces:
Closeable
,AutoCloseable
An implementation of
AbstractInputRecordProcessor
for Spark that processes input records from the dataframe
and emits an Iterator
of Row
with DEFAULT_SCHEMA
as the schema.-
Field Summary
Fields inherited from class com.linkedin.venice.hadoop.task.datawriter.AbstractInputRecordProcessor
veniceRecordReader
Fields inherited from class com.linkedin.venice.hadoop.task.datawriter.AbstractDataWriterTask
TASK_ID_NOT_SET
-
Constructor Summary
ConstructorDescriptionSparkInputRecordProcessor
(Properties jobProperties, DataWriterAccumulators accumulators) -
Method Summary
Modifier and TypeMethodDescriptionprotected AbstractVeniceRecordReader<ByteBuffer,
ByteBuffer> getRecordReader
(VeniceProperties props) A method for child classes to setupAbstractInputRecordProcessor.veniceRecordReader
.Iterator<org.apache.spark.sql.Row>
processRecord
(org.apache.spark.sql.Row record) Methods inherited from class com.linkedin.venice.hadoop.task.datawriter.AbstractInputRecordProcessor
close, configureTask, process, processRecord, readDictionaryFromKafka
Methods inherited from class com.linkedin.venice.hadoop.task.datawriter.AbstractDataWriterTask
configure, getEngineTaskConfigProvider, getPartitionCount, getTaskId, isChunkingEnabled, isRmdChunkingEnabled, setChunkingEnabled
-
Constructor Details
-
SparkInputRecordProcessor
-
-
Method Details
-
processRecord
-
getRecordReader
Description copied from class:AbstractInputRecordProcessor
A method for child classes to setupAbstractInputRecordProcessor.veniceRecordReader
.- Specified by:
getRecordReader
in classAbstractInputRecordProcessor<ByteBuffer,
ByteBuffer>
-