java.lang.Object

com.linkedin.venice.spark.chunk.SparkChunkAssembler

All Implemented Interfaces: Serializable

public class SparkChunkAssembler extends Object implements Serializable

Spark adapter for ChunkAssembler that handles chunked values and RMDs. Converts Spark Rows to the format expected by ChunkAssembler, assembles chunks, and returns the result as a Spark Row.

See Also:

Serialized Form

Constructor Summary

Constructors

| Constructor | Description |
| --- | --- |
| SparkChunkAssembler(boolean isRmdChunkingEnabled) | |
| SparkChunkAssembler(boolean isRmdChunkingEnabled, boolean isTTLFilteringEnabled, VeniceProperties filterProperties) | |

Method Summary

| Modifier and Type | Method | Description |
| --- | --- | --- |
| org.apache.spark.sql.Row | assembleChunks(byte[] keyBytes, Iterator<org.apache.spark.sql.Row> rows) | Assemble chunks for a single key. |

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Constructor Details
- SparkChunkAssembler
  
  public SparkChunkAssembler(boolean isRmdChunkingEnabled)
- SparkChunkAssembler
  
  public SparkChunkAssembler(boolean isRmdChunkingEnabled, boolean isTTLFilteringEnabled, VeniceProperties filterProperties)

Method Details
- assembleChunks
  
  public org.apache.spark.sql.Row assembleChunks(byte[] keyBytes, Iterator<org.apache.spark.sql.Row> rows)
  
  Assemble chunks for a single key. If TTL filtering is enabled, also filters the assembled record.
  
  Parameters:
  
  keyBytes - The key bytes
  
  rows - Iterator of rows for this key (MUST be sorted by offset DESC - highest offset first)
  
  Returns:
  
  The assembled row with the DEFAULT_SCHEMA_WITH_SCHEMA_ID schema, or null if the record is a DELETE, its chunks are incomplete, or it is filtered out by TTL
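
The ordering precondition on `rows` (highest offset first) is easy to violate when a caller builds the per-key group by hand. The following is a minimal sketch of that ordering step only, using plain Java collections. `OffsetOrderingSketch`, `OffsetRow`, and `highestOffsetFirst` are hypothetical names introduced for illustration; `OffsetRow` stands in for a Spark Row and is not part of Venice or Spark. How the offset is actually read from a real Row is not specified on this page.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.Iterator;
import java.util.List;

public class OffsetOrderingSketch {
    /** Hypothetical stand-in for a Spark Row; only the offset matters for ordering. */
    public static final class OffsetRow {
        public final long offset;
        public final String payload;

        public OffsetRow(long offset, String payload) {
            this.offset = offset;
            this.payload = payload;
        }
    }

    /** Returns an iterator over a sorted copy of the rows, highest offset first. */
    public static Iterator<OffsetRow> highestOffsetFirst(List<OffsetRow> rowsForKey) {
        // Copy first so the caller's list is left untouched.
        List<OffsetRow> sorted = new ArrayList<>(rowsForKey);
        // Ascending comparator on offset, reversed to get descending order.
        sorted.sort(Comparator.comparingLong((OffsetRow r) -> r.offset).reversed());
        return sorted.iterator();
    }

    public static void main(String[] args) {
        List<OffsetRow> rowsForKey = Arrays.asList(
            new OffsetRow(5L, "row-a"),
            new OffsetRow(9L, "row-b"),
            new OffsetRow(7L, "row-c"));

        // In real use, an iterator ordered like this would be what gets
        // passed as the rows argument of assembleChunks.
        Iterator<OffsetRow> it = highestOffsetFirst(rowsForKey);
        while (it.hasNext()) {
            OffsetRow row = it.next();
            System.out.println(row.offset + " " + row.payload);
        }
    }
}
```

Because assembleChunks may return null, a caller would typically also check the result for null before using the assembled row.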
