Class PubSubPartitionSplit

java.lang.Object
com.linkedin.venice.vpj.pubsub.input.PubSubPartitionSplit
All Implemented Interfaces:
Serializable

public class PubSubPartitionSplit extends Object implements Serializable
Describes a contiguous range of records within a single PubSubTopicPartition. The range is bounded by a start position (inclusive) and an end position (exclusive). It is used to enable parallel reading during repush jobs, where data is read from pub-sub topics and written into a new version of a Venice store.

Scope and constraints:

  • One PubSubPartitionSplit covers exactly one topic partition.
  • startPubSubPosition must be <= endPubSubPosition.
  • Instances should be treated as immutable once constructed.

Engine independence: This is a framework-agnostic descriptor. Adapters can translate it to a Hadoop InputSplit or a Spark DataSource V2 InputPartition.

See Also: