Interface VeniceChangelogConsumer<K,V>

Type Parameters:
K - The Type for key
V - The Type for value
All Known Implementing Classes:
LocalBootstrappingVeniceChangelogConsumer, VeniceAfterImageConsumerImpl, VeniceChangelogConsumerImpl

@Experimental public interface VeniceChangelogConsumer<K,V>
Venice change capture consumer to provide value change callback.
  • Method Details

    • getPartitionCount

      int getPartitionCount()
      Returns:
      total number of store partitions
    • subscribe

      CompletableFuture<Void> subscribe(Set<Integer> partitions)
      Subscribe this VeniceChangelogConsumer to a set of partitions for a store. The VeniceChangelogConsumer should try to consume messages from all partitions that are subscribed to it.
      Parameters:
      partitions - the set of partitions to subscribe to and consume
      Returns:
      a future which completes when the partitions are ready to be consumed
      Throws:
      VeniceException - if the subscribe operation failed for any of the partitions
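      As an illustration of how getPartitionCount() and subscribe(Set) might be combined, the sketch below splits partitions round-robin across several workers and subscribes one worker to its share. PartitionSubscriber is a hypothetical stand-in that copies only the two documented signatures; the helper names, and the assumption that partition IDs run from 0 to getPartitionCount() - 1, are not taken from this page.

```java
import java.util.Set;
import java.util.TreeSet;
import java.util.concurrent.CompletableFuture;

// Hypothetical stand-in: mirrors only the two documented signatures used below.
interface PartitionSubscriber {
  int getPartitionCount();
  CompletableFuture<Void> subscribe(Set<Integer> partitions);
}

final class SubscribeSketch {
  // Round-robin split: worker w of n takes partitions w, w + n, w + 2n, ...
  // Assumes partition IDs are 0 .. partitionCount - 1 (not stated on this page).
  static Set<Integer> partitionsForWorker(int partitionCount, int workerIndex, int workerCount) {
    if (workerCount <= 0 || workerIndex < 0 || workerIndex >= workerCount) {
      throw new IllegalArgumentException("workerIndex must be in [0, workerCount)");
    }
    Set<Integer> assigned = new TreeSet<>();
    for (int p = workerIndex; p < partitionCount; p += workerCount) {
      assigned.add(p);
    }
    return assigned;
  }

  // Subscribes to this worker's share and blocks until the returned future completes.
  static Set<Integer> subscribeShare(PartitionSubscriber consumer, int workerIndex, int workerCount) {
    Set<Integer> share = partitionsForWorker(consumer.getPartitionCount(), workerIndex, workerCount);
    // join() rethrows a failed future as an unchecked CompletionException.
    consumer.subscribe(share).join();
    return share;
  }
}
```

      Blocking with join() keeps the sketch short; a caller that must not block could instead chain further work on the returned future.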
    • seekToBeginningOfPush

      CompletableFuture<Void> seekToBeginningOfPush(Set<Integer> partitions)
      Seek to the beginning of the push for a set of partitions. This is analogous to bootstrapping data for the consumer, and it instructs the consumer to consume data from the batch push. The seek ONLY goes to the beginning of the version which is currently serving data. If a new version is created, the consumer switches to reading from it once it has read up to the point in the change capture stream that indicates the version swap, which can only occur after consuming all the data in the last push.
      Parameters:
      partitions - the set of partitions to seek with
      Returns:
      a future which completes when the operation has succeeded for all partitions.
      Throws:
      VeniceException - if seek operation failed for any of the partitions, or seeking was performed on unsubscribed partitions
    • seekToBeginningOfPush

      CompletableFuture<Void> seekToBeginningOfPush()
      Seek to the beginning of the push for subscribed partitions. See seekToBeginningOfPush(Set) for more information.
      Returns:
      a future which completes when the partitions are ready to be consumed
      Throws:
      VeniceException - if seek operation failed for any of the partitions.
    • seekToEndOfPush

      CompletableFuture<Void> seekToEndOfPush(Set<Integer> partitions)
      Seek to the end of the last push for a given set of partitions. This instructs the consumer to begin consuming events which are transmitted to Venice following the last batch push.
      Parameters:
      partitions - the set of partitions to seek with
      Returns:
      a future which completes when the operation has succeeded for all partitions.
      Throws:
      VeniceException - if seek operation failed for any of the partitions, or seeking was performed on unsubscribed partitions
    • pause

      void pause()
      Pause the client on all subscriptions. See pause(Set) for more information.
      Throws:
      VeniceException - if operation failed for any of the partitions.
    • resume

      void resume(Set<Integer> partitions)
      Resume the client on all or a subset of the partitions this client is subscribed to and has paused.
      Throws:
      VeniceException - if operation failed for any of the partitions.
    • resume

      void resume()
      Resume the client on all subscriptions. See resume(Set) for more information.
      Throws:
      VeniceException - if operation failed for any of the partitions.
    • pause

      void pause(Set<Integer> partitions)
      Pause the client on all or a subset of the partitions this client is subscribed to. Calls to poll(long) will not return results from paused partitions until resume(Set) is called for those partitions.
      Throws:
      VeniceException - if operation failed for any of the partitions.
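      One possible use of pause(Set) and resume(Set) is simple backpressure: stop receiving from a set of partitions while a local work queue is full, and resume once it has drained. The sketch below is only an illustration under that assumption. PausableConsumer is a hypothetical stand-in for the two documented signatures, and the watermark parameters are invented for the example.

```java
import java.util.Set;

// Hypothetical stand-in: mirrors only pause(Set) and resume(Set) as documented above.
interface PausableConsumer {
  void pause(Set<Integer> partitions);
  void resume(Set<Integer> partitions);
}

final class BackpressureSketch {
  private final PausableConsumer consumer;
  private final Set<Integer> partitions;
  private final int highWatermark; // pause at or above this queue size
  private final int lowWatermark;  // resume at or below this queue size
  private boolean paused;

  BackpressureSketch(PausableConsumer consumer, Set<Integer> partitions, int highWatermark, int lowWatermark) {
    if (lowWatermark >= highWatermark) {
      throw new IllegalArgumentException("lowWatermark must be below highWatermark");
    }
    this.consumer = consumer;
    this.partitions = partitions;
    this.highWatermark = highWatermark;
    this.lowWatermark = lowWatermark;
  }

  // Call whenever the size of the local work queue changes.
  void onQueueSize(int queued) {
    if (!paused && queued >= highWatermark) {
      consumer.pause(partitions);
      paused = true;
    } else if (paused && queued <= lowWatermark) {
      consumer.resume(partitions);
      paused = false;
    }
  }

  boolean isPaused() {
    return paused;
  }
}
```

      Using two watermarks rather than one threshold avoids toggling between pause and resume when the queue size hovers around a single value.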
    • seekToEndOfPush

      CompletableFuture<Void> seekToEndOfPush()
      Seek to the end of the push for all subscribed partitions. See seekToEndOfPush(Set) for more information.
      Returns:
      a future which completes when the operation has succeeded for all partitions.
      Throws:
      VeniceException - if seek operation failed for any of the partitions.
    • seekToTail

      CompletableFuture<Void> seekToTail(Set<Integer> partitions)
      Seek to the end of events which have been transmitted to Venice and start consuming new events. This will ONLY consume events transmitted via nearline and incremental push. It will not read batch push data.
      Parameters:
      partitions - the set of partitions to seek with
      Returns:
      a future which completes when the operation has succeeded for all partitions.
      Throws:
      VeniceException - if seek operation failed for any of the partitions, or seeking was performed on unsubscribed partitions
    • seekToTail

      CompletableFuture<Void> seekToTail()
      Seek to the end of events which have been transmitted to Venice for all subscribed partitions. See seekToTail(Set) for more information.
      Returns:
      a future which completes when the operation has succeeded for all partitions.
      Throws:
      VeniceException - if seek operation failed for any of the partitions.
    • seekToCheckpoint

      Seek to the provided checkpoints for the specified partitions. Checkpoints have the following properties, which callers should take into account:
      - Checkpoints are NOT comparable or valid across partitions.
      - Checkpoints are NOT comparable or valid across colos.
      - Checkpoints are NOT comparable across store versions.
      - It is not possible to determine the number of events between two checkpoints.
      - A checkpoint may no longer be within retention. In that case, an exception is returned to the caller.
      Parameters:
      checkpoints -
      Returns:
      a future which completes when seek has completed for all partitions
      Throws:
      VeniceException - if seek operation failed for any of the partitions
      VeniceCoordinateOutOfRangeException - if passed checkpoint is no longer valid
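      Because a stored checkpoint can fall out of retention, a caller may want a fallback. The sketch below shows one possible policy, not a recommended one: try the checkpoint seek, and if it fails, re-bootstrap with seekToBeginningOfPush(Set). CheckpointSeeker is a hypothetical stand-in; in particular, the parameter type of seekToCheckpoint is an assumption, since the signature is not shown on this page. For brevity the sketch falls back on any failure; real code would likely fall back only on VeniceCoordinateOutOfRangeException.

```java
import java.util.Set;
import java.util.concurrent.CompletableFuture;

// Hypothetical stand-in. The checkpoint type C and the Set<C> parameter are assumptions.
interface CheckpointSeeker<C> {
  CompletableFuture<Void> seekToCheckpoint(Set<C> checkpoints);
  CompletableFuture<Void> seekToBeginningOfPush(Set<Integer> partitions);
}

final class CheckpointSeekSketch {
  // Completes with "checkpoint" if the checkpoint seek worked,
  // or "beginning-of-push" if the fallback seek was used instead.
  static <C> CompletableFuture<String> seekOrRebootstrap(
      CheckpointSeeker<C> consumer, Set<C> checkpoints, Set<Integer> partitions) {
    CompletableFuture<Void> seek;
    try {
      seek = consumer.seekToCheckpoint(checkpoints);
    } catch (RuntimeException e) {
      // Cover the case where the failure is thrown directly rather than via the future.
      seek = new CompletableFuture<>();
      seek.completeExceptionally(e);
    }
    // handle() turns success or failure into a plain value: null means the seek succeeded.
    return seek.handle((ok, err) -> err).thenCompose(err -> {
      if (err == null) {
        return CompletableFuture.completedFuture("checkpoint");
      }
      return consumer.seekToBeginningOfPush(partitions).thenApply(v -> "beginning-of-push");
    });
  }
}
```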
    • subscribeAll

      CompletableFuture<Void> subscribeAll()
      Subscribe to all partitions belonging to a specific store.
      Returns:
      a future which completes when all partitions are ready to be consumed
      Throws:
      VeniceException - if the subscribe operation failed for any of the partitions
    • seekToTimestamps

      CompletableFuture<Void> seekToTimestamps(Map<Integer,Long> timestamps)
      Seek to the provided timestamps for the specified partitions. Timestamps are based on the wall clock time at which a message was processed by Venice and produced to change capture. This API can only be used to seek on nearline data applied to the current serving version in Venice; it will not seek on data transmitted via batch push. If the provided timestamp is lower than the earliest timestamp on a given stream, the earliest event will be returned. THIS WILL NOT SEEK TO DATA WHICH WAS APPLIED ON A PREVIOUS VERSION. You should never seek back to a timestamp smaller than (current time - rewindTimeInSeconds), where rewindTimeInSeconds is configured in the hybrid settings for this Venice store. The timestamp passed to this function should correspond to timestamps processed by this interface: PubSubMessage.getPubSubMessageTime() returns the time when Venice processed the event, and calls to this method seek based on that sequence of events. Note that it bears no relation to the timestamp that an upstream producer may optionally provide when writing a record to Venice.
      Parameters:
      timestamps - a map keyed by partition ID, holding the timestamp to seek to for each partition.
      Returns:
      a future which completes when the seek has completed for the specified partitions
      Throws:
      VeniceException - if seek operations failed for any of the specified partitions.
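      The rewind constraint above can be made concrete with a little arithmetic. The sketch below builds the map argument for seekToTimestamps(Map), raising any requested timestamp that is older than (current time - rewindTimeInSeconds) to that bound. The class and method names are hypothetical, and the sketch assumes timestamps are epoch milliseconds, which this page does not state.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;

final class SeekTimestampSketch {
  // Oldest timestamp the text above allows seeking to.
  // Assumes epoch milliseconds for timestamps (an assumption, not stated on this page).
  static long earliestSafeTimestampMs(long nowMs, long rewindTimeInSeconds) {
    return nowMs - rewindTimeInSeconds * 1000L;
  }

  // Builds a partition -> timestamp map, never older than the rewind bound.
  static Map<Integer, Long> timestampsFor(
      Set<Integer> partitions, long requestedMs, long nowMs, long rewindTimeInSeconds) {
    long effective = Math.max(requestedMs, earliestSafeTimestampMs(nowMs, rewindTimeInSeconds));
    Map<Integer, Long> result = new HashMap<>();
    for (Integer partition : partitions) {
      result.put(partition, effective);
    }
    return result;
  }
}
```

      For example, with nowMs = 100000 and rewindTimeInSeconds = 60, the bound is 40000, so a requested timestamp of 1000 is raised to 40000 while a requested timestamp of 50000 is kept as is.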
    • seekToTimestamp

      CompletableFuture<Void> seekToTimestamp(Long timestamp)
      Seek to the specified timestamp for all subscribed partitions. See seekToTimestamps(Map) for more information.
      Returns:
      a future which completes when the operation has succeeded for all partitions.
      Throws:
      VeniceException - if seek operation failed for any of the partitions.
    • unsubscribe

      void unsubscribe(Set<Integer> partitions)
      Stop ingesting messages from a set of partitions for a specific store.
      Parameters:
      partitions - The set of topic partitions to unsubscribe
      Throws:
      VeniceException - if the unsubscribe operation failed for any of the partitions
    • unsubscribeAll

      void unsubscribeAll()
      Stop ingesting messages from all partitions.
      Throws:
      VeniceException - if the unsubscribe operation failed for any of the partitions
    • poll

      Polling function to get any available messages from the underlying system for all partitions subscribed.
      Parameters:
      timeoutInMs - The maximum time to block/wait in between two polling requests (must not be greater than Long.MAX_VALUE milliseconds)
      Returns:
      a collection of messages since the last fetch for the subscribed list of topic partitions
      Throws:
      VeniceException - if the polling operation fails
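      A typical way to use a polling method like this is a loop that hands each returned message to a handler. The sketch below is a minimal illustration that stops after a number of consecutive empty polls. PollableConsumer is a hypothetical stand-in: the element type M and the Collection return type are assumptions, since the poll signature is not shown on this page, and the stop condition is invented for the example.

```java
import java.util.Collection;
import java.util.function.Consumer;

// Hypothetical stand-in: the return type of poll is assumed to be a Collection of messages.
interface PollableConsumer<M> {
  Collection<M> poll(long timeoutInMs);
}

final class PollLoopSketch {
  // Polls until maxEmptyPolls consecutive polls return nothing.
  // Returns the number of messages passed to the handler.
  static <M> int drain(PollableConsumer<M> consumer, long timeoutInMs, int maxEmptyPolls, Consumer<M> handler) {
    int handled = 0;
    int emptyPolls = 0;
    while (emptyPolls < maxEmptyPolls) {
      Collection<M> batch = consumer.poll(timeoutInMs);
      if (batch.isEmpty()) {
        emptyPolls++;
        continue;
      }
      emptyPolls = 0; // any data resets the idle counter
      for (M message : batch) {
        handler.accept(message);
        handled++;
      }
    }
    return handled;
  }
}
```

      A long-running consumer would more likely loop until shutdown is requested; the empty-poll limit only keeps the sketch finite.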
    • isCaughtUp

      boolean isCaughtUp()
      Checks whether all subscribed partitions are caught up during bootstrap. A partition is considered caught up if its (currentTimestamp - latestMessageTimestamp) is less than or equal to 1 minute.
      Returns:
      True if all subscribed partitions have caught up.
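      The caught-up rule can be written out as a small predicate. The sketch below restates the rule from the description; it is not the library's implementation. The names are hypothetical, and millisecond timestamps are an assumption.

```java
import java.util.Map;

final class CaughtUpSketch {
  // The 1 minute threshold from the description, in milliseconds.
  static final long LAG_THRESHOLD_MS = 60_000L;

  // A partition is caught up when its lag is at most the threshold.
  static boolean isPartitionCaughtUp(long currentTimestampMs, long latestMessageTimestampMs) {
    return currentTimestampMs - latestMessageTimestampMs <= LAG_THRESHOLD_MS;
  }

  // True only if every partition in the map is caught up.
  static boolean allCaughtUp(long currentTimestampMs, Map<Integer, Long> latestMessageTimestampByPartition) {
    for (long latest : latestMessageTimestampByPartition.values()) {
      if (!isPartitionCaughtUp(currentTimestampMs, latest)) {
        return false;
      }
    }
    return true;
  }
}
```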
    • close

      void close()
      Release the internal resources.