T
- the type of record returned by this source@PublicEvolving public class HiveSource<T> extends AbstractFileSource<T,HiveSourceSplit>
HiveSourceSplit
and
uses BulkFormat
to read the data. A built-in BulkFormat is provided to return records in
type of RowData
. It's also possible to implement a custom BulkFormat to return data in
different types. Use HiveSourceBuilder
to build HiveSource instances.AbstractFileSource.AbstractFileSourceBuilder<T,SplitT extends FileSourceSplit,SELF extends AbstractFileSource.AbstractFileSourceBuilder<T,SplitT,SELF>>
Modifier and Type | Method and Description |
---|---|
SplitEnumerator<HiveSourceSplit,PendingSplitsCheckpoint<HiveSourceSplit>> |
createEnumerator(SplitEnumeratorContext<HiveSourceSplit> enumContext)
Creates a new SplitEnumerator for this source, starting a new input.
|
SimpleVersionedSerializer<PendingSplitsCheckpoint<HiveSourceSplit>> |
getEnumeratorCheckpointSerializer()
Creates the serializer for the
SplitEnumerator checkpoint. |
SimpleVersionedSerializer<HiveSourceSplit> |
getSplitSerializer()
Creates a serializer for the source splits.
|
SplitEnumerator<HiveSourceSplit,PendingSplitsCheckpoint<HiveSourceSplit>> |
restoreEnumerator(SplitEnumeratorContext<HiveSourceSplit> enumContext,
PendingSplitsCheckpoint<HiveSourceSplit> checkpoint)
Restores an enumerator from a checkpoint.
|
createReader, getAssignerFactory, getBoundedness, getContinuousEnumerationSettings, getProducedType
public SimpleVersionedSerializer<HiveSourceSplit> getSplitSerializer()
Source
getSplitSerializer
in interface Source<T,HiveSourceSplit,PendingSplitsCheckpoint<HiveSourceSplit>>
getSplitSerializer
in class AbstractFileSource<T,HiveSourceSplit>
public SimpleVersionedSerializer<PendingSplitsCheckpoint<HiveSourceSplit>> getEnumeratorCheckpointSerializer()
Source
SplitEnumerator
checkpoint. The serializer is used for
the result of the SplitEnumerator.snapshotState(long)
method.getEnumeratorCheckpointSerializer
in interface Source<T,HiveSourceSplit,PendingSplitsCheckpoint<HiveSourceSplit>>
getEnumeratorCheckpointSerializer
in class AbstractFileSource<T,HiveSourceSplit>
public SplitEnumerator<HiveSourceSplit,PendingSplitsCheckpoint<HiveSourceSplit>> createEnumerator(SplitEnumeratorContext<HiveSourceSplit> enumContext)
Source
createEnumerator
in interface Source<T,HiveSourceSplit,PendingSplitsCheckpoint<HiveSourceSplit>>
createEnumerator
in class AbstractFileSource<T,HiveSourceSplit>
enumContext
- The context
for the split enumerator.public SplitEnumerator<HiveSourceSplit,PendingSplitsCheckpoint<HiveSourceSplit>> restoreEnumerator(SplitEnumeratorContext<HiveSourceSplit> enumContext, PendingSplitsCheckpoint<HiveSourceSplit> checkpoint)
Source
restoreEnumerator
in interface Source<T,HiveSourceSplit,PendingSplitsCheckpoint<HiveSourceSplit>>
restoreEnumerator
in class AbstractFileSource<T,HiveSourceSplit>
enumContext
- The context
for the restored split
enumerator.checkpoint
- The checkpoint to restore the SplitEnumerator from.Copyright © 2014–2023 The Apache Software Foundation. All rights reserved.