Modifier and Type | Class and Description |
---|---|
class |
AbstractFileSource<T,SplitT extends FileSourceSplit>
The base class for File Sources.
|
protected static class |
AbstractFileSource.AbstractFileSourceBuilder<T,SplitT extends FileSourceSplit,SELF extends AbstractFileSource.AbstractFileSourceBuilder<T,SplitT,SELF>>
The generic base builder.
|
class |
FileSourceSplitState<SplitT extends FileSourceSplit>
State of the reader, essentially a mutable version of the
FileSourceSplit . |
class |
PendingSplitsCheckpoint<SplitT extends FileSourceSplit>
A checkpoint of the current state of the containing the currently pending splits that are not yet
assigned.
|
class |
PendingSplitsCheckpointSerializer<T extends FileSourceSplit>
A serializer for the
PendingSplitsCheckpoint . |
Modifier and Type | Method and Description |
---|---|
static <T extends FileSourceSplit> |
PendingSplitsCheckpoint.fromCollectionSnapshot(Collection<T> splits) |
static <T extends FileSourceSplit> |
PendingSplitsCheckpoint.fromCollectionSnapshot(Collection<T> splits,
Collection<Path> alreadyProcessedPaths) |
Modifier and Type | Method and Description |
---|---|
FileSourceSplit |
FileSourceSplitSerializer.deserialize(int version,
byte[] serialized) |
FileSourceSplit |
FileSourceSplit.updateWithCheckpointedPosition(CheckpointedPosition position)
Creates a copy of this split where the checkpointed position is replaced by the given new
position.
|
Modifier and Type | Method and Description |
---|---|
SimpleVersionedSerializer<FileSourceSplit> |
FileSource.getSplitSerializer() |
Modifier and Type | Method and Description |
---|---|
byte[] |
FileSourceSplitSerializer.serialize(FileSourceSplit split) |
Modifier and Type | Method and Description |
---|---|
static <T> FileSource.FileSourceBuilder<T> |
FileSource.forBulkFileFormat(BulkFormat<T,FileSourceSplit> bulkFormat,
Path... paths)
Builds a new
FileSource using a BulkFormat to read batches of records from
files. |
Modifier and Type | Method and Description |
---|---|
Optional<FileSourceSplit> |
SimpleSplitAssigner.getNext(String hostname) |
Optional<FileSourceSplit> |
FileSplitAssigner.getNext(String hostname)
Gets the next split.
|
Optional<FileSourceSplit> |
LocalityAwareSplitAssigner.getNext(String host) |
Collection<FileSourceSplit> |
SimpleSplitAssigner.remainingSplits() |
Collection<FileSourceSplit> |
FileSplitAssigner.remainingSplits()
Gets the remaining splits that this assigner has pending.
|
Collection<FileSourceSplit> |
LocalityAwareSplitAssigner.remainingSplits() |
Modifier and Type | Method and Description |
---|---|
void |
SimpleSplitAssigner.addSplits(Collection<FileSourceSplit> newSplits) |
void |
FileSplitAssigner.addSplits(Collection<FileSourceSplit> splits)
Adds a set of splits to this assigner.
|
void |
LocalityAwareSplitAssigner.addSplits(Collection<FileSourceSplit> splits) |
FileSplitAssigner |
FileSplitAssigner.Provider.create(Collection<FileSourceSplit> initialSplits)
Creates a new
FileSplitAssigner that starts with the given set of initial splits. |
Constructor and Description |
---|
LocalityAwareSplitAssigner(Collection<FileSourceSplit> splits) |
SimpleSplitAssigner(Collection<FileSourceSplit> splits) |
Modifier and Type | Method and Description |
---|---|
Collection<FileSourceSplit> |
NonSplittingRecursiveEnumerator.enumerateSplits(Path[] paths,
int minDesiredSplits) |
Collection<FileSourceSplit> |
FileEnumerator.enumerateSplits(Path[] paths,
int minDesiredSplits)
Generates all file splits for the relevant files under the given paths.
|
Modifier and Type | Method and Description |
---|---|
protected void |
NonSplittingRecursiveEnumerator.addSplitsForPath(FileStatus fileStatus,
FileSystem fs,
ArrayList<FileSourceSplit> target) |
protected void |
NonSplittingRecursiveAllDirEnumerator.addSplitsForPath(FileStatus fileStatus,
FileSystem fs,
ArrayList<FileSourceSplit> target) |
protected void |
BlockSplittingRecursiveAllDirEnumerator.addSplitsForPath(FileStatus fileStatus,
FileSystem fs,
ArrayList<FileSourceSplit> target) |
protected void |
NonSplittingRecursiveEnumerator.convertToSourceSplits(FileStatus file,
FileSystem fs,
List<FileSourceSplit> target) |
protected void |
BlockSplittingRecursiveEnumerator.convertToSourceSplits(FileStatus file,
FileSystem fs,
List<FileSourceSplit> target) |
Modifier and Type | Class and Description |
---|---|
class |
DynamicFileSplitEnumerator<SplitT extends FileSourceSplit>
A SplitEnumerator implementation that supports dynamic filtering.
|
class |
FileSourceReader<T,SplitT extends FileSourceSplit>
A
SourceReader that read records from FileSourceSplit . |
Modifier and Type | Method and Description |
---|---|
PendingSplitsCheckpoint<FileSourceSplit> |
StaticFileSplitEnumerator.snapshotState(long checkpointId) |
PendingSplitsCheckpoint<FileSourceSplit> |
ContinuousFileSplitEnumerator.snapshotState(long checkpointId) |
Modifier and Type | Method and Description |
---|---|
BulkFormat.Reader<T> |
StreamFormatAdapter.createReader(Configuration config,
FileSourceSplit split) |
BulkFormat.Reader<T> |
FileRecordFormatAdapter.createReader(Configuration config,
FileSourceSplit split)
Deprecated.
|
BulkFormat.Reader<T> |
StreamFormatAdapter.restoreReader(Configuration config,
FileSourceSplit split) |
BulkFormat.Reader<T> |
FileRecordFormatAdapter.restoreReader(Configuration config,
FileSourceSplit split)
Deprecated.
|
Modifier and Type | Method and Description |
---|---|
void |
StaticFileSplitEnumerator.addSplitsBack(List<FileSourceSplit> splits,
int subtaskId) |
void |
ContinuousFileSplitEnumerator.addSplitsBack(List<FileSourceSplit> splits,
int subtaskId) |
Constructor and Description |
---|
ContinuousFileSplitEnumerator(SplitEnumeratorContext<FileSourceSplit> context,
FileEnumerator enumerator,
FileSplitAssigner splitAssigner,
Path[] paths,
Collection<Path> alreadyDiscoveredPaths,
long discoveryInterval) |
StaticFileSplitEnumerator(SplitEnumeratorContext<FileSourceSplit> context,
FileSplitAssigner splitAssigner) |
Modifier and Type | Interface and Description |
---|---|
interface |
BulkFormat<T,SplitT extends FileSourceSplit>
The
BulkFormat reads and decodes batches of records at a time. |
Modifier and Type | Class and Description |
---|---|
class |
LimitableBulkFormat<T,SplitT extends FileSourceSplit>
A
BulkFormat that can limit output record number. |
interface |
PartitionFieldExtractor<T extends FileSourceSplit>
Interface to extract partition field from split.
|
Modifier and Type | Method and Description |
---|---|
static <T,SplitT extends FileSourceSplit> |
LimitableBulkFormat.create(BulkFormat<T,SplitT> format,
Long limit) |
Modifier and Type | Method and Description |
---|---|
static PartitionFieldExtractor<FileSourceSplit> |
PartitionFieldExtractor.forFileSystem(String defaultPartValue) |
Modifier and Type | Method and Description |
---|---|
org.apache.flink.connector.file.table.DeserializationSchemaAdapter.Reader |
DeserializationSchemaAdapter.createReader(Configuration config,
FileSourceSplit split) |
org.apache.flink.connector.file.table.DeserializationSchemaAdapter.Reader |
DeserializationSchemaAdapter.restoreReader(Configuration config,
FileSourceSplit split) |
Constructor and Description |
---|
FileSystemTableSource(ObjectIdentifier tableIdentifier,
DataType physicalRowDataType,
List<String> partitionKeys,
ReadableConfig tableOptions,
DecodingFormat<BulkFormat<RowData,FileSourceSplit>> bulkReaderFormat,
DecodingFormat<DeserializationSchema<RowData>> deserializationFormat) |
Modifier and Type | Method and Description |
---|---|
static <T> CompactReader.Factory<T> |
CompactBulkReader.factory(BulkFormat<T,FileSourceSplit> format) |
Modifier and Type | Method and Description |
---|---|
Collection<FileSourceSplit> |
HiveSourceDynamicFileEnumerator.enumerateSplits(Path[] paths,
int minDesiredSplits) |
Collection<FileSourceSplit> |
HiveSourceFileEnumerator.enumerateSplits(Path[] paths,
int minDesiredSplits) |
Modifier and Type | Class and Description |
---|---|
class |
HiveSourceSplit
A sub-class of
FileSourceSplit that contains extra information needed to read a hive
table. |
Modifier and Type | Method and Description |
---|---|
FileSourceSplit |
HiveSourceSplit.updateWithCheckpointedPosition(CheckpointedPosition position) |
Modifier and Type | Class and Description |
---|---|
class |
AbstractAvroBulkFormat<A,T,SplitT extends FileSourceSplit>
Provides a
BulkFormat for Avro records. |
Modifier and Type | Method and Description |
---|---|
BulkFormat<RowData,FileSourceSplit> |
CsvFileFormatFactory.CsvBulkDecodingFormat.createRuntimeDecoder(DynamicTableSource.Context context,
DataType physicalDataType,
int[][] projections) |
Modifier and Type | Class and Description |
---|---|
class |
ParquetColumnarRowInputFormat<SplitT extends FileSourceSplit>
A
ParquetVectorizedInputFormat to provide RowData iterator. |
class |
ParquetVectorizedInputFormat<T,SplitT extends FileSourceSplit>
Parquet
BulkFormat that reads data from the file to VectorizedColumnBatch in
vectorized mode. |
Modifier and Type | Method and Description |
---|---|
static <SplitT extends FileSourceSplit> |
ParquetColumnarRowInputFormat.createPartitionedFormat(org.apache.hadoop.conf.Configuration hadoopConfig,
RowType producedRowType,
TypeInformation<RowData> producedTypeInfo,
List<String> partitionKeys,
PartitionFieldExtractor<SplitT> extractor,
int batchSize,
boolean isUtcTimestamp,
boolean isCaseSensitive)
Create a partitioned
ParquetColumnarRowInputFormat , the partition columns can be
generated by Path . |
Modifier and Type | Method and Description |
---|---|
BulkFormat<RowData,FileSourceSplit> |
ParquetFileFormatFactory.ParquetBulkDecodingFormat.createRuntimeDecoder(DynamicTableSource.Context sourceContext,
DataType producedDataType,
int[][] projections) |
Modifier and Type | Interface and Description |
---|---|
interface |
ColumnBatchFactory<SplitT extends FileSourceSplit>
Interface to create
VectorizedColumnBatch . |
Modifier and Type | Method and Description |
---|---|
static <SplitT extends FileSourceSplit> |
ColumnBatchFactory.withoutExtraFields() |
Modifier and Type | Class and Description |
---|---|
class |
AbstractOrcFileInputFormat<T,BatchT,SplitT extends FileSourceSplit>
The base for ORC readers for the
FileSource . |
class |
OrcColumnarRowFileInputFormat<BatchT,SplitT extends FileSourceSplit>
Deprecated.
Please use
OrcColumnarRowInputFormat . |
class |
OrcColumnarRowInputFormat<BatchT,SplitT extends FileSourceSplit>
An ORC reader that produces a stream of
ColumnarRowData records. |
Modifier and Type | Method and Description |
---|---|
static <SplitT extends FileSourceSplit> |
OrcColumnarRowInputFormat.createPartitionedFormat(OrcShim<org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch> shim,
org.apache.hadoop.conf.Configuration hadoopConfig,
RowType tableType,
List<String> partitionKeys,
PartitionFieldExtractor<SplitT> extractor,
int[] selectedFields,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize,
Function<RowType,TypeInformation<RowData>> rowTypeInfoFactory)
Create a partitioned
OrcColumnarRowInputFormat , the partition columns can be
generated by split. |
Modifier and Type | Method and Description |
---|---|
BulkFormat<RowData,FileSourceSplit> |
OrcFileFormatFactory.OrcBulkDecodingFormat.createRuntimeDecoder(DynamicTableSource.Context sourceContext,
DataType producedDataType,
int[][] projections) |
Modifier and Type | Method and Description |
---|---|
static <SplitT extends FileSourceSplit> |
OrcNoHiveColumnarRowInputFormat.createPartitionedFormat(org.apache.hadoop.conf.Configuration hadoopConfig,
RowType tableType,
List<String> partitionKeys,
PartitionFieldExtractor<SplitT> extractor,
int[] selectedFields,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize,
Function<RowType,TypeInformation<RowData>> rowTypeInfoFactory)
Create a partitioned
OrcColumnarRowInputFormat , the partition columns can be
generated by split. |
Modifier and Type | Interface and Description |
---|---|
interface |
ColumnBatchFactory<BatchT,SplitT extends FileSourceSplit>
Interface to create
VectorizedColumnBatch . |
Copyright © 2014–2024 The Apache Software Foundation. All rights reserved.