public class ParquetRowInputFormat extends ParquetInputFormat<Row> implements ResultTypeQueryable<Row>
ParquetInputFormat
to read Row
records from Parquet files.FileInputFormat.FileBaseStatistics, FileInputFormat.InputSplitOpenThread
PARQUET_SKIP_CORRUPTED_RECORD, PARQUET_SKIP_WRONG_SCHEMA_SPLITS
currentSplit, ENUMERATE_NESTED_FILES_FLAG, enumerateNestedFiles, filePath, INFLATER_INPUT_STREAM_FACTORIES, minSplitSize, numSplits, openTimeout, READ_WHOLE_SPLIT_FLAG, splitLength, splitStart, stream, unsplittable
Constructor and Description |
---|
ParquetRowInputFormat(Path path,
org.apache.parquet.schema.MessageType messageType) |
Modifier and Type | Method and Description |
---|---|
protected Row |
convert(Row row)
This ParquetInputFormat read parquet record as Row by default.
|
TypeInformation<Row> |
getProducedType()
Gets the data type (as a
TypeInformation ) produced by this function or input format. |
close, configure, getCurrentState, getFieldNames, getFieldTypes, getPredicate, nextRecord, open, reachedEnd, reopen, selectFields, setFilterPredicate
acceptFile, createInputSplits, decorateInputStream, extractFileExtension, getFilePath, getFilePaths, getFileStats, getFileStats, getInflaterInputStreamFactory, getInputSplitAssigner, getMinSplitSize, getNestedFileEnumeration, getNumSplits, getOpenTimeout, getSplitLength, getSplitStart, getStatistics, registerInflaterInputStreamFactory, setFilePath, setFilePath, setFilePaths, setFilePaths, setFilesFilter, setMinSplitSize, setNestedFileEnumeration, setNumSplits, setOpenTimeout, supportsMultiPaths, testForUnsplittable, toString
closeInputFormat, getRuntimeContext, openInputFormat, setRuntimeContext
public ParquetRowInputFormat(Path path, org.apache.parquet.schema.MessageType messageType)
public TypeInformation<Row> getProducedType()
ResultTypeQueryable
TypeInformation
) produced by this function or input format.getProducedType
in interface ResultTypeQueryable<Row>
protected Row convert(Row row)
ParquetInputFormat
convert
in class ParquetInputFormat<Row>
row
- row read from parquet fileCopyright © 2014–2020 The Apache Software Foundation. All rights reserved.