Class TextLineInputFormat

  • All Implemented Interfaces:
    Serializable, ResultTypeQueryable<String>, StreamFormat<String>

    @PublicEvolving
    public class TextLineInputFormat
    extends SimpleStreamFormat<String>
    A reader format that text lines from a file.

    The reader uses Java's built-in InputStreamReader to decode the byte stream using various supported charset encodings.

    This format does not support optimized recovery from checkpoints. On recovery, it will re-read and discard the number of lined that were processed before the last checkpoint. That is due to the fact that the offsets of lines in the file cannot be tracked through the charset decoders with their internal buffering of stream input and charset decoder state.

    See Also:
    Serialized Form