Ctrl+K
Logo image Logo image

Site Navigation

  • API Reference
  • Examples

Site Navigation

  • API Reference
  • Examples

Section Navigation

  • PyFlink Table
  • PyFlink DataStream
    • StreamExecutionEnvironment
    • DataStream
    • Functions
    • State
    • Timer
    • Window
    • Checkpoint
    • Side Outputs
    • Connectors
    • Formats
  • PyFlink Common

pyflink.datastream.data_stream.AllWindowedStream.side_output_late_data#

AllWindowedStream.side_output_late_data(output_tag: pyflink.datastream.output_tag.OutputTag) → pyflink.datastream.data_stream.AllWindowedStream[source]#

Send late arriving data to the side output identified by the given OutputTag. Data is considered late after the watermark has passed the end of the window plus the allowed lateness set using allowed_lateness().

You can get the stream of late data using get_side_output() on the DataStream resulting from the windowed operation with the same OutputTag.

Example:

>>> tag = OutputTag("late-data", Types.TUPLE([Types.INT(), Types.STRING()]))
>>> main_stream = ds.window_all(TumblingEventTimeWindows.of(Time.seconds(5))) \
...                 .side_output_late_data(tag) \
...                 .process(MyProcessAllWindowFunction(),
...                          Types.TUPLE([Types.LONG(), Types.LONG(), Types.INT()]))
>>> late_stream = main_stream.get_side_output(tag)

previous

pyflink.datastream.data_stream.AllWindowedStream.allowed_lateness

next

pyflink.datastream.data_stream.AllWindowedStream.reduce

Show Source

Created using Sphinx 4.5.0.