DataStream#
A DataStream represents a stream of elements of the same type.
Gets the name of the current data stream.
Sets the name of the current data stream.
Sets an ID for this operator.
Sets a user-provided hash for this operator.
Sets the parallelism for this operator.
Sets the maximum parallelism of this operator.
Gets the type of the stream.
Returns the StreamExecutionEnvironment that was used to create this DataStream.
Sets the parallelism and maximum parallelism of this operator to one.
Sets the buffering timeout for data produced by this operation.
Starts a new task chain beginning at this operator.
Turns off chaining for this operator so thread co-location will not be used as an optimization.
Sets the slot sharing group of this operation.
Sets the description for this operator.
Applies a Map transformation on a DataStream.
Applies a FlatMap transformation on a DataStream.
Creates a new KeyedStream that uses the provided key for partitioning its operator states.
Applies a Filter transformation on a DataStream.
Creates a new DataStream by merging DataStream outputs of the same type with each other.
Creates a new 'ConnectedStreams' by connecting 'DataStream' outputs of (possibly) different types with each other.
Sets the partitioning of the DataStream so that the output elements are shuffled uniformly at random to the next operation.
Initiates a Project transformation on a Tuple DataStream.
Sets the partitioning of the DataStream so that the output elements are distributed evenly to a subset of instances of the next operation in a round-robin fashion.
Sets the partitioning of the DataStream so that the output elements are distributed evenly to instances of the next operation in a round-robin fashion.
Sets the partitioning of the DataStream so that the output elements are forwarded to the local sub-task of the next operation.
Sets the partitioning of the DataStream so that the output elements are broadcast to every parallel instance of the next operation.
Applies the given ProcessFunction on the input stream, thereby creating a transformed output stream.
Assigns timestamps to the elements in the data stream and generates watermarks to signal event time progress.
Partitions a DataStream on the key returned by the selector, using a custom partitioner.
Adds the given sink to this DataStream.
Adds the given sink to this DataStream.
Triggers the distributed execution of the streaming dataflow and returns an iterator over the elements of the given DataStream.
Writes a DataStream to the standard output stream (stdout).
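The partitioning descriptions in this section (round-robin to all instances, round-robin to a subset, forwarding to the local sub-task, and broadcasting) can be hard to tell apart from prose alone. The following is a plain-Python model of those four routing behaviours, not PyFlink code: the function names and the dictionary-of-lists representation are assumptions made for illustration, and details such as which channel the runtime starts from are ignored.

```python
from typing import Dict, List


def rebalance(elements: List[str], downstream: int) -> Dict[int, List[str]]:
    """Round-robin over every downstream instance."""
    out: Dict[int, List[str]] = {i: [] for i in range(downstream)}
    for n, element in enumerate(elements):
        out[n % downstream].append(element)
    return out


def rescale(elements: List[str], upstream_index: int,
            upstream: int, downstream: int) -> Dict[int, List[str]]:
    """Round-robin over only the downstream subset owned by one upstream instance.

    Assumption for this sketch: downstream is a multiple of upstream.
    """
    per_upstream = downstream // upstream
    first = upstream_index * per_upstream
    out: Dict[int, List[str]] = {i: [] for i in range(first, first + per_upstream)}
    for n, element in enumerate(elements):
        out[first + n % per_upstream].append(element)
    return out


def forward(elements: List[str], upstream_index: int) -> Dict[int, List[str]]:
    """Every element goes to the downstream sub-task with the same index."""
    return {upstream_index: list(elements)}


def broadcast(elements: List[str], downstream: int) -> Dict[int, List[str]]:
    """Every downstream instance receives a copy of every element."""
    return {i: list(elements) for i in range(downstream)}
```

With two upstream and four downstream instances, `rescale(["a", "b", "c"], 1, 2, 4)` only ever touches instances 2 and 3, whereas `rebalance` would spread the same elements over all four.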
DataStreamSink#
A Stream Sink. This is used for emitting elements from a streaming topology.
Sets the name of this sink.
Sets an ID for this operator.
Sets a user-provided hash for this operator.
Sets the parallelism for this operator.
Sets the description for this sink.
Turns off chaining for this operator so thread co-location will not be used as an optimization.
Sets the slot sharing group of this operation.
KeyedStream#
A KeyedStream represents a DataStream on which operator state is partitioned by key. Reduce-style operations work on elements that share the same key.
Applies a Map transformation on a KeyedStream.
Applies a FlatMap transformation on a KeyedStream.
Applies a reduce transformation on the data stream grouped by the given key.
Applies a Filter transformation on a DataStream.
Adds the given sink to this DataStream.
Creates a new KeyedStream that uses the provided key for partitioning its operator states.
Applies the given ProcessFunction on the input stream, thereby creating a transformed output stream.
Windows this data stream to a WindowedStream, which evaluates windows over a key grouped stream.
Creates a new DataStream by merging DataStream outputs of the same type with each other.
Creates a new 'ConnectedStreams' by connecting 'DataStream' outputs of (possibly) different types with each other.
Partitions a DataStream on the key returned by the selector, using a custom partitioner.
Writes a DataStream to the standard output stream (stdout).
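To make the keyed reduce listed in this section concrete, here is a plain-Python sketch of a rolling reduce over per-key state. It is a model of the semantics rather than PyFlink code: `keyed_reduce`, its parameters, and the in-memory dictionary standing in for keyed operator state are all assumptions made for this illustration.

```python
from typing import Callable, Dict, Hashable, Iterable, Iterator, Tuple

Event = Tuple[str, int]


def keyed_reduce(
    elements: Iterable[Event],
    key_selector: Callable[[Event], Hashable],
    reduce_fn: Callable[[Event, Event], Event],
) -> Iterator[Event]:
    """Emit the running reduce result for the key of each incoming element."""
    state: Dict[Hashable, Event] = {}  # stand-in for per-key operator state
    for element in elements:
        key = key_selector(element)
        if key in state:
            # Combine the previous result for this key with the new element.
            state[key] = reduce_fn(state[key], element)
        else:
            # The first element for a key is emitted unchanged.
            state[key] = element
        yield state[key]


events = [("a", 1), ("b", 2), ("a", 3), ("b", 4)]
running_sums = list(
    keyed_reduce(events, lambda e: e[0], lambda x, y: (x[0], x[1] + y[1]))
)
# running_sums == [("a", 1), ("b", 2), ("a", 4), ("b", 6)]
```

Elements with different keys never interact: the state for key `"a"` is updated only by elements whose key is `"a"`.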
WindowedStream#
A WindowedStream represents a data stream where elements are grouped by key, and for each key, the stream of elements is split into windows based on a WindowAssigner. Window emission is triggered based on a Trigger.
The windows are conceptually evaluated for each key individually, meaning windows can trigger at different points for each key.
Note that the WindowedStream is purely an API construct. At runtime, the WindowedStream is collapsed together with the KeyedStream and the operation over the window into one single operation.
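The per-key evaluation described above can be sketched in plain Python. This is a simplified model, not PyFlink code: it assumes a tumbling (fixed-size, non-overlapping) assigner, evaluates all windows in one batch instead of firing a Trigger incrementally, and uses hypothetical names (`tumbling_window_start`, `evaluate_windows`) throughout.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple

# (key, timestamp in milliseconds, value)
Event = Tuple[str, int, int]


def tumbling_window_start(timestamp_ms: int, size_ms: int) -> int:
    """Start of the fixed-size window that contains the timestamp."""
    return timestamp_ms - (timestamp_ms % size_ms)


def evaluate_windows(
    events: Iterable[Event],
    size_ms: int,
    window_fn: Callable[[List[int]], int],
) -> Dict[Tuple[str, int], int]:
    """Group values by (key, window start), then apply window_fn to each group."""
    panes: Dict[Tuple[str, int], List[int]] = defaultdict(list)
    for key, timestamp_ms, value in events:
        panes[(key, tumbling_window_start(timestamp_ms, size_ms))].append(value)
    return {pane: window_fn(values) for pane, values in panes.items()}


events = [("a", 1000, 1), ("a", 4000, 2), ("b", 2000, 10), ("a", 6000, 3)]
sums = evaluate_windows(events, 5000, sum)
# sums == {("a", 0): 3, ("a", 5000): 3, ("b", 0): 10}
```

Each result is addressed by both key and window start, which reflects the point that windows are evaluated for each key individually.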
Sets the Trigger that should be used to trigger window emission.
Sets the time by which elements are allowed to be late.
Applies the given window function to each window.
Applies the given window function to each window.
ConnectedStreams#
ConnectedStreams represent two connected streams of (possibly) different data types. Connected streams are useful for cases where operations on one stream directly affect the operations on the other stream, usually via shared state between the streams.
An example for the use of connected streams would be to apply rules that change over time onto another stream. One of the connected streams has the rules, the other stream the elements to apply the rules to. The operation on the connected stream maintains the current set of rules in the state. It may receive either a rule update and update the state or a data element and apply the rules in the state to the element.
The connected stream can be conceptually viewed as a union stream of an Either type, that holds either the first stream’s type or the second stream’s type.
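The rules example above can be modelled in plain Python. This is a sketch of the idea, not PyFlink code: the `RuleApplier` class, the threshold-style rules, and the `("rule", ...)` / `("element", ...)` tags (which stand in for the Either view of the connected stream) are all assumptions made for illustration.

```python
from typing import Dict, Iterable, List, Tuple

Tagged = Tuple[str, Tuple[str, int]]


class RuleApplier:
    """One operator with two inputs that share a single piece of state."""

    def __init__(self) -> None:
        self.rules: Dict[str, int] = {}  # shared state: metric name -> threshold

    def on_rule(self, rule: Tuple[str, int]) -> List[str]:
        """First input: a rule update only changes the state."""
        name, threshold = rule
        self.rules[name] = threshold
        return []

    def on_element(self, element: Tuple[str, int]) -> List[str]:
        """Second input: apply the current rules to a data element."""
        name, value = element
        threshold = self.rules.get(name)
        if threshold is not None and value > threshold:
            return [f"{name}: {value} exceeds {threshold}"]
        return []


def run(interleaved: Iterable[Tagged]) -> List[str]:
    """Dispatch each tagged record to the handler for its input."""
    applier = RuleApplier()
    output: List[str] = []
    for tag, payload in interleaved:
        handler = applier.on_rule if tag == "rule" else applier.on_element
        output.extend(handler(payload))
    return output


alerts = run([
    ("rule", ("cpu", 80)),
    ("element", ("cpu", 90)),
    ("element", ("cpu", 50)),
    ("rule", ("cpu", 40)),     # the rule changes over time
    ("element", ("cpu", 50)),
    ("element", ("mem", 99)),  # no rule for this metric yet
])
# alerts == ["cpu: 90 exceeds 80", "cpu: 50 exceeds 40"]
```

The same element `("cpu", 50)` produces no output under the first rule and an alert under the second, which is the effect of one stream changing how the other is processed.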
KeyBy operation for connected data stream.
Applies a CoMap transformation on a ConnectedStreams and maps the output to a common type.
Applies a CoFlatMap transformation on a ConnectedStreams and maps the output to a common type.