AggregateFunction (flink 1.3-SNAPSHOT API)

java.lang.Object
- org.apache.flink.table.functions.UserDefinedFunction
- - org.apache.flink.table.functions.AggregateFunction<T,ACC>

All Implemented Interfaces: Serializable

Direct Known Subclasses: BigIntegralAvgAggFunction, CountAggFunction, DecimalAvgAggFunction, DecimalSumAggFunction, DecimalSumWithRetractAggFunction, FloatingAvgAggFunction, IntegralAvgAggFunction, MaxAggFunction, MaxWithRetractAggFunction, MinAggFunction, MinWithRetractAggFunction, SumAggFunction, SumWithRetractAggFunction

public abstract class AggregateFunction<T,ACC>
extends UserDefinedFunction

Base class for User-Defined Aggregates.

The behavior of an AggregateFunction is defined by implementing a set of custom methods. An AggregateFunction requires at least three methods:
- createAccumulator,
- accumulate, and
- getValue.

The following methods are optional:
- retract,
- merge,
- resetAccumulator, and
- getAccumulatorType.

All of these methods must be declared public, must not be static, and must be named exactly as listed above. The methods createAccumulator and getValue are declared as abstract methods of AggregateFunction; the other methods are described below.


 Processes the input values and updates the provided accumulator instance. The method
 accumulate can be overloaded with different custom types and arguments. An AggregateFunction
 requires at least one accumulate() method.

 @param accumulator           the accumulator which contains the current aggregated results
 @param [user defined inputs] the input value (usually obtained from a newly arrived record).

 def accumulate(accumulator: ACC, [user defined inputs]): Unit
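
 As an illustration of the three required methods, the following Java sketch implements a weighted average. The names WeightedAvg and WeightedAvgAccum are hypothetical, and the small AggregateFunction class at the top is only a local stand-in that mirrors the two abstract methods documented on this page so that the example compiles without Flink; in a real job you would extend org.apache.flink.table.functions.AggregateFunction instead. The Scala-style signature above corresponds to a public, non-static void method in Java.

```java
// Local stand-in (assumption) so the sketch compiles without Flink on the classpath.
// With Flink, extend org.apache.flink.table.functions.AggregateFunction instead.
abstract class AggregateFunction<T, ACC> {
    public abstract ACC createAccumulator();
    public abstract T getValue(ACC accumulator);
}

// Hypothetical accumulator: running weighted sum and total weight.
class WeightedAvgAccum {
    public long sum = 0L;
    public long count = 0L;
}

// Hypothetical aggregate computing an integer weighted average.
class WeightedAvg extends AggregateFunction<Long, WeightedAvgAccum> {
    @Override
    public WeightedAvgAccum createAccumulator() {
        return new WeightedAvgAccum();
    }

    @Override
    public Long getValue(WeightedAvgAccum acc) {
        if (acc.count == 0) {
            return null; // nothing accumulated yet
        }
        return acc.sum / acc.count; // integer division
    }

    // Public, non-static, and named exactly "accumulate".
    public void accumulate(WeightedAvgAccum acc, long value, int weight) {
        acc.sum += value * weight;
        acc.count += weight;
    }

    // Overload with a different argument list: an unweighted value counts once.
    public void accumulate(WeightedAvgAccum acc, long value) {
        accumulate(acc, value, 1);
    }
}
```

 Calling accumulate(acc, 10, 1) and then accumulate(acc, 20, 3) on a fresh accumulator leaves sum = 70 and count = 4, so getValue returns 17 under integer division.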


 Retracts the input values from the accumulator instance. The current design assumes that the
 inputs are values that have previously been accumulated. The method retract can be
 overloaded with different custom types and arguments. This method must be implemented for
 bounded OVER aggregates on a DataStream.

 @param accumulator           the accumulator which contains the current aggregated results
 @param [user defined inputs] the input value to retract (assumed to have been accumulated previously).

 def retract(accumulator: ACC, [user defined inputs]): Unit
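
 To make the relationship between accumulate and retract concrete, here is a minimal Java sketch of a sum that supports retraction. RetractableSum and SumAccum are hypothetical names, and the AggregateFunction class shown is a local stand-in that mirrors only the two abstract methods documented on this page, so the block compiles without Flink. The sketch assumes, as the description above does, that every retracted value was accumulated earlier.

```java
// Local stand-in (assumption) so the sketch compiles without Flink on the classpath.
abstract class AggregateFunction<T, ACC> {
    public abstract ACC createAccumulator();
    public abstract T getValue(ACC accumulator);
}

// Hypothetical accumulator: running sum plus the number of live values.
class SumAccum {
    public long sum = 0L;
    public long count = 0L;
}

class RetractableSum extends AggregateFunction<Long, SumAccum> {
    @Override
    public SumAccum createAccumulator() {
        return new SumAccum();
    }

    @Override
    public Long getValue(SumAccum acc) {
        if (acc.count == 0) {
            return null; // every value has been retracted, or none was added
        }
        return acc.sum;
    }

    public void accumulate(SumAccum acc, long value) {
        acc.sum += value;
        acc.count += 1;
    }

    // Exact inverse of accumulate for the same arguments.
    public void retract(SumAccum acc, long value) {
        acc.sum -= value;
        acc.count -= 1;
    }
}
```

 For a window holding the two most recent rows of the inputs 1, 2, 3, accumulating 1 and 2 gives 3; when 3 arrives, accumulating 3 and retracting 1 gives 5. Tracking the count lets getValue distinguish an empty accumulator from a sum of zero.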


 Merges a group of accumulator instances into one accumulator instance. This method must be
 implemented for session window grouping aggregates on a DataStream and for grouping
 aggregates on a DataSet.

 @param accumulator  the accumulator which will hold the merged aggregate results. Note that
                     this accumulator may already contain previously aggregated results, so
                     the custom merge method must not replace or clear this instance.
 @param its          a java.lang.Iterable over the group of accumulators that will be
                     merged.
 
 def merge(accumulator: ACC, its: java.lang.Iterable[ACC]): Unit
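
 The requirement that merge must not replace or clear the target accumulator can be sketched in Java as follows. CountAgg and CountAccum are hypothetical names, and the AggregateFunction class is a local stand-in that mirrors only the two abstract methods documented on this page, so the block compiles without Flink.

```java
// Local stand-in (assumption) so the sketch compiles without Flink on the classpath.
abstract class AggregateFunction<T, ACC> {
    public abstract ACC createAccumulator();
    public abstract T getValue(ACC accumulator);
}

// Hypothetical accumulator holding a row count.
class CountAccum {
    public long count = 0L;
}

class CountAgg extends AggregateFunction<Long, CountAccum> {
    @Override
    public CountAccum createAccumulator() {
        return new CountAccum();
    }

    @Override
    public Long getValue(CountAccum acc) {
        return acc.count;
    }

    // Counts non-null inputs.
    public void accumulate(CountAccum acc, Object value) {
        if (value != null) {
            acc.count += 1;
        }
    }

    // Adds the other accumulators into the target. The target's existing
    // count is kept, because it may already hold earlier results.
    public void merge(CountAccum acc, Iterable<CountAccum> its) {
        for (CountAccum other : its) {
            acc.count += other.count;
        }
    }
}
```

 If the target accumulator already holds a count of 2 and the iterable yields accumulators with counts 3 and 4, the target holds 9 after merge returns.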


 Resets the accumulator for this AggregateFunction. This method must be implemented for
 grouping aggregates on a DataSet.

 @param accumulator  the accumulator which needs to be reset
 
 def resetAccumulator(accumulator: ACC): Unit
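
 A resetAccumulator implementation might look like the following Java sketch of a maximum aggregate. MaxAgg and MaxAccum are hypothetical names, and the AggregateFunction class is a local stand-in that mirrors only the two abstract methods documented on this page, so the block compiles without Flink. Restoring the same state that createAccumulator produces is a design choice of this sketch, not something this page specifies.

```java
// Local stand-in (assumption) so the sketch compiles without Flink on the classpath.
abstract class AggregateFunction<T, ACC> {
    public abstract ACC createAccumulator();
    public abstract T getValue(ACC accumulator);
}

// Hypothetical accumulator: current maximum and whether any value was seen.
class MaxAccum {
    public long max = Long.MIN_VALUE;
    public boolean hasValue = false;
}

class MaxAgg extends AggregateFunction<Long, MaxAccum> {
    @Override
    public MaxAccum createAccumulator() {
        return new MaxAccum();
    }

    @Override
    public Long getValue(MaxAccum acc) {
        if (!acc.hasValue) {
            return null; // no input seen since creation or the last reset
        }
        return acc.max;
    }

    public void accumulate(MaxAccum acc, long value) {
        if (!acc.hasValue || value > acc.max) {
            acc.max = value;
            acc.hasValue = true;
        }
    }

    // Returns the accumulator to the state produced by createAccumulator,
    // mutating the given instance in place.
    public void resetAccumulator(MaxAccum acc) {
        acc.max = Long.MIN_VALUE;
        acc.hasValue = false;
    }
}
```

 After a reset, the same accumulator instance can be reused for the next group: values seen before the reset no longer influence the result.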


 Returns the org.apache.flink.api.common.typeinfo.TypeInformation of the accumulator. This
 method is optional and can be implemented if the accumulator type cannot be inferred
 automatically from the instance returned by the createAccumulator method.

 @return  the type information for the accumulator.
 
 def getAccumulatorType: TypeInformation[_]


 Returns the org.apache.flink.api.common.typeinfo.TypeInformation of the return value. This
 method is optional and is needed only when Flink's type extraction facilities are not sufficient
 to extract the TypeInformation. Flink's type extraction facilities can handle basic types and
 simple POJOs but may produce wrong results for more complex, custom, or composite types.

 @return  the type information for the return value.

 def getResultType: TypeInformation[_]

See Also: Serialized Form

Constructor Summary

Constructors
Constructor and Description

AggregateFunction()

Method Summary

Modifier and Type	Method and Description
`abstract ACC`	`createAccumulator()` Creates and initializes the accumulator for this `AggregateFunction`.
`abstract T`	`getValue(ACC accumulator)` Called every time an aggregation result should be materialized.
`boolean`	`requiresOver()` Returns whether this aggregate can only be used in an OVER clause.

Methods inherited from class org.apache.flink.table.functions.UserDefinedFunction
close, functionIdentifier, open

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Constructor Detail
  - AggregateFunction
```
public AggregateFunction()
```
- Method Detail
  - createAccumulator
```
public abstract ACC createAccumulator()
```
    Creates and initializes the accumulator for this AggregateFunction.
    
    Returns:
    
    the accumulator with the initial value
  - getValue
```
public abstract T getValue(ACC accumulator)
```
    Called every time an aggregation result should be materialized. The returned value can be either an early, incomplete result (emitted periodically as data arrives) or the final result of the aggregation.
    
    Parameters:
    
    accumulator - the accumulator which contains the current aggregated results
    
    Returns:
    
    the aggregation result
  - requiresOver
```
public boolean requiresOver()
```
    Returns whether this aggregate can only be used in an OVER clause.
    
    Returns:
    
    true if this aggregate can only be used in an OVER clause, false otherwise
