@Public public final class HadoopReduceCombineFunction<KEYIN,VALUEIN,KEYOUT,VALUEOUT> extends RichGroupReduceFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYOUT,VALUEOUT>> implements GroupCombineFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYIN,VALUEIN>>, ResultTypeQueryable<Tuple2<KEYOUT,VALUEOUT>>, Serializable
Constructor and Description |
---|
HadoopReduceCombineFunction(org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYOUT,VALUEOUT> hadoopReducer,
org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYIN,VALUEIN> hadoopCombiner)
Maps two Hadoop Reducer (mapred API) to a combinable Flink GroupReduceFunction.
|
HadoopReduceCombineFunction(org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYOUT,VALUEOUT> hadoopReducer,
org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYIN,VALUEIN> hadoopCombiner,
org.apache.hadoop.mapred.JobConf conf)
Maps two Hadoop Reducer (mapred API) to a combinable Flink GroupReduceFunction.
|
Modifier and Type | Method and Description |
---|---|
void |
combine(Iterable<Tuple2<KEYIN,VALUEIN>> values,
Collector<Tuple2<KEYIN,VALUEIN>> out)
The combine method, called (potentially multiple timed) with subgroups of elements.
|
TypeInformation<Tuple2<KEYOUT,VALUEOUT>> |
getProducedType()
Gets the data type (as a
TypeInformation ) produced by this function or input format. |
void |
open(OpenContext openContext)
Initialization method for the function.
|
void |
reduce(Iterable<Tuple2<KEYIN,VALUEIN>> values,
Collector<Tuple2<KEYOUT,VALUEOUT>> out)
The reduce method.
|
close, getIterationRuntimeContext, getRuntimeContext, open, setRuntimeContext
public HadoopReduceCombineFunction(org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYOUT,VALUEOUT> hadoopReducer, org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYIN,VALUEIN> hadoopCombiner)
hadoopReducer
- The Hadoop Reducer that is mapped to a GroupReduceFunction.hadoopCombiner
- The Hadoop Reducer that is mapped to the combiner function.public HadoopReduceCombineFunction(org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYOUT,VALUEOUT> hadoopReducer, org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYIN,VALUEIN> hadoopCombiner, org.apache.hadoop.mapred.JobConf conf)
hadoopReducer
- The Hadoop Reducer that is mapped to a GroupReduceFunction.hadoopCombiner
- The Hadoop Reducer that is mapped to the combiner function.conf
- The JobConf that is used to configure both Hadoop Reducers.@PublicEvolving public void open(OpenContext openContext) throws Exception
RichFunction
The openContext object passed to the function can be used for configuration and initialization. The openContext contains some necessary information that were configured on the function in the program composition.
public class MyFilter extends RichFilterFunction<String> {
private String searchString;
public void open(OpenContext openContext) {
// initialize the value of searchString
}
public boolean filter(String value) {
return value.equals(searchString);
}
}
By default, this method does nothing.
1. If you implement open(OpenContext openContext)
, the open(OpenContext
openContext)
will be invoked and the open(Configuration parameters)
won't be
invoked. 2. If you don't implement open(OpenContext openContext)
, the open(Configuration parameters)
will be invoked in the default implementation of the open(OpenContext openContext)
.
open
in interface RichFunction
openContext
- The context containing information about the context in which the function
is opened.Exception
- Implementations may forward exceptions, which are caught by the runtime.
When the runtime catches an exception, it aborts the task and lets the fail-over logic
decide whether to retry the task execution.public void reduce(Iterable<Tuple2<KEYIN,VALUEIN>> values, Collector<Tuple2<KEYOUT,VALUEOUT>> out) throws Exception
GroupReduceFunction
reduce
in interface GroupReduceFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYOUT,VALUEOUT>>
reduce
in class RichGroupReduceFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYOUT,VALUEOUT>>
values
- All records that belong to the given input key.out
- The collector to hand results to.Exception
- This method may throw exceptions. Throwing an exception will cause the
operation to fail and may trigger recovery.public void combine(Iterable<Tuple2<KEYIN,VALUEIN>> values, Collector<Tuple2<KEYIN,VALUEIN>> out) throws Exception
GroupCombineFunction
combine
in interface GroupCombineFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYIN,VALUEIN>>
values
- The elements to be combined.out
- The collector to use to return values from the function.Exception
- The function may throw Exceptions, which will cause the program to cancel,
and may trigger the recovery logic.public TypeInformation<Tuple2<KEYOUT,VALUEOUT>> getProducedType()
ResultTypeQueryable
TypeInformation
) produced by this function or input format.getProducedType
in interface ResultTypeQueryable<Tuple2<KEYOUT,VALUEOUT>>
Copyright © 2014–2024 The Apache Software Foundation. All rights reserved.