@Public public final class HadoopReduceCombineFunction<KEYIN,VALUEIN,KEYOUT,VALUEOUT> extends RichGroupReduceFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYOUT,VALUEOUT>> implements GroupCombineFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYIN,VALUEIN>>, ResultTypeQueryable<Tuple2<KEYOUT,VALUEOUT>>, Serializable
Constructor and Description |
---|
HadoopReduceCombineFunction(org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYOUT,VALUEOUT> hadoopReducer,
org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYIN,VALUEIN> hadoopCombiner)
Maps two Hadoop Reducer (mapred API) to a combinable Flink GroupReduceFunction.
|
HadoopReduceCombineFunction(org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYOUT,VALUEOUT> hadoopReducer,
org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYIN,VALUEIN> hadoopCombiner,
org.apache.hadoop.mapred.JobConf conf)
Maps two Hadoop Reducer (mapred API) to a combinable Flink GroupReduceFunction.
|
Modifier and Type | Method and Description |
---|---|
void |
combine(Iterable<Tuple2<KEYIN,VALUEIN>> values,
Collector<Tuple2<KEYIN,VALUEIN>> out)
The combine method, called (potentially multiple timed) with subgroups of elements.
|
TypeInformation<Tuple2<KEYOUT,VALUEOUT>> |
getProducedType()
Gets the data type (as a
TypeInformation ) produced by this function or input format. |
void |
open(Configuration parameters)
Initialization method for the function.
|
void |
reduce(Iterable<Tuple2<KEYIN,VALUEIN>> values,
Collector<Tuple2<KEYOUT,VALUEOUT>> out)
The reduce method.
|
close, getIterationRuntimeContext, getRuntimeContext, setRuntimeContext
public HadoopReduceCombineFunction(org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYOUT,VALUEOUT> hadoopReducer, org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYIN,VALUEIN> hadoopCombiner)
hadoopReducer
- The Hadoop Reducer that is mapped to a GroupReduceFunction.hadoopCombiner
- The Hadoop Reducer that is mapped to the combiner function.public HadoopReduceCombineFunction(org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYOUT,VALUEOUT> hadoopReducer, org.apache.hadoop.mapred.Reducer<KEYIN,VALUEIN,KEYIN,VALUEIN> hadoopCombiner, org.apache.hadoop.mapred.JobConf conf)
hadoopReducer
- The Hadoop Reducer that is mapped to a GroupReduceFunction.hadoopCombiner
- The Hadoop Reducer that is mapped to the combiner function.conf
- The JobConf that is used to configure both Hadoop Reducers.public void open(Configuration parameters) throws Exception
RichFunction
The configuration object passed to the function can be used for configuration and initialization. The configuration contains all parameters that were configured on the function in the program composition.
public class MyFilter extends RichFilterFunction<String> {
private String searchString;
public void open(Configuration parameters) {
this.searchString = parameters.getString("foo");
}
public boolean filter(String value) {
return value.equals(searchString);
}
}
By default, this method does nothing.
open
in interface RichFunction
open
in class AbstractRichFunction
parameters
- The configuration containing the parameters attached to the contract.Exception
- Implementations may forward exceptions, which are caught by the runtime.
When the runtime catches an exception, it aborts the task and lets the fail-over logic
decide whether to retry the task execution.Configuration
public void reduce(Iterable<Tuple2<KEYIN,VALUEIN>> values, Collector<Tuple2<KEYOUT,VALUEOUT>> out) throws Exception
GroupReduceFunction
reduce
in interface GroupReduceFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYOUT,VALUEOUT>>
reduce
in class RichGroupReduceFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYOUT,VALUEOUT>>
values
- All records that belong to the given input key.out
- The collector to hand results to.Exception
- This method may throw exceptions. Throwing an exception will cause the
operation to fail and may trigger recovery.public void combine(Iterable<Tuple2<KEYIN,VALUEIN>> values, Collector<Tuple2<KEYIN,VALUEIN>> out) throws Exception
GroupCombineFunction
combine
in interface GroupCombineFunction<Tuple2<KEYIN,VALUEIN>,Tuple2<KEYIN,VALUEIN>>
values
- The elements to be combined.out
- The collector to use to return values from the function.Exception
- The function may throw Exceptions, which will cause the program to cancel,
and may trigger the recovery logic.public TypeInformation<Tuple2<KEYOUT,VALUEOUT>> getProducedType()
ResultTypeQueryable
TypeInformation
) produced by this function or input format.getProducedType
in interface ResultTypeQueryable<Tuple2<KEYOUT,VALUEOUT>>
Copyright © 2014–2021 The Apache Software Foundation. All rights reserved.