Program to test a large chunk of DataSet API operators and primitives:
- Map, FlatMap, Filter
- GroupReduce, Reduce
- Join
- CoGroup
- BulkIteration
- Different key definitions (position, name, KeySelector)
Program parameters:
- loadFactor (int): controls generated data volume. Does not affect result.
- outputPath (String): path to write the result
- infinite (Boolean): if set to true one of the sources will be infinite. The job will never
end. (default: false(