Program to test a large chunk of DataSet API operators and primitives: Map, FlatMap, Filter GroupReduce, Reduce Join CoGroup BulkIteration Different key definitions (position, name, KeySelector)
InputFormat that generates a deterministic DataSet of Tuple2(String, Integer) String: key, can be repeated. Integer: uniformly distributed int between 0 and 127
Copyright © 2014–2019 The Apache Software Foundation. All rights reserved.