Data structure that encapsulates statistical information of data that has only been processed by one pass
This statistical information is meant to help determine the distribution of the data that has been processed
in an operator so that we can determine if it is necessary to repartition the data
The statistics to be gathered are configurable and represented by a
OperatorStatisticsConfig
object.
The information encapsulated in this class is min, max, a structure enabling estimation of count distinct and a
structure holding the heavy hitters along with their frequency.