
MINOR: Make ByteUtilsBenchmark deterministic #13307

Closed

Conversation

divijvaidya (Contributor)

Motivation

The current implementation of the benchmark may produce a different set of integers for each benchmark method and hence may not provide an apples-to-apples comparison.

Changes

  1. With this change, we initialize the random number generator with a fixed seed at the start of a benchmark. This ensures that the generator produces the same sequence of random values for every benchmark, so the input data is consistent across benchmarks, providing a reliable apples-to-apples comparison.
  2. We ensure that a new set of random numbers is generated per iteration, so that the benchmark calculation is performed over a diverse range of values (see the sketch after this list).
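
As an illustration, here is a minimal JMH sketch of the two changes above. It is not the actual PR diff: the class name, seed, and input size are hypothetical, and sizeOfUnsignedVarint is a self-contained stand-in for the org.apache.kafka.common.utils.ByteUtils method of the same name.

import java.util.Random;

import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.Level;
import org.openjdk.jmh.annotations.Scope;
import org.openjdk.jmh.annotations.Setup;
import org.openjdk.jmh.annotations.State;

@State(Scope.Benchmark)
public class ByteUtilsBenchmarkSketch {
    // Hypothetical values; the real benchmark may use different sizes.
    private static final long FIXED_SEED = 42;
    private static final int INPUT_SIZE = 1024;

    private Random random;
    private int[] input;

    @Setup(Level.Trial)
    public void setUpTrial() {
        // Change 1: a fixed seed makes the generator emit the same
        // sequence in every benchmark method and every run.
        random = new Random(FIXED_SEED);
    }

    @Setup(Level.Iteration)
    public void setUpIteration() {
        // Change 2: fresh values per iteration give a diverse sample set,
        // while the shared seeded Random keeps the sequence reproducible.
        input = new int[INPUT_SIZE];
        for (int i = 0; i < INPUT_SIZE; i++) {
            input[i] = random.nextInt();
        }
    }

    @Benchmark
    public int testSizeOfUnsignedVarint() {
        // Returning the accumulated result prevents dead-code elimination.
        int total = 0;
        for (int value : input) {
            total += sizeOfUnsignedVarint(value);
        }
        return total;
    }

    // Stand-in so the sketch compiles on its own; mirrors the standard
    // unsigned-varint size computation (7 payload bits per byte).
    private static int sizeOfUnsignedVarint(int value) {
        int bytes = 1;
        while ((value & 0xffffff80) != 0) {
            bytes += 1;
            value >>>= 7;
        }
        return bytes;
    }
}

Because the state is re-seeded at Level.Trial, every benchmark method and every run consumes the identical sequence of integers, yet each measurement iteration within a trial still draws a new batch.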

Sample result of a benchmark run:

Benchmark                                           Mode  Cnt     Score   Error  Units
ByteUtilsBenchmark.testSizeOfUnsignedVarint        thrpt   30  1302.193 ± 0.396  ops/s
ByteUtilsBenchmark.testSizeOfUnsignedVarintSimple  thrpt   30   328.678 ± 0.269  ops/s
ByteUtilsBenchmark.testSizeOfVarlong               thrpt   30   880.113 ± 0.676  ops/s
ByteUtilsBenchmark.testSizeOfVarlongSimple         thrpt   30   109.592 ± 0.071  ops/s
JMH benchmarks done

@showuon showuon (Contributor) left a comment

Thanks for the improvement! The first point makes sense to me. But for the second point:

We ensure that a new set of random numbers is generated per iteration, so that the benchmark calculation is performed over a diverse range of values

I didn't understand why we need this change. Could you elaborate on it? From what I can see, it should make no difference whether we use the same values for each iteration or not.

Thank you.

@divijvaidya (Contributor, Author)

I didn't understand why we need this change. Could you elaborate on it? From what I can see, it should make no difference whether we use the same values for each iteration or not.

Having different values for each iteration gives us the ability to benchmark over a more diverse sample set. This increases the probability that the algorithm we are testing is optimal over a larger range of values.
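
To make that concrete, here is a tiny sketch (not from the PR, with an arbitrary seed of 42). Two identically seeded generators stand in for two benchmark runs: each iteration draws a fresh batch of values (diversity), but both runs draw the same batches (determinism).

import java.util.Arrays;
import java.util.Random;

public class SeededDiversityDemo {
    public static void main(String[] args) {
        Random runA = new Random(42); // stands in for benchmark run A
        Random runB = new Random(42); // stands in for benchmark run B
        for (int iteration = 0; iteration < 3; iteration++) {
            int[] batchA = runA.ints(4).toArray();
            int[] batchB = runB.ints(4).toArray();
            // Different values each iteration, identical across runs.
            System.out.println("iteration " + iteration + ": "
                    + Arrays.toString(batchA)
                    + " equal across runs: " + Arrays.equals(batchA, batchB));
        }
    }
}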

@divijvaidya (Contributor, Author)

@showuon I am discarding this PR as I have included its changes in #13312
