
Make it possible to perform more than one benchmark per pytest test #166

Open
ntninja opened this issue Apr 10, 2020 · 6 comments

Comments

@ntninja

ntninja commented Apr 10, 2020

Say, I have a test function like this:

import pytest
import trio

# bench_trio_write_100_files_1K_serial and bench_fsds_write_100_files_1K_serial
# are async benchmark bodies defined elsewhere in my test suite.

@pytest.mark.benchmark(group="write_100_files_1K_serial")
def test_bench_write_100_files_1K_serial(temp_path, benchmark1, benchmark2):
    benchmark1.name = "trio"
    benchmark1(trio.run, bench_trio_write_100_files_1K_serial, temp_path)

    benchmark2.name = "datastore"
    benchmark2(trio.run, bench_fsds_write_100_files_1K_serial, temp_path)

    assert benchmark2.stats.stats.median < (2 * benchmark1.stats.stats.median)

Since both of these benchmark calls are I/O bound (or should be, anyway – different story), I cannot compare them against fixed values. Instead, I'd like to compare the relative slow-down/speed-up of my code against some reference code – that is what the assert at the end of the test does.

And while the above code works flawlessly, it only does so because of some private API usage:

import pytest
import pytest_benchmark.plugin

# Each fixture calls the function wrapped by pytest-benchmark's "benchmark"
# fixture directly, so the test receives two independent benchmark objects.
@pytest.fixture(scope="function")
def benchmark1(request):
    return pytest_benchmark.plugin.benchmark.__pytest_wrapped__.obj(request)

@pytest.fixture(scope="function")
def benchmark2(request):
    return pytest_benchmark.plugin.benchmark.__pytest_wrapped__.obj(request)

See also pytest-dev/pytest#2703 for the pytest-side limitation. The “official solution” recommended by pytest is to turn such fixtures into factory functions. Would you be comfortable exposing something like that as part of this library?
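
For reference, a factory-function version of the same workaround might look roughly like this (a rough, untested sketch that still relies on the same private API; benchmark_maker is just a placeholder name):

import pytest
import pytest_benchmark.plugin
import trio

@pytest.fixture(scope="function")
def benchmark_maker(request):
    # Factory fixture: every call produces a fresh, independent benchmark
    # object by invoking the function wrapped by the "benchmark" fixture.
    def make():
        return pytest_benchmark.plugin.benchmark.__pytest_wrapped__.obj(request)
    return make

def test_bench_write_100_files_1K_serial(temp_path, benchmark_maker):
    benchmark1 = benchmark_maker()
    benchmark1.name = "trio"
    benchmark1(trio.run, bench_trio_write_100_files_1K_serial, temp_path)

    benchmark2 = benchmark_maker()
    benchmark2.name = "datastore"
    benchmark2(trio.run, bench_fsds_write_100_files_1K_serial, temp_path)

    assert benchmark2.stats.stats.median < (2 * benchmark1.stats.stats.median)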

@ionelmc
Owner

ionelmc commented May 10, 2020

Well, I guess we could have a make_benchmark or benchmark_setup (pytest-django style) fixture ...

I still don't get your use case. You only need this to compare and assert the relative results of two benchmarks?

@patrick91

@ionelmc I might have a use case for this: I'm rewriting an API and I'd like to compare its performance with the previous API to make sure the new one is not slower. I'm doing this with fixtures at the moment, but maybe calling the benchmark function twice and checking the times would be better :)

@ionelmc
Owner

ionelmc commented Nov 2, 2020

@patrick91 perhaps you could use one of the hooks (e.g. pytest_benchmark_update_json) to make some assertions on the results?

Or perhaps pytest_benchmark_group_stats if you compare to past data?

I doubt the plugin could have a nicer way to deal with your use case, as there are so many ways of looking at and doing things with the data. That's why the plugin has options to output JSON in the first place.
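
As a rough sketch (in conftest.py; the benchmark names and the 2x threshold are just placeholders), an assertion in that hook might look like:

# conftest.py -- untested sketch of asserting on relative results in a hook
def pytest_benchmark_update_json(config, benchmarks, output_json):
    # output_json["benchmarks"] is a list of dicts with "name", "group"
    # and a "stats" mapping (median, mean, ...) for each benchmark.
    medians = {b["name"]: b["stats"]["median"] for b in output_json["benchmarks"]}
    if "test_new_api" in medians and "test_old_api" in medians:
        assert medians["test_new_api"] < 2 * medians["test_old_api"]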

@sarahbx

sarahbx commented Apr 2, 2021

Hi @ionelmc, I have a use case for this: a long-running test with multiple stages that I would like to benchmark individually. With the current behavior, getting the necessary data points means running the test multiple times, benchmarking only one stage at a time. This can significantly increase overall testing time, since each run needs its own setup and teardown. My initial thought is that pedantic mode could be expanded with whatever additional arguments are required to facilitate this. Thoughts?

EDIT: what if target could take a list? E.g.:

def test_the_thing(fixture):
  def setup(): ...
  def stage1(args): ...
  def stage2(args): ...
  trigger_external_async_process()  # Call not included in benchmark
  benchmark.pedantic(target=[stage1, stage2], setup=setup, rounds=1, ...)
...

@lpsinger

I really love pytest-benchmark, but I am also in a situation where my use case requires multiple benchmarks per test case in order to avoid unreasonable setup/teardown time.

I am benchmarking some software that involves setting up and tearing down a database, and my tests are parametrized by the number of sample rows so that I can measure and plot the scaling of the code and compare it against the expected big-O behavior. The database gets populated with random data, but repeatedly setting it up and tearing it down is expensive. What I would like to do is put the benchmark inside a for-loop that adds more random data to the database on each iteration.
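
Roughly, the pattern I would like to write is something like the following (not currently possible, since the benchmark fixture can only be used once per test; the helpers are placeholders):

def test_query_scaling(database, benchmark):
    for n_rows in (10, 100, 1000, 10000):
        add_random_rows(database, n_rows)   # grow the same database in place
        benchmark(run_query, database)      # would need a fresh benchmark each time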

@cafhach

cafhach commented Jul 5, 2022

my use case requires multiple benchmarks per test case in order to avoid unreasonable setup/teardown time.

Could you alternatively solve this by reusing a fixture (e.g. module scope)?
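
Concretely, something along these lines might cover it (a sketch only; the database helpers are placeholders):

import pytest

@pytest.fixture(scope="module")
def database():
    # Expensive setup/teardown happens once per module instead of once per test.
    db = create_test_database()   # placeholder helper
    yield db
    db.drop()                     # placeholder teardown

@pytest.mark.parametrize("n_rows", [10, 100, 1000, 10000])
def test_query_scaling(database, benchmark, n_rows):
    add_random_rows(database, n_rows)   # placeholder helper
    benchmark(run_query, database)      # one benchmark per parametrized test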
